Uploaded image for project: 'Zeppelin'
  1. Zeppelin
  2. ZEPPELIN-684

DataMining - create Notebooks \w example of analytics for public datasets

    Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None
    • Labels:

      Description

      The number of public datasets is growing every day and Zeppelin is a natural choice for exploratory data analytics using one of the tools from Apache BigData ecosystem.

      The goal of this project is to build set of "data products" - notebooks, using different backend systems (Flink, Spark, etc) that can be used as a library of examples, showcasing zeppelin's interactive visualizations, rich GUI though Helium as well as data exploration capabilities.

      This will include developing practical skills in using major tools from BigData eco-system through various Apache projects.

      For this particular project communication with users and engagement in the community to demonstrate the results of your work would be a very important requirement.

        Issue Links

          Activity

          Hide
          bzz Alexander Bezzubov added a comment -

          It's great to see such an interest in this project. Thank you for expressing it here and on public mailing list!

          Hope you guys have submitted the proposals before the official deadline today!

          Show
          bzz Alexander Bezzubov added a comment - It's great to see such an interest in this project. Thank you for expressing it here and on public mailing list! Hope you guys have submitted the proposals before the official deadline today!
          Hide
          Gaspard Baye added a comment - - edited

          Hi,
          I am a BSc student in ICT schooling at ICT-University USA, Cameroon campus, Africa and had some experience with Machine learning libs , data mining and really dedicated to solve many data problems in africa.I love data mining and love to contribute in open source technology.I really wish to go further in that.Please I wish to be part in GSOCs 2016.

          Thanks
          Baye gaspard

          Show
          Gaspard Baye added a comment - - edited Hi, I am a BSc student in ICT schooling at ICT-University USA, Cameroon campus, Africa and had some experience with Machine learning libs , data mining and really dedicated to solve many data problems in africa.I love data mining and love to contribute in open source technology.I really wish to go further in that.Please I wish to be part in GSOCs 2016. Thanks Baye gaspard
          Hide
          fan_wcting fan wcting added a comment - - edited

          Hello ,all
          I am a 2rd undergraduate from South China Agricultural University , majoring in Information and Computing Scineces (department of mathematics). I think it might be too late now. Because I didn't know the project until March 22. But I think life is always full of hope and opportunity.

          I'm interested in data mining.I had use java to obtain and analyse data for the competition---“China Undergraduate Mathematical Contest in Modeling”.Had use python numpy and matplotlib module to do the practice of the book “machine learning in action”.

          Besides , in order to update their equipment.I had completed commercial projects for outsourcing. So I'm familiar with backend and easy to learn new things.

          Although I'm decided to take participate in this project next year,if you can give me chance this time, I will be happy and try my best to do it.
          Thanks,
          Fan

          Show
          fan_wcting fan wcting added a comment - - edited Hello ,all I am a 2rd undergraduate from South China Agricultural University , majoring in Information and Computing Scineces (department of mathematics). I think it might be too late now. Because I didn't know the project until March 22. But I think life is always full of hope and opportunity. I'm interested in data mining.I had use java to obtain and analyse data for the competition---“China Undergraduate Mathematical Contest in Modeling”.Had use python numpy and matplotlib module to do the practice of the book “machine learning in action”. Besides , in order to update their equipment.I had completed commercial projects for outsourcing. So I'm familiar with backend and easy to learn new things. Although I'm decided to take participate in this project next year,if you can give me chance this time, I will be happy and try my best to do it. Thanks, Fan
          Hide
          irfan_amu MD IRFAN added a comment -

          Hi
          Alexander Bezzubov
          I am student of B.Tech 3rd year of CSE from Zakir husain college of engineering and technology, Aligarh Muslim University,India . I make a some project related to data base and have good understanding of data mining and Big data.I have great interest in this project if i got a chance work with you on this project i would give you my 100%.I request to be guided me further more study related to the current issue so the i will get to full prepare to this project..

          Thank You
          MD IRFAN

          Show
          irfan_amu MD IRFAN added a comment - Hi Alexander Bezzubov I am student of B.Tech 3rd year of CSE from Zakir husain college of engineering and technology, Aligarh Muslim University,India . I make a some project related to data base and have good understanding of data mining and Big data.I have great interest in this project if i got a chance work with you on this project i would give you my 100%.I request to be guided me further more study related to the current issue so the i will get to full prepare to this project.. Thank You MD IRFAN
          Hide
          sbfirodiya Shivani Firodiya added a comment -

          Hello Alexander,
          I am an undergraduate from Pune Institute of Computer Technology, Pune, India pursuing Computer Science Degree.I have some experience in Machine Learning and on some python libraries viz. scikit, pandas. I have made some projects in Spark . I am currently working on Deep Learning. I would love to participate in this project for GSOC 2016.
          I created a Zeppelin Notebook on Movie Recommendation. Here's the link: https://github.com/thumblas/Zeppelin-Notebook . I would be happy to know whether i am on the right track.

          Show
          sbfirodiya Shivani Firodiya added a comment - Hello Alexander, I am an undergraduate from Pune Institute of Computer Technology, Pune, India pursuing Computer Science Degree.I have some experience in Machine Learning and on some python libraries viz. scikit, pandas. I have made some projects in Spark . I am currently working on Deep Learning. I would love to participate in this project for GSOC 2016. I created a Zeppelin Notebook on Movie Recommendation. Here's the link: https://github.com/thumblas/Zeppelin-Notebook . I would be happy to know whether i am on the right track.
          Hide
          ray0301 wei added a comment -

          Hi all,
          I am a graduate student pursuing information systems degree for Big Data and Predictive Analytics Track at Washington university in St.louis. I am a big fun of data mining like you guys. I am interested in using python numpy and matplotlib module to analysis data with machine learning methods. So I am very interested in the project. It's pretty happy to learn about different kinds of technology to solve the real problem and know you guys. Welcome to contact me if you guys want to discuss the related problems to the project.

          Thanks,
          Ray

          Show
          ray0301 wei added a comment - Hi all, I am a graduate student pursuing information systems degree for Big Data and Predictive Analytics Track at Washington university in St.louis. I am a big fun of data mining like you guys. I am interested in using python numpy and matplotlib module to analysis data with machine learning methods. So I am very interested in the project. It's pretty happy to learn about different kinds of technology to solve the real problem and know you guys. Welcome to contact me if you guys want to discuss the related problems to the project. Thanks, Ray
          Hide
          bzz Alexander Bezzubov added a comment - - edited

          Anuj Godase Visitha Manujaya Freddie Vargus Thamali Wijewardhana ANISH SINGH thank you guys for your interest!

          First thing to do would be to subscribe to project mailing lists and read carefully the material I shared with ANISH SINGH http://markmail.org/thread/naocktanol5iuot3

          Next steps would be - engage in community and then, at some point, start a thread there with Subj: "[GSoC2016] ZEPPELIN-684 proposal ...." with the draft of your particular way of attacking this problem, using the advices from the thread mentioned above.

          Please mind that there is an application deadline for submitting final version of your proposal.

          Show
          bzz Alexander Bezzubov added a comment - - edited Anuj Godase Visitha Manujaya Freddie Vargus Thamali Wijewardhana ANISH SINGH thank you guys for your interest! First thing to do would be to subscribe to project mailing lists and read carefully the material I shared with ANISH SINGH http://markmail.org/thread/naocktanol5iuot3 Next steps would be - engage in community and then, at some point, start a thread there with Subj: " [GSoC2016] ZEPPELIN-684 proposal ...." with the draft of your particular way of attacking this problem, using the advices from the thread mentioned above. Please mind that there is an application deadline for submitting final version of your proposal.
          Hide
          1byxero Anuj Godase added a comment -

          Hello Alexander,
          I am junior year student pursuing Computer Science degree at Pune Institute of Computer Technology,Pune,India. I do not have any prior experience but am deeply interested in big-data, data science and analytics and this project being a huge opportunity for learning the things that interest me, I would like to work on this project. Depending on my past experiences of learning new things, I will not require much time to get acquainted with the technologies that are being used in the project. I would like to request guidance for getting acquainted with the project.

          Thank you,
          Anuj Godase.

          Show
          1byxero Anuj Godase added a comment - Hello Alexander, I am junior year student pursuing Computer Science degree at Pune Institute of Computer Technology,Pune,India. I do not have any prior experience but am deeply interested in big-data, data science and analytics and this project being a huge opportunity for learning the things that interest me, I would like to work on this project. Depending on my past experiences of learning new things, I will not require much time to get acquainted with the technologies that are being used in the project. I would like to request guidance for getting acquainted with the project. Thank you, Anuj Godase.
          Hide
          Visitha Visitha Manujaya added a comment -

          Hi Alexander,
          I am a 3r year under graduate in the department of Computer Science and Engineering at University of Moratuwa,Sri Lanka. I am interested in this project as I have no prior experiences in data mining, But I have worked with Apache tomcat (have created custom Valves and Realms) and it was very interesting. So i would like to work on another product of Apache which has many things to learn. I think this project is a great opportunity for that. Could u please provide me with more information about the project as well as resources to get an approach to product technology.

          Thanks
          Visitha

          Show
          Visitha Visitha Manujaya added a comment - Hi Alexander, I am a 3r year under graduate in the department of Computer Science and Engineering at University of Moratuwa,Sri Lanka. I am interested in this project as I have no prior experiences in data mining, But I have worked with Apache tomcat (have created custom Valves and Realms) and it was very interesting. So i would like to work on another product of Apache which has many things to learn. I think this project is a great opportunity for that. Could u please provide me with more information about the project as well as resources to get an approach to product technology. Thanks Visitha
          Hide
          FreddieV4 Freddie Vargus added a comment - - edited

          Hi Alexander,

          I'm currently a sophomore at Boston University, majoring in Computer Science. I've been interested in data mining for a while and have some experience with web scraping/web crawling and would love to participate in this project for GSOC 2016!

          I'd also like to use this opportunity to learn more about the ASF and become more engaged with the ASF community.

          Could you tell me how I could find out more or get involved in this project?

          Thank you!
          Freddie

          Show
          FreddieV4 Freddie Vargus added a comment - - edited Hi Alexander, I'm currently a sophomore at Boston University, majoring in Computer Science. I've been interested in data mining for a while and have some experience with web scraping/web crawling and would love to participate in this project for GSOC 2016! I'd also like to use this opportunity to learn more about the ASF and become more engaged with the ASF community. Could you tell me how I could find out more or get involved in this project? Thank you! Freddie
          Hide
          ThamaliW Thamali Wijewardhana added a comment - - edited

          Hi Alexander,
          I am a Computer Science and Engineering undergraduate at University of Moratuwa,Sri Lanka. I am interested in Apache and data mining. I have worked a lot with Apache Spark ml library when doing machine learning projects during my internship. I am interested in this project and I would like to proceed with this project in GSOC 2016. Please kindly give me further information on how I could proceed.

          Thanks
          Thamali

          Show
          ThamaliW Thamali Wijewardhana added a comment - - edited Hi Alexander, I am a Computer Science and Engineering undergraduate at University of Moratuwa,Sri Lanka. I am interested in Apache and data mining. I have worked a lot with Apache Spark ml library when doing machine learning projects during my internship. I am interested in this project and I would like to proceed with this project in GSOC 2016. Please kindly give me further information on how I could proceed. Thanks Thamali
          Hide
          anishsingh ANISH SINGH added a comment -

          Hello Alexander,
          I'm Anish, a sophomore CSE student. Last winter, I decided to develop a share price prediction program through Apache Spark's ml libraries and in the process extensively studied Apache Spark and its structure and working. I would like to take up the project for the Summer of 2016. I request to be guided further to study more about the current issues in the project so that I may be able to draw up my proposal.
          ThankYou.

          Show
          anishsingh ANISH SINGH added a comment - Hello Alexander, I'm Anish, a sophomore CSE student. Last winter, I decided to develop a share price prediction program through Apache Spark's ml libraries and in the process extensively studied Apache Spark and its structure and working. I would like to take up the project for the Summer of 2016. I request to be guided further to study more about the current issues in the project so that I may be able to draw up my proposal. ThankYou.

            People

            • Assignee:
              Unassigned
              Reporter:
              bzz Alexander Bezzubov
            • Votes:
              0 Vote for this issue
              Watchers:
              22 Start watching this issue

              Dates

              • Created:
                Updated:

                Development