  1. Hadoop Common
  2. HADOOP-4304

Add Dumbo to contrib

Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Minor
    • Resolution: Later
    • Dumbo (a project that aims to make Hadoop Streaming as easy as possible) added to contrib.

    Description

      Originally, Dumbo was a simple Python module developed at Last.fm to make writing and running Hadoop Streaming programs very easy, but it now also includes some (until now unreleased) Java helper code, although it can still be used without the Java code. We propose adding Dumbo to "src/contrib" so that the Java classes get built and installed together with the rest of Hadoop, while the Python module can be installed separately at will. A tar.gz of the directory that would have to be added to "src/contrib" is available at

      http://static.last.fm/dumbo/dumbo-contrib.tar.gz

      and more info about Dumbo can be found here:

      For some of the more advanced features of Dumbo (in particular the ones for which the Java classes are needed) there is no public documentation yet, but we could easily fill that gap by moving some of the internal Last.fm documentation to the Hadoop wiki.
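
      For readers unfamiliar with the kind of program this targets, the following is a minimal sketch of a word count written as a mapper/reducer pair in plain Python. It does not use the Dumbo module or its API; the names mapper, reducer, and run_locally are illustrative assumptions, and run_locally only simulates the sort-and-group step that Hadoop would perform between the two phases.

```python
from itertools import groupby
from operator import itemgetter


def mapper(key, value):
    # key: position of the line in the input (unused here); value: the line text.
    for word in value.split():
        yield word, 1


def reducer(key, values):
    # values: iterable of all counts emitted for this word.
    yield key, sum(values)


def run_locally(lines):
    # Hypothetical local stand-in for the map, shuffle/sort, and reduce phases.
    mapped = [pair
              for offset, line in enumerate(lines)
              for pair in mapper(offset, line)]
    mapped.sort(key=itemgetter(0))  # Group identical keys together, as the shuffle would.
    output = []
    for key, group in groupby(mapped, key=itemgetter(0)):
        output.extend(reducer(key, (count for _, count in group)))
    return output


if __name__ == "__main__":
    for word, count in run_locally(["a b a", "b c"]):
        print(word, count)
```

      Writing the two phases as generator functions keeps them free of any input/output handling, which is the part a helper layer on top of Hadoop Streaming would be expected to take care of.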

      Attachments

        1. hadoop-4304-v3.patch
          49 kB
          Klaas Bosteels
        2. hadoop-4304-v2.patch
          45 kB
          Klaas Bosteels
        3. hadoop-4304.patch
          43 kB
          Klaas Bosteels

          People

            Assignee: Klaas Bosteels (klbostee)
            Reporter: Klaas Bosteels (klbostee)
            Votes: 0
            Watchers: 10