Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-326

The lowest level map-reduce APIs should be byte oriented

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      As discussed here:
      https://issues.apache.org/jira/browse/HADOOP-1986#action_12551237

      The templates, serializers and other complexities that allow map-reduce to use arbitrary types complicate the design and lead to lots of object creates and other overhead that a byte oriented design would not suffer. I believe the lowest level implementation of hadoop map-reduce should have byte string oriented APIs (for keys and values). This API would be more performant, simpler and more easily cross language.

      The existing API could be maintained as a thin layer on top of the leaner API.

      Attachments

        1. MAPREDUCE-326-api.patch
          13 kB
          Thomas White
        2. MAPREDUCE-326.pdf
          78 kB
          Thomas White

        Issue Links

          Activity

            People

              Unassigned Unassigned
              eric14 Eric Baldeschwieler
              Votes:
              2 Vote for this issue
              Watchers:
              38 Start watching this issue

              Dates

                Created:
                Updated: