Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-52

KeyFieldBasedPartitioner would lost data if specifed field not exist, and it should encode free not only support utf8

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      1) Currently, KeyFieldBasedPartitioner only support utf8 encoded recored, we should use text or byteswriteable data types.

      2) when using KeyFieldBasedPartitioner, if the record doesn't contain the specified field, the endChar would equal with array.length, which throw ArrayOutOfIndex exception, losting that record!

      Attachments

        1. HADOOP-5779-v1.0.patch.patch
          5 kB
          Amar Kamat
        2. HADOOP-5779-partial.patch
          3 kB
          Amar Kamat
        3. encode-free-KeyFieldBasedPartitioner-v1.patch
          2 kB
          ZhuGuanyin
        4. encode-free-KeyFieldBasedPartitioner.patch
          2 kB
          ZhuGuanyin

        Issue Links

          Activity

            People

              Unassigned Unassigned
              buptzhugy ZhuGuanyin
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated: