Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-5395

Update Teragen algorithm

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 0.23.7
    • None
    • examples
    • None

    Description

      The Teragen algorithm is no longer up to date with the sortbenchmark.org gensort tool used for the official sort benchmark. The new algorithm is supposed to generate data that isn't very compressible.

      Also the new version of gensort can generate skewed data so we should add that option to teragen also.

      Attachments

        Activity

          People

            Unassigned Unassigned
            tgraves Thomas Graves
            Votes:
            0 Vote for this issue
            Watchers:
            7 Start watching this issue

            Dates

              Created:
              Updated: