Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-1977

[Rumen] Do Not Store CDFs in Output JSON Files

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • tools/rumen
    • None

    Description

      Per-Job Cumulative Distribution Functions (CDFs) stored in JSON files emitted by Rumen are redundant and waste space (~30%).
      They can easily be re-computed upon loading or just-in-time upon request.

      Attachments

        Activity

          People

            amar_kamat Amar Kamat
            ranjit Ranjit Mathew
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: