Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-2853

Add "teraread" example

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 0.23.0
    • None
    • benchmarks, examples
    • None

    Description

      Teragen is a good benchmark of raw DFS write throughput. Terasort is a good benchmark of the whole MR system (input, shuffle, output). I've added a simple "teraread" example which reads through the terasort input data without performing any processing: this acts as a good benchmark of a read-only workload (similar to real-life "find a needle in a haystack" MR jobs)

      Attachments

        1. mapreduce-2853.txt
          5 kB
          Todd Lipcon

        Activity

          People

            tlipcon Todd Lipcon
            tlipcon Todd Lipcon
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated: