Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-1295

We need a job trace manipulator to build gridmix runs.

    Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.21.0
    • Component/s: tools/rumen
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      Rumen produces "job traces", which are JSON format files describing important aspects of all jobs that are run [successfully or not] on a hadoop map/reduce cluster. There are two packages under development that will consume these trace files and produce actions in that cluster or another cluster: gridmix3 [see jira MAPREDUCE-1124 ] and Mumak [a simulator -- see MAPREDUCE-728 ].

      It would be useful to be able to do two things with job traces, so we can run experiments using these two tools: change the duration, and change the density. I would like to provide a "folder", a tool that can wrap a long-duration execution trace to redistribute its jobs over a shorter interval, and also change the density by duplicating or culling away jobs from the folded combined job trace.

        Attachments

        1. mapreduce-1295--2009-12-23.patch
          297 kB
          Dick King
        2. mapreduce-1295--2009-12-22.patch
          298 kB
          Dick King
        3. mapreduce-1295--2009-12-21.patch
          298 kB
          Dick King
        4. mapreduce-1295--2009-12-17.patch
          299 kB
          Dick King
        5. mapreduce-1297--2009-12-14.patch
          301 kB
          Dick King

          Activity

            People

            • Assignee:
              dking Dick King
              Reporter:
              dking Dick King
            • Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: