Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-4749

reducer should output input data size when shuffling is done

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.19.0
    • 0.20.0
    • None
    • None
    • Reviewed
    • Added a new counter REDUCE_INPUT_BYTES.

    Description

      Sometimes we see a single slow reducer because of the load balancing problem. This information will be very useful to understand how imbalanced the load is.

      Should be easy to fix I guess, since reducer should have all information needed at the end of the shuffling phase.

      Attachments

        1. 4749.patch
          2 kB
          He Yongqiang

        Issue Links

          Activity

            People

              he yongqiang He Yongqiang
              zshao Zheng Shao
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: