Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-17469 IOStatistics Phase II
  3. HADOOP-18374

DistCP: Aggregate IOStatistics Counters in MapReduce Counters

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 3.3.5
    • None
    • tools/distcp
    • None

    Description

      Distcp can collect IOStatisticsContext counter values and report them to the console. it can't do the timings in min/mean/max though, as there's no way to aggregate them properly.

      1. Publish statistics to MapReduce counters in the tasks within CopyMapper.copyFileWithRetry().
      2. The counters will be automatically logged in Job.monitorAndPrintJob() when DistCp is executed with the -verbose option; no need for changes there.
      3. We could also publish the iOStatistic means by publishing sample count and total sum as two separate counters
      4. In AbstractContractDistCpTest, add an override point for subclasses to list which metrics they will issue; assert that values are generated.

      Attachments

        Activity

          People

            mehakmeetSingh Mehakmeet Singh
            stevel@apache.org Steve Loughran
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: