Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-2052

distcp mapper's status report misleading

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Duplicate
    • None
    • 0.15.0
    • None
    • None

    Description

      When the mappers of distcp finish, the status page in the web gui reports the data copied.
      However, the reported number is far away from the real number, which is very misleading.
      For example, a particular mapper task_200710131713_0001_m_000000_0 reported:

      Finished. Bytes copied: 4.3g

      However, it does not say which file.
      I thought it was for part-00000. But the file size of part-00000
      is about 9GB.

      It will be much clearer if the status report say something like:

      Finished copy file-xxxx: 4.3g
      That way, I can easily check whether the size is correct.

      Attachments

        Issue Links

          Activity

            People

              cdouglas Christopher Douglas
              runping Runping Qi
              Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: