Hadoop Common / HADOOP-2052

distcp mapper's status report misleading

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: None
    • Fix Version/s: 0.15.0
    • Component/s: None
    • Labels: None

      Description

      When a distcp mapper finishes, the status page in the web GUI reports the amount of data copied.
      However, the reported number is far from the actual figure, which is very misleading.
      For example, a particular mapper task_200710131713_0001_m_000000_0 reported:

      Finished. Bytes copied: 4.3g

      However, it does not say which file was copied.
      I assumed it was part-00000, but that file is about 9 GB.

      It would be much clearer if the status report said something like:

      Finished copy file-xxxx: 4.3g
      That way, I can easily check whether the size is correct.
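
The requested message format can be sketched as a small helper. This is only an illustration of the proposal, not the distcp implementation: the class `CopyStatus` and its methods `humanReadable` and `finished` are hypothetical names, and the size suffixes assume 1024-based units to match the "4.3g" style shown above. In a real mapper the resulting string would presumably be passed to whatever status call the task already uses.

```java
import java.util.Locale;

// Hypothetical helper (not part of Hadoop): builds a per-file status line
// such as "Finished copy part-00000: 4.3g".
final class CopyStatus {
    // Assumed suffixes, 1024-based, mirroring the "4.3g" style in the report.
    private static final String[] UNITS = {"", "k", "m", "g", "t"};

    // Formats a byte count with one decimal place and a unit suffix.
    static String humanReadable(long bytes) {
        if (bytes < 0) {
            throw new IllegalArgumentException("bytes must be non-negative");
        }
        double value = bytes;
        int unit = 0;
        while (value >= 1024 && unit < UNITS.length - 1) {
            value /= 1024;
            unit++;
        }
        if (unit == 0) {
            return Long.toString(bytes); // plain byte count, no suffix
        }
        return String.format(Locale.ROOT, "%.1f%s", value, UNITS[unit]);
    }

    // Names the file alongside the size so the two can be checked together.
    static String finished(String fileName, long bytesCopied) {
        return "Finished copy " + fileName + ": " + humanReadable(bytesCopied);
    }

    public static void main(String[] args) {
        // prints "Finished copy part-00000: 4.3g"
        System.out.println(finished("part-00000", 4617089843L));
    }
}
```

With the file name in the message, a reader can compare the reported size directly against the size of that file on the source file system.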

              People

              • Assignee: Christopher Douglas (cdouglas)
              • Reporter: Runping Qi (runping)