Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-5353

add progress callback feature to the slow FileUtil operations with ability to cancel the work

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Patch Available
    • Minor
    • Resolution: Unresolved
    • 0.21.0
    • None
    • fs
    • None

    Description

      This is something only of relevance of people doing front ends to FS operations, and as they could take the code in FSUtil and add something with this feature, its a blocker to none of them.

      Current FileUtil.copy can take a long time to move large files around, but there is no progress indicator to GUIs, or a way to cancel the operation mid-way, j interrupting the thread or closing the filesystem.

      I propose a FileIOProgress interface to the copy ops, one that had a single method to notify listeners of bytes read and written, and the number of files handled.

      
      

      interface FileIOProgress {
      boolean progress(int files, long bytesRead, long bytesWritten);
      }

      The return value would be true to continue the operation, or false to stop the copy and leave the FS in whatever incomplete state it is in currently.

      it could even be fancier: have beginFileOperation and endFileOperation callbacks to pass in the name of the current file being worked on, though I don't have a personal need for that.

      GUIs could show progress bars and cancel buttons, other tools could use the interface to pass any cancellation notice upstream.

      The FileUtil.copy operations would call this interface (blocking) after every block copy, so the frequency of invocation would depend on block size and network/disk speeds. Which is also why I don't propose having any percentage done indicators; it's too hard to predict percentage of time done for distributed file IO with any degree of accuracy.

      Attachments

        1. HADOOP-5353.002.patch
          38 kB
          Pranav Prakash
        2. HADOOP-5353.001.patch
          33 kB
          Pranav Prakash
        3. HADOOP-5353.000.patch
          5 kB
          Lei (Eddy) Xu

        Activity

          People

            pranavprakash Pranav Prakash
            stevel@apache.org Steve Loughran
            Votes:
            0 Vote for this issue
            Watchers:
            10 Start watching this issue

            Dates

              Created:
              Updated: