Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-15880

WASB doesn't honor fs.trash.interval and this fails to auto purge trash folder

Add voteVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Minor
    • Resolution: Unresolved
    • 2.7.3
    • None
    • documentation, fs/azure
    • Any HDInsigth cluster pointing to WASB. 

    Description

      when "fs.trash.interval" is set to a value,  trash for the local hdfs got cleared where as the trash folder on WASB doesn't get deleted and the files get piled up on WASB store..

      WASB doesn't pick up  fs.trash.interval value and this fails to auto purge trash folder on WASB store.

       

      Issue : WASB doesn't honor fs.trash.interval and this fails to auto purge trash folder

      Steps to reproduce Scenario:

      Delete any file stored on HDFS

      hdfs dfs -D "fs.default.name=hdfs://mycluster/" -rm /hivestore.txt
      18/10/23 06:18:05 INFO fs.TrashPolicyDefault: Moved: 'hdfs://mycluster/hivestore.txt' to trash at: hdfs://mycluster/user/sshuser/.Trash/Current/hivestore.txt

      When deleted the file is moved to trash folder
      hdfs dfs -rm wasb:///hivestore.txt
      18/10/23 06:19:13 INFO fs.TrashPolicyDefault: Moved: 'wasb://kcspark-2018-10-18t17-07-40-524z@kcdnsproxy.blob.core.windows.net/hivestore.txt' to trash at: wasb://kcspark-2018-10-18t17-07-40-524z@kcdnsproxy.blob.core.windows.net/user/sshuser/.Trash/Current/hivestore.txt

      Reduced the fs.trash.interval from 360 to 1 and restarted all related services.

      Trash for the local hdfs gets cleared honoring the "fs.trash.interval" value.

      hdfs dfs -D "fs.default.name=hdfs://mycluster/" -ls hdfs://mycluster/user/sshuser/.Trash/Current/
      ls: File hdfs://mycluster/user/sshuser/.Trash/Current does not exist.

      Where as the trash for WASB doesn't get cleared.

      hdfs dfs -ls wasb://kcspark-2018-10-18t17-07-40-524z@kcdnsproxy.blob.core.windows.net/user/sshuser/.Trash/Current/
      Found 1 items
      rw-rr- 1 sshuser supergroup 1084 2018-10-23 06:19 wasb://kcspark-2018-10-18t17-07-40-524z@kcdnsproxy.blob.core.windows.net/user/sshuser/.Trash/Current/hivestore.txt

       

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned
            Sunilkc Sunil Kumar Chakrapani

            Dates

              Created:
              Updated:

              Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 20m
                20m

                Slack

                  Issue deployment