Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-10735

Distcp using webhdfs on secure HA clusters fails with StandbyException

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.7.2
    • Fix Version/s: 2.8.0, 3.0.0-alpha2
    • Component/s: webhdfs
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      A distcp running on HADOOP 2.7.2 cluster , copying from HADOOP 2.4.1 cluster fails.
      Both are secure clusters.
      The logs indicates that task contacts the StandBy name node on the remote cluster i first and gets a standByException.
      Instead of retrying with the active name node on the remote cluster, the execution errors out throwing InvalidToken

      The exception is attached

      1. HDFS-10735-001.patch
        1 kB
        Benoy Antony
      2. exception.txt
        2 kB
        Benoy Antony

        Activity

        Hide
        benoyantony Benoy Antony added a comment -

        Attaching the patch.
        The patch is tested on our clusters.

        Show
        benoyantony Benoy Antony added a comment - Attaching the patch. The patch is tested on our clusters.
        Hide
        hadoopqa Hadoop QA added a comment -
        -1 overall



        Vote Subsystem Runtime Comment
        0 reexec 0m 11s Docker mode activated.
        +1 @author 0m 0s The patch does not contain any @author tags.
        -1 test4tests 0m 0s The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch.
        +1 mvninstall 8m 10s trunk passed
        +1 compile 0m 34s trunk passed
        +1 checkstyle 0m 15s trunk passed
        +1 mvnsite 0m 37s trunk passed
        +1 mvneclipse 0m 12s trunk passed
        +1 findbugs 1m 32s trunk passed
        +1 javadoc 0m 21s trunk passed
        +1 mvninstall 0m 34s the patch passed
        +1 compile 0m 29s the patch passed
        +1 javac 0m 29s the patch passed
        +1 checkstyle 0m 13s the patch passed
        +1 mvnsite 0m 35s the patch passed
        +1 mvneclipse 0m 9s the patch passed
        +1 whitespace 0m 0s The patch has no whitespace issues.
        +1 findbugs 1m 37s the patch passed
        +1 javadoc 0m 19s the patch passed
        +1 unit 1m 0s hadoop-hdfs-client in the patch passed.
        +1 asflicense 0m 18s The patch does not generate ASF License warnings.
        18m 23s



        Subsystem Report/Notes
        Docker Image:yetus/hadoop:9560f25
        JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12822711/HDFS-10735-001.patch
        JIRA Issue HDFS-10735
        Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
        uname Linux c59464009f0f 3.13.0-92-generic #139-Ubuntu SMP Tue Jun 28 20:42:26 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux
        Build tool maven
        Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
        git revision trunk / 0ad48aa
        Default Java 1.8.0_101
        findbugs v3.0.0
        Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/16352/testReport/
        modules C: hadoop-hdfs-project/hadoop-hdfs-client U: hadoop-hdfs-project/hadoop-hdfs-client
        Console output https://builds.apache.org/job/PreCommit-HDFS-Build/16352/console
        Powered by Apache Yetus 0.4.0-SNAPSHOT http://yetus.apache.org

        This message was automatically generated.

        Show
        hadoopqa Hadoop QA added a comment - -1 overall Vote Subsystem Runtime Comment 0 reexec 0m 11s Docker mode activated. +1 @author 0m 0s The patch does not contain any @author tags. -1 test4tests 0m 0s The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. +1 mvninstall 8m 10s trunk passed +1 compile 0m 34s trunk passed +1 checkstyle 0m 15s trunk passed +1 mvnsite 0m 37s trunk passed +1 mvneclipse 0m 12s trunk passed +1 findbugs 1m 32s trunk passed +1 javadoc 0m 21s trunk passed +1 mvninstall 0m 34s the patch passed +1 compile 0m 29s the patch passed +1 javac 0m 29s the patch passed +1 checkstyle 0m 13s the patch passed +1 mvnsite 0m 35s the patch passed +1 mvneclipse 0m 9s the patch passed +1 whitespace 0m 0s The patch has no whitespace issues. +1 findbugs 1m 37s the patch passed +1 javadoc 0m 19s the patch passed +1 unit 1m 0s hadoop-hdfs-client in the patch passed. +1 asflicense 0m 18s The patch does not generate ASF License warnings. 18m 23s Subsystem Report/Notes Docker Image:yetus/hadoop:9560f25 JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12822711/HDFS-10735-001.patch JIRA Issue HDFS-10735 Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle uname Linux c59464009f0f 3.13.0-92-generic #139-Ubuntu SMP Tue Jun 28 20:42:26 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux Build tool maven Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh git revision trunk / 0ad48aa Default Java 1.8.0_101 findbugs v3.0.0 Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/16352/testReport/ modules C: hadoop-hdfs-project/hadoop-hdfs-client U: hadoop-hdfs-project/hadoop-hdfs-client Console output https://builds.apache.org/job/PreCommit-HDFS-Build/16352/console Powered by Apache Yetus 0.4.0-SNAPSHOT http://yetus.apache.org This message was automatically generated.
        Hide
        arpitagarwal Arpit Agarwal added a comment -

        Hi Benoy Antony, the change looks fine to me. It will trigger the failover condition in FailoverOnNetworkExceptionRetry policy.

        Does checking that the wrapped exception is StandbyException work, instead of re.getMessage().endsWith? You probably tried it already.

        Show
        arpitagarwal Arpit Agarwal added a comment - Hi Benoy Antony , the change looks fine to me. It will trigger the failover condition in FailoverOnNetworkExceptionRetry policy. Does checking that the wrapped exception is StandbyException work, instead of re.getMessage().endsWith ? You probably tried it already.
        Hide
        arpitagarwal Arpit Agarwal added a comment -

        Does checking that the wrapped exception is StandbyException work,

        So that is not going to work since the wrapped exception is not StandbyException.

        +1 for the fix. Can you please look into adding a unit test if it's feasible?

        Show
        arpitagarwal Arpit Agarwal added a comment - Does checking that the wrapped exception is StandbyException work, So that is not going to work since the wrapped exception is not StandbyException. +1 for the fix. Can you please look into adding a unit test if it's feasible?
        Hide
        benoyantony Benoy Antony added a comment -

        Arpit Agarwal, Thanks for the review. Since the issue happens when interacting with an older version of HDFS, it will be difficult to test.
        If there are no further comments, I'll commit this in a couple of days.

        Show
        benoyantony Benoy Antony added a comment - Arpit Agarwal , Thanks for the review. Since the issue happens when interacting with an older version of HDFS, it will be difficult to test. If there are no further comments, I'll commit this in a couple of days.
        Hide
        hadoopqa Hadoop QA added a comment -
        -1 overall



        Vote Subsystem Runtime Comment
        0 reexec 0m 21s Docker mode activated.
        +1 @author 0m 0s The patch does not contain any @author tags.
        -1 test4tests 0m 0s The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch.
        +1 mvninstall 7m 11s trunk passed
        +1 compile 0m 30s trunk passed
        +1 checkstyle 0m 15s trunk passed
        +1 mvnsite 0m 34s trunk passed
        +1 mvneclipse 0m 12s trunk passed
        +1 findbugs 1m 24s trunk passed
        +1 javadoc 0m 19s trunk passed
        +1 mvninstall 0m 29s the patch passed
        +1 compile 0m 27s the patch passed
        +1 javac 0m 27s the patch passed
        +1 checkstyle 0m 12s the patch passed
        +1 mvnsite 0m 31s the patch passed
        +1 mvneclipse 0m 9s the patch passed
        +1 whitespace 0m 0s The patch has no whitespace issues.
        +1 findbugs 1m 30s the patch passed
        +1 javadoc 0m 16s the patch passed
        +1 unit 0m 52s hadoop-hdfs-client in the patch passed.
        +1 asflicense 0m 18s The patch does not generate ASF License warnings.
        16m 39s



        Subsystem Report/Notes
        Docker Image:yetus/hadoop:9560f25
        JIRA Issue HDFS-10735
        JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12822711/HDFS-10735-001.patch
        Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
        uname Linux 585ce7383ddb 3.13.0-96-generic #143-Ubuntu SMP Mon Aug 29 20:15:20 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux
        Build tool maven
        Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
        git revision trunk / dbe663d
        Default Java 1.8.0_101
        findbugs v3.0.0
        Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/17160/testReport/
        modules C: hadoop-hdfs-project/hadoop-hdfs-client U: hadoop-hdfs-project/hadoop-hdfs-client
        Console output https://builds.apache.org/job/PreCommit-HDFS-Build/17160/console
        Powered by Apache Yetus 0.4.0-SNAPSHOT http://yetus.apache.org

        This message was automatically generated.

        Show
        hadoopqa Hadoop QA added a comment - -1 overall Vote Subsystem Runtime Comment 0 reexec 0m 21s Docker mode activated. +1 @author 0m 0s The patch does not contain any @author tags. -1 test4tests 0m 0s The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. +1 mvninstall 7m 11s trunk passed +1 compile 0m 30s trunk passed +1 checkstyle 0m 15s trunk passed +1 mvnsite 0m 34s trunk passed +1 mvneclipse 0m 12s trunk passed +1 findbugs 1m 24s trunk passed +1 javadoc 0m 19s trunk passed +1 mvninstall 0m 29s the patch passed +1 compile 0m 27s the patch passed +1 javac 0m 27s the patch passed +1 checkstyle 0m 12s the patch passed +1 mvnsite 0m 31s the patch passed +1 mvneclipse 0m 9s the patch passed +1 whitespace 0m 0s The patch has no whitespace issues. +1 findbugs 1m 30s the patch passed +1 javadoc 0m 16s the patch passed +1 unit 0m 52s hadoop-hdfs-client in the patch passed. +1 asflicense 0m 18s The patch does not generate ASF License warnings. 16m 39s Subsystem Report/Notes Docker Image:yetus/hadoop:9560f25 JIRA Issue HDFS-10735 JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12822711/HDFS-10735-001.patch Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle uname Linux 585ce7383ddb 3.13.0-96-generic #143-Ubuntu SMP Mon Aug 29 20:15:20 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux Build tool maven Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh git revision trunk / dbe663d Default Java 1.8.0_101 findbugs v3.0.0 Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/17160/testReport/ modules C: hadoop-hdfs-project/hadoop-hdfs-client U: hadoop-hdfs-project/hadoop-hdfs-client Console output https://builds.apache.org/job/PreCommit-HDFS-Build/17160/console Powered by Apache Yetus 0.4.0-SNAPSHOT http://yetus.apache.org This message was automatically generated.
        Hide
        benoyantony Benoy Antony added a comment -

        Committed to trunk , branch-2 and branch-2.8

        Show
        benoyantony Benoy Antony added a comment - Committed to trunk , branch-2 and branch-2.8
        Hide
        hudson Hudson added a comment -

        SUCCESS: Integrated in Jenkins build Hadoop-trunk-Commit #10612 (See https://builds.apache.org/job/Hadoop-trunk-Commit/10612/)
        HDFS-10735 Distcp using webhdfs on secure HA clusters fails with (benoy: rev 701c27a7762294e1a5fb2b3ac81f5534aa37f667)

        • (edit) hadoop-hdfs-project/hadoop-hdfs-client/src/main/java/org/apache/hadoop/hdfs/web/WebHdfsFileSystem.java
        Show
        hudson Hudson added a comment - SUCCESS: Integrated in Jenkins build Hadoop-trunk-Commit #10612 (See https://builds.apache.org/job/Hadoop-trunk-Commit/10612/ ) HDFS-10735 Distcp using webhdfs on secure HA clusters fails with (benoy: rev 701c27a7762294e1a5fb2b3ac81f5534aa37f667) (edit) hadoop-hdfs-project/hadoop-hdfs-client/src/main/java/org/apache/hadoop/hdfs/web/WebHdfsFileSystem.java

          People

          • Assignee:
            benoyantony Benoy Antony
            Reporter:
            benoyantony Benoy Antony
          • Votes:
            0 Vote for this issue
            Watchers:
            8 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development