Hadoop HDFS / HDFS-7597

DelegationTokenIdentifier should cache the TokenIdentifier to UGI mapping

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 2.0.0-alpha
    • Fix Version/s: 2.8.0, 3.0.0-alpha1
    • Component/s: webhdfs
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      A WebHDFS seek closes the current connection and issues a new open request at the new offset. The RPC layer caches connections, so the DN keeps a lingering connection open to the NN. Connection caching is keyed in part on the UGI. Although the client uses the same token for the new-offset request, the resulting UGI is different, which forces the DN to open another, unnecessary connection to the NN.

      A job that performs many seeks can easily crash the NN through file descriptor (fd) exhaustion.
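
      The effect described above can be illustrated with a minimal, self-contained Java sketch. It is not the attached patch and does not use Hadoop classes: `Ugi`, `TokenKey`, `UGI_CACHE`, and `connectionsOpened` are hypothetical names, and the sketch assumes that a UGI compares by object identity, so two UGIs built from the same token are treated as different connection-cache keys. Caching the token-identifier-to-UGI mapping makes repeated requests with the same token resolve to the same object, and therefore to the same cached connection.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.HashMap;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

public class UgiCacheSketch {
    /** Hypothetical stand-in for a UGI. No equals() override, so equality is object identity. */
    static final class Ugi {
        final String user;
        Ugi(String user) { this.user = user; }
    }

    /** Wraps token identifier bytes so they can be used as a map key compared by content. */
    static final class TokenKey {
        private final byte[] bytes;
        TokenKey(byte[] bytes) { this.bytes = bytes.clone(); }
        @Override public boolean equals(Object o) {
            return o instanceof TokenKey && Arrays.equals(bytes, ((TokenKey) o).bytes);
        }
        @Override public int hashCode() { return Arrays.hashCode(bytes); }
    }

    /** Assumed cache: token identifier -> UGI. A real cache would also need eviction. */
    private static final Map<TokenKey, Ugi> UGI_CACHE = new ConcurrentHashMap<>();

    /** Builds a fresh UGI on every call, as happens without the cache. */
    static Ugi uncachedUgi(byte[] tokenId) {
        return new Ugi(new String(tokenId, StandardCharsets.UTF_8));
    }

    /** Returns the same UGI object for the same token identifier bytes. */
    static Ugi cachedUgi(byte[] tokenId) {
        return UGI_CACHE.computeIfAbsent(new TokenKey(tokenId),
            k -> new Ugi(new String(tokenId, StandardCharsets.UTF_8)));
    }

    /** Simulates a connection cache keyed by UGI and returns how many connections were opened. */
    static int connectionsOpened(int seeks, boolean useCache) {
        Map<Ugi, Integer> connections = new HashMap<>();
        byte[] token = "alice-token".getBytes(StandardCharsets.UTF_8);
        for (int i = 0; i < seeks; i++) {
            Ugi ugi = useCache ? cachedUgi(token) : uncachedUgi(token);
            connections.merge(ugi, 1, Integer::sum); // a new key means a new connection
        }
        return connections.size();
    }

    public static void main(String[] args) {
        System.out.println(connectionsOpened(100, false)); // prints 100
        System.out.println(connectionsOpened(100, true));  // prints 1
    }
}
```

      In this simulation, 100 seeks without the cache yield 100 distinct connection keys, while the cached lookup yields one. The sketch leaves out expiry and size bounds, which any production cache of this kind would need to address.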

      Attachments

        1. HDFS-7597.01.patch
          7 kB
          Xiao Chen
        2. HDFS-7597.patch
          5 kB
          Bob Hansen
        3. HDFS-7597.patch
          5 kB
          Daryn Sharp
        4. HDFS-7597.patch
          4 kB
          Daryn Sharp

          People

            Assignee: Daryn Sharp
            Reporter: Daryn Sharp
            Votes: 0
            Watchers: 23
