Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-6876

FileInputFormat.listStatus should not fetch delegation tokens

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      FileInputFormat.listStatus fetches delegation tokens: https://github.com/apache/hadoop/blob/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapred/FileInputFormat.java#L213

      AFAICT, this is unnecessary. listStatus doesn't delegate those tokens to another process. This is causing issues described in the attached Spark Kerberos ticket, because TokenCache.obtainTokensForNameNodes, which is used to fetch the delegation tokens, assumes that certain MapReduce configuration variables are set, which isn't true in the Spark calling code. This is a separate problem, but nonetheless it wouldn't have arisen if listStatus weren't fetching delegation tokens.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              mgummelt Michael Gummelt
              Votes:
              1 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated: