Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-14841 Replication - Phase 2
  3. HIVE-17289

EXPORT and IMPORT shouldn't perform distcp with doAs privileged user.

Log workAgile BoardRank to TopRank to BottomBulk Copy AttachmentsBulk Move AttachmentsVotersWatch issueWatchersConvert to IssueMoveLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Reviewed

    Description

      Currently, EXPORT uses distcp to dump data files to dump directory and IMPORT uses distcp to copy the larger files/large number of files from dump directory to table staging directory. But, this copy fails as distcp is always done with doAs user specified in hive.distcp.privileged.doAs, which is "hdfs' by default.
      Need to remove usage of doAs user when try to distcp from EXPORT/IMPORT flow.
      Privileged user based distcp should be done only for REPL DUMP/LOAD commands.
      Also, need to set the default config for hive.distcp.privileged.doAs to "hive" as "hdfs" super-user is never allowed.

      Attachments

        1. HIVE-17289.01.patch
          32 kB
          Sankar Hariappan

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            sankarh Sankar Hariappan Assign to me
            sankarh Sankar Hariappan
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment