HUDI-1484: Escape the partition value in HiveSyncTool


Details

    Description

      Currently, Hudi URL-encodes the partition value when URL_ENCODE_PARTITIONING_OPT_KEY is set to true. However, HiveSyncTool does not decode the partition value when it syncs partitions to Hive, so Hive encodes the already-encoded value a second time. This leads to an exception when the table is queried with Hive SQL or Spark SQL.

      For example, Hudi encodes the partition "2020/12/20" as "2020%2F12%2F20". When HiveSyncTool syncs "2020%2F12%2F20" to Hive, Hive encodes it again to "2020%252F12%252F20". As a result, a query such as "select xx from tbl where dt = '2020/12/20'" fails with an exception.

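The double encoding described above can be reproduced with a minimal sketch. It assumes java.net.URLEncoder as a stand-in for both the encoding Hudi applies and the escaping Hive applies (Hive's real escaping routine may differ in detail), and the class and method names are hypothetical, not part of Hudi or Hive. It also shows one possible fix: decoding the value before it is handed to Hive, so that it ends up encoded only once.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLDecoder;
import java.net.URLEncoder;

// Hypothetical helper for illustration only; not part of Hudi or Hive.
final class PartitionEscapeSketch {

    // Stand-in for the encoding step (assumed to behave like URLEncoder).
    static String encode(String value) {
        try {
            return URLEncoder.encode(value, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            throw new IllegalStateException(e); // UTF-8 is always supported
        }
    }

    // Inverse of encode(); what the sync step would need to apply first.
    static String decode(String value) {
        try {
            return URLDecoder.decode(value, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            throw new IllegalStateException(e); // UTF-8 is always supported
        }
    }

    public static void main(String[] args) {
        String raw = "2020/12/20";

        // Value written by Hudi when URL encoding of partitions is enabled.
        String storedByHudi = encode(raw);

        // Current behavior: the encoded value is encoded a second time.
        String doubleEncoded = encode(storedByHudi);

        // Sketch of the fix: decode before syncing, so only one encoding remains.
        String syncedAfterFix = encode(decode(storedByHudi));

        System.out.println(storedByHudi);   // 2020%2F12%2F20
        System.out.println(doubleEncoded);  // 2020%252F12%252F20
        System.out.println(syncedAfterFix); // 2020%2F12%2F20
    }
}
```

The second encoding turns each "%" into "%25", which is why the partition registered in Hive no longer matches the value used in the query predicate.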
       

          People

            pzw2018 pengzhiwei