Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-2311

TRANSFORM statements should come with their own ROW FORMATs.

Log workAgile BoardRank to TopRank to BottomBulk Copy AttachmentsBulk Move AttachmentsAdd voteVotersWatch issueWatchersCreate sub-taskConvert to sub-taskMoveLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Minor
    • Resolution: Unresolved
    • None
    • None
    • None

    Description

      Sometimes Hive tables contain tabs and/or other characters that could easily be misinterpreted by a transformer as a delimiter. This can break many TRANSFORM queries.

      The solution is to have a ROW FORMAT semantics that can be attached to an individual TRANSFORM instance. It would have the same semantics as table creation, but during serialization it would ensure that any formal delimiter characters that did not indicate an actual break between columns would be escaped.

      At the very least, it is a bug that TRANSFORM statement deserialization does not backslash out literal tabs in the current implementation.

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned Assign to me
            akramer Adam Kramer

            Dates

              Created:
              Updated:

              Slack

                Issue deployment