IMPALA-7779

Parquet Scanner can write binary data into profile

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: Impala 3.1.0
    • Fix Version/s: Impala 4.0.0
    • Component/s: Backend

    Description

      In hdfs-parquet-scanner.cc:1224 we log an invalid file version string. Whatever 4 bytes the pointer points to end up in the profile. These bytes can be non-ASCII, which can break tools that parse profiles and expect their content to be plain text. We should either remove the bytes from the message or escape them as hex.
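
      To illustrate the "escape them as hex" option, here is a minimal sketch. It is not the code in hdfs-parquet-scanner.cc and not necessarily the fix that was committed. The helper name EscapeBytesForProfile and its signature are hypothetical. It assumes the profile should contain only printable ASCII, so every other byte is rendered as \xNN.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>

// Hypothetical helper, not part of Impala: returns a printable-ASCII
// rendering of `len` raw bytes. Any byte outside 0x20..0x7e is written as
// \xNN (two lowercase hex digits).
std::string EscapeBytesForProfile(const uint8_t* data, size_t len) {
  static const char kHex[] = "0123456789abcdef";
  std::string out;
  out.reserve(len * 4);  // Worst case: every byte becomes 4 characters.
  for (size_t i = 0; i < len; ++i) {
    const uint8_t b = data[i];
    // The backslash itself is escaped too, so the output stays unambiguous.
    if (b >= 0x20 && b <= 0x7e && b != '\\') {
      out.push_back(static_cast<char>(b));
    } else {
      out += "\\x";
      out.push_back(kHex[b >> 4]);
      out.push_back(kHex[b & 0x0f]);
    }
  }
  return out;
}
```

      With this sketch, the four bytes 0x00 0xff 'A' 0x0a would be logged as \x00\xffA\x0a, while a printable sequence such as PAR1 passes through unchanged. Escaping keeps the diagnostic value of the message; dropping the bytes from the message entirely is the simpler alternative named above.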

          People

            Assignee: Qifan Chen (sql_forever)
            Reporter: Lars Volker (lv)
            Votes: 0
            Watchers: 3
