IMPALA-5193: Impala reads gzip compressed text as binary when skip.header.line.count > 0

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: Impala 2.7.0
    • Fix Version/s: Impala 2.9.0
    • Component/s: Backend

      Description

      When an external table is created over a gzip-compressed text file, setting the table property skip.header.line.count to a value greater than 0 causes Impala to read the gzip file as binary instead of decompressing it.

      See my attached testcase and test data.

      1. TC1
        3 kB
        Vincent Tran
      2. test.txt.gz
        0.1 kB
        Vincent Tran
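
The expected behaviour described above can be sketched in Python. This is a minimal illustration under stated assumptions, not Impala's implementation and not the attached TC1: the sample rows, the table name gz_header_test, the location, and the helper names make_gzip and read_rows are all hypothetical. It shows that header skipping should apply to the decompressed lines, whereas the raw file starts with the gzip magic bytes 0x1f 0x8b, which is what a reader returns if it treats the file as uncompressed text.

```python
import gzip
import io
from typing import List

# Hypothetical sample data; the attached test.txt.gz is not reproduced here.
SAMPLE_TEXT = "id,name\n1,alice\n2,bob\n"

# Hypothetical DDL of the kind the report describes (names and path are assumptions).
HYPOTHETICAL_DDL = """
CREATE EXTERNAL TABLE gz_header_test (id INT, name STRING)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
LOCATION '/tmp/gz_header_test'
TBLPROPERTIES ('skip.header.line.count'='1')
"""


def make_gzip(text: str) -> bytes:
    """Compress text into an in-memory gzip blob."""
    return gzip.compress(text.encode("utf-8"))


def read_rows(blob: bytes, skip_header_lines: int) -> List[str]:
    """Decompress first, then drop the requested number of header lines."""
    with gzip.open(io.BytesIO(blob), mode="rt", encoding="utf-8") as handle:
        lines = handle.read().splitlines()
    return lines[skip_header_lines:]


blob = make_gzip(SAMPLE_TEXT)

# Expected result: data rows only, header removed after decompression.
data_rows = read_rows(blob, skip_header_lines=1)

# The raw, undecompressed bytes begin with the gzip magic number.
starts_with_gzip_magic = blob[:2] == b"\x1f\x8b"
```

Comparing data_rows with the query result from the affected table is one way to confirm whether a given build still returns raw compressed bytes.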

          Activity

          Lars Volker added a comment:

          This has been fixed in a recent commit to address IMPALA-3905: https://gerrit.cloudera.org/#/c/6000/

          Lars Volker added a comment:

          Tests for this were added in IMPALA-5287.


            People

            • Assignee:
              lv Lars Volker
              Reporter:
              thundergun Vincent Tran
            • Votes:
              0
              Watchers:
              3
