Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-5037

Change default Parquet array resolution according to Parquet standard.

    XMLWordPrintableJSON

Details

    Description

      With IMPALA-4725 we've introduced query options to control the field resolution behavior when scanning Parquet files with nested arrays. The current default strategy currently tries to auto-detect the array encoding within Parquet files, but this strategy can sometimes subtly go wrong and return incorrect results due to the inherent ambiguity of the 2/3-level encoding schemes in Parquet.

      We should switch the default resolution strategy according to the Parquet standard 3-level encoding, instead of the current auto-detect.

      Attachments

        Issue Links

          Activity

            People

              alex.behm Alexander Behm
              alex.behm Alexander Behm
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: