PARQUET-1402: [C++] Incorrect calculation of column start offset for files created by parquet-mr 1.8.1

Details

    Description

      parquet-mr (at least version 1.8.1-fast-201712141648170019-ab0622b) writes dictionary_page_offset == 0 into the ColumnChunk's metadata when it is (supposed to be?) equal to data_page_offset.

      The calculation of col_start in std::unique_ptr<PageReader> GetColumnPageReader(int i) works incorrectly in this case.
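
To illustrate the failure mode described above, here is a minimal sketch. It assumes the reader picks the smaller of the two page offsets as col_start; the struct and both function names are hypothetical stand-ins, not parquet-cpp's actual API or its actual fix. Because a Parquet file begins with the 4-byte magic "PAR1", no page can start at offset 0, so one possible guard is to treat a zero dictionary_page_offset as "not set".

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for the ColumnChunk metadata fields named in the report.
struct ChunkOffsets {
  bool has_dictionary_page;        // true when dictionary_page_offset is present
  int64_t dictionary_page_offset;  // parquet-mr 1.8.1 may write 0 here
  int64_t data_page_offset;        // offset of the first data page
};

// Naive rule (assumed): start reading at the smaller of the two offsets.
// A dictionary_page_offset of 0 drags col_start down to 0.
int64_t NaiveColStart(const ChunkOffsets& m) {
  int64_t col_start = m.data_page_offset;
  if (m.has_dictionary_page && m.dictionary_page_offset < col_start) {
    col_start = m.dictionary_page_offset;
  }
  return col_start;
}

// Guarded rule (one possible workaround): ignore a dictionary offset of 0,
// since offset 0 is occupied by the file's magic bytes.
int64_t GuardedColStart(const ChunkOffsets& m) {
  assert(m.data_page_offset > 0);  // pages cannot start before the magic bytes
  int64_t col_start = m.data_page_offset;
  if (m.has_dictionary_page && m.dictionary_page_offset > 0 &&
      m.dictionary_page_offset < col_start) {
    col_start = m.dictionary_page_offset;
  }
  return col_start;
}
```

With has_dictionary_page set, dictionary_page_offset == 0 and data_page_offset == 4, the naive rule yields 0 while the guarded rule yields 4; a file with a genuine dictionary page at a non-zero offset behaves the same under both rules.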

      Attachments

        1. test.parquet
          2 kB
          Renat Valiullin

            People

              Renat Valiullin (rip.nsk@gmail.com)
              Votes: 0
              Watchers: 3

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 1h 10m