Apache Arrow / ARROW-17904

[C++] Parquet support read page with crc32 checking

Details

    Description

      Currently, the C++ Parquet implementation supports writing pages with a checksum, but `ReadPage` does not verify any checksum. I would like to fix that.

      I'd like to split this patch into separate parts:

      1. Implement CRC for DataPageV1. This requires a writer-side CRC config option, computing the CRC in read mode, verifying the CRC, and migrating the tests from parquet-mr.
      2. Implement CRC for DataPageV2.
      3. Implement CRC for dictionary pages.
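
      The read-side check described above could look roughly like the sketch below. It is an illustration only, not the Arrow implementation: `RawPage`, `CrcStatus`, `Crc32`, and `VerifyPageCrc` are hypothetical names, and the sketch assumes the stored checksum is a standard CRC-32 (the gzip polynomial) over the page bytes that follow the page header.

      ```cpp
      #include <cassert>
      #include <cstddef>
      #include <cstdint>
      #include <vector>

      // Standard CRC-32 (reflected polynomial 0xEDB88320, as used by gzip),
      // computed bit by bit. Slow but dependency-free; a real reader would
      // more likely use a table-driven or hardware-accelerated routine.
      inline uint32_t Crc32(const uint8_t* data, size_t size) {
        assert(data != nullptr || size == 0);
        uint32_t crc = 0xFFFFFFFFu;
        for (size_t i = 0; i < size; ++i) {
          crc ^= data[i];
          for (int bit = 0; bit < 8; ++bit) {
            // Shift right; XOR in the polynomial when the low bit was set.
            crc = (crc >> 1) ^ (0xEDB88320u & (0u - (crc & 1u)));
          }
        }
        return crc ^ 0xFFFFFFFFu;
      }

      // Hypothetical stand-in for a page as read from the file.
      struct RawPage {
        bool has_crc = false;       // whether the page header carried a checksum
        uint32_t crc = 0;           // checksum stored in the page header
        std::vector<uint8_t> body;  // page bytes following the header
      };

      enum class CrcStatus { kNotPresent, kMatch, kMismatch };

      // Recompute the checksum over the page body and compare it with the
      // stored value. Pages without a checksum are reported separately so the
      // caller can decide whether to accept them.
      inline CrcStatus VerifyPageCrc(const RawPage& page) {
        if (!page.has_crc) return CrcStatus::kNotPresent;
        const uint32_t actual = Crc32(page.body.data(), page.body.size());
        return actual == page.crc ? CrcStatus::kMatch : CrcStatus::kMismatch;
      }
      ```

      A reader would call `VerifyPageCrc` right after loading a page and raise an error on `kMismatch`. Returning `kNotPresent` as a distinct status keeps files written without checksums readable, which is one possible design choice rather than a requirement.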

          People

            mwish Xuwei Fu
            Votes: 0
            Watchers: 5

            Dates

              Created:
              Updated:

              Time Tracking

                Estimated: Not Specified
                Remaining: 0h
                Logged: 8.5h