PARQUET-2067

[C++] null_count and num_nulls incorrect for repeated columns


Details

    Description

      Currently, only nulls at the leaf are counted in the null count statistics. For nested lists this is incorrect: a null list has zero elements, so it never appears at the leaf and is left out of the count.

       

      Example from mailing list discussion

       

      [[0, 1], None, [2, None, 3]]

       

      should have a null count of 2 (it currently reports as 1).
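
      To make the discrepancy concrete, here is a minimal Python sketch that counts nulls both ways for the example above. It is plain Python, not the Parquet C++ code; the function names `leaf_null_count` and `total_null_count` are hypothetical and exist only for this illustration.

```python
from typing import List, Optional

# One column value: either a null list, or a list of optional ints.
Row = Optional[List[Optional[int]]]


def leaf_null_count(column: List[Row]) -> int:
    """Count only null elements inside non-null lists.

    This mirrors the behaviour the issue describes as current:
    a null list contributes no leaf values, so it is never counted.
    """
    return sum(
        1
        for row in column
        if row is not None
        for value in row
        if value is None
    )


def total_null_count(column: List[Row]) -> int:
    """Count null lists as well as null elements.

    This is the count the issue says the statistics should report.
    """
    null_lists = sum(1 for row in column if row is None)
    return null_lists + leaf_null_count(column)


column: List[Row] = [[0, 1], None, [2, None, 3]]
leaf = leaf_null_count(column)
total = total_null_count(column)
print(f"leaf-only null count: {leaf}")
print(f"null count including null lists: {total}")
```

      The leaf-only count misses the `None` list in the second position, which is the gap this issue reports.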

            People

              willb_google William Butler
              emkornfield Micah Kornfield

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 2h 20m