Uploaded image for project: 'Parquet'
  1. Parquet
  2. PARQUET-2066

[C++][Parquet] num_rows is incorrect for nested types

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • parquet-cpp
    • None

    Description

      Data pages v2 have:

      • num_rows
      • num_values

      we write num_rows equal to the num_values. However, they represent different aspects.

      Given a list such as "[[0, 1], None, [2, None, 3]]", num_rows = 3 and num_values = 6. We currently report 6 in both instances.

      Attachments

        Activity

          People

            Unassigned Unassigned
            jorgecarleitao Jorge Leitão
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: