Uploaded image for project: 'Parquet'
  1. Parquet
  2. PARQUET-324

row count incorrect if data file has more than 2^31 rows

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 1.7.0, 1.8.0
    • 1.8.0
    • parquet-mr
    • None

    Description

      If a parquet file has more than 2^31 rows, the row count written into the file metadata is incorrect.
      The cause of the problem is the use of an int instead of long data type for numRows in ParquetMetadataConverter, toParquetMetadata:
      int numRows = 0;
      for (BlockMetaData block : blocks)

      { numRows += block.getRowCount(); addRowGroup(parquetMetadata, rowGroups, block); }

      Attachments

        Issue Links

          Activity

            People

              tfriedr Thomas Friedrich
              tfriedr Thomas Friedrich
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: