Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-6005

[C++] parquet::arrow::FileReader::GetRecordBatchReader() does not behave as documented since ARROW-1012

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 0.14.0, 0.14.1
    • 0.15.0
    • C++

    Description

      GetRecordBatchReader() should

      "Return a RecordBatchReader of row groups selected from row_group_indices, the
      ordering in row_group_indices matters." (that is what the doxygen string says),

      but:

      Since change ARROW-1012, it ignores the row_group_indices argument.

      The row_group_indices_ in the RowGroupRecordBatchReader that is created are never used.

      Either the documentation should be changed, or the behavior should be reverted. I would prefer the latter, as I do not know how to make sure to read a specific row groups anymore...

      Attachments

        Issue Links

          Activity

            People

              hatem Hatem Helal
              Drohmann Martin
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 50m
                  50m