Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-9749

[C++][Dataset] Extract format-specific scan options from FileFormat

    XMLWordPrintableJSON

Details

    Description

      Currently format specific scan options are embedded as members of the corresponding subclass of FileFormat. Extracting these to an options struct would provide better separation of concerns; currently the only way to scan a parquet formatted dataset with different options is to reconstruct it in a differently optioned format from its component files.

      CsvFileFormat could retain ParseOptions as a member, since (for example) tab-separated vs comma-separated values can justifiably be considered different formats.

      Attachments

        Issue Links

          Activity

            People

              lidavidm David Li
              bkietz Ben Kietzman
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 2h
                  2h