Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-4912

[C++, Python] Allow specifying column names to CSV reader

    XMLWordPrintableJSON

Details

    Description

      Currently I think there is no way to specify custom column names for CSV files. It's possible to specify the full schema of the file, but not just column names.

      See the related discussion here: ARROW-3722

      The goal of this is to re-use the CSV type-inference but still allow people to specify custom names for the columns. As far as I know, there is currently no way to set column names post-hoc, so we should provide a way to specify them before reading the file.

      Related to this, ParseOptions(header_rows=0) is not currently implemented.

      Is there any current way to do this or does this need to be implmented?

      Attachments

        Issue Links

          Activity

            People

              bkietz Ben Kietzman
              pcmoritz Philipp Moritz
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 1h 40m
                  1h 40m