Apache Arrow / ARROW-18352 [R] Datasets API interface improvements / ARROW-18354

[R] Better document the CSV read/parse/convert options we can use with open_dataset()

Details

    • Type: Sub-task
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: R
    • Labels: None

    Description

      When a user opens a CSV dataset using open_dataset(), they can take advantage of many read, parse, and convert options, which can be specified via CsvReadOptions$create(), CsvParseOptions$create(), and CsvConvertOptions$create().

      However, because these options are passed in via the ellipsis (...) argument, it is not clear to users which arguments are supported. They are not documented in the open_dataset() docs, and the situation is further confused by the fact that we support a mix of Arrow and readr parameter names (see the code for CsvFileFormat$create()), though users can't use both at the same time.

      We should better document the arguments we do support.
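
      To illustrate the two naming styles described above, here is a minimal sketch. It assumes the arrow R package is installed and that open_dataset() forwards `delim` (readr-style name) and `delimiter` (Arrow-style name) through `...` to the CSV file format; the file contents and variable names are made up for the example, and behavior may vary between arrow versions.

```r
library(arrow)

# Write a small semicolon-delimited file into a temporary directory
# (hypothetical example data, not taken from the issue).
csv_dir <- tempfile("csv_dataset_")
dir.create(csv_dir)
writeLines(
  c("id;value", "1;a", "2;b", "3;c"),
  file.path(csv_dir, "part-0.csv")
)

# readr-style naming: the delimiter is passed through `...` as `delim`.
ds_readr <- open_dataset(csv_dir, format = "csv", delim = ";")

# Arrow-style naming: the same setting passed through `...` as `delimiter`.
ds_arrow <- open_dataset(csv_dir, format = "csv", delimiter = ";")

# Both calls should discover the same column names.
print(names(ds_readr))
print(names(ds_arrow))
```

      Neither call signals which of the two spellings is expected, which is the documentation gap this issue describes. The sketch deliberately uses one style per call, since the two styles cannot be mixed.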

          People

            Assignee: Unassigned
            Reporter: Nicola Crane (thisisnic)
            Votes: 0
            Watchers: 2