Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-17130

Enable multiple character delimiters in read_csv

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 8.0.1
    • None
    • Format
    • None

    Description

      Read_CSV ParseOptions allows only a single character delimiter. Single character delimiters are highly susceptible to the candidate value existing within the data to be loaded, negating the ability to serve as a delimiter.

      If a double character delimiter is used, the current limit of a single character returns "only single character unicode strings can be converted to Py_UCS4, got length 2"

      Attachments

        Activity

          People

            Unassigned Unassigned
            jhowa208 Jack Howard
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: