  1. Apache Drill
  2. DRILL-7869

CSV files can't mix line breaks: \x0d vs. \x0d\x0a

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 1.19.0
    • Fix Version/s: None
    • Component/s: Storage - Text & CSV
    • Labels: None

    Description

      Querying CSV files that use \x0d (carriage return, \r) as the line delimiter fails with "DATA_READ ERROR: Column exceeds maximum length of 1024" under the default configuration.
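      The collapse into a single record can be illustrated outside Drill with a minimal Python sketch. It assumes a reader that splits records on \n only; `split_records` and the sample data are hypothetical and do not reflect Drill's actual reader code.

```python
def split_records(data: bytes, line_delimiter: bytes = b"\n") -> list[bytes]:
    """Split raw file bytes into records on one fixed delimiter (illustrative only)."""
    records = data.split(line_delimiter)
    # Drop the empty trailing record left behind by a final delimiter.
    if records and records[-1] == b"":
        records.pop()
    return records


# 200 rows, each terminated by a lone carriage return (\x0d).
cr_only = b"".join(b"id%03d,value\r" % i for i in range(200))

# Splitting on \n finds no delimiter, so the whole file is one record.
as_lf = split_records(cr_only)

# Splitting on \r recovers the individual rows.
as_cr = split_records(cr_only, b"\r")

print(len(as_lf), len(as_cr))
```

      With a single fixed delimiter, the reader either sees one oversized record or, once reconfigured, the expected rows; it cannot handle both conventions at once.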

      The \x0d character isn't treated as a line break, so the entire file is read as a single record. This is configurable as "delimiter" in the format configuration, but that is problematic when CSV files have mixed line endings. I have files with both \x0d (\r) and \x0d\x0a (\r\n) line endings and need to be able to read both without changing the configuration between queries.
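      Until both conventions can be read with one configuration, a possible workaround is to normalize line endings before the files reach Drill. The sketch below is an assumption about how such a pre-processing step might look, not something Drill provides; `normalize_line_endings` is a hypothetical helper. It also rewrites carriage returns inside quoted fields, which may not be acceptable for all data.

```python
import re

# Match \r\n first so a Windows line ending is replaced once, not twice.
_LINE_ENDING = re.compile(rb"\r\n|\r")


def normalize_line_endings(data: bytes) -> bytes:
    """Rewrite \\r\\n and lone \\r line endings as \\n (hypothetical helper)."""
    return _LINE_ENDING.sub(b"\n", data)


mixed = b"a,b\rc,d\r\ne,f\n"
normalized = normalize_line_endings(mixed)
print(normalized.split(b"\n"))
```

      After this step, every file uses \n, so one delimiter setting covers all of them.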

          People

            Assignee: Unassigned
            Reporter: Curtis Lambert (curtis283)
            Votes: 0
            Watchers: 2

            Dates

              Created:
              Updated: