Apache Drill / DRILL-3393

Quotes not being recognized in tab delimited (tsv) files

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 1.0.0
    • Fix Version/s: 1.2.0
    • Component/s: Storage - Text & CSV
    • Labels: None

      Description

      Drill does not seem to recognise quotes in TSV files, although it handles them correctly in CSV files.

      For example, given the following files:

      test.tsv (fields are separated by a single tab character)
      -------
      foobar bar
      "aa" "bc"
      -------

      test.csv
      ----------
      foobar,bar
      "aa","bc"
      ----------

      I get these results:

      0: jdbc:drill:zk=local> select columns[0], columns[1] from dfs.`test.csv`;
      +---------+---------+
      | EXPR$0  | EXPR$1  |
      +---------+---------+
      | foobar  | bar     |
      | aa      | bc      |
      +---------+---------+
      2 rows selected (0.259 seconds)
      0: jdbc:drill:zk=local> select columns[0], columns[1] from dfs.`test.tsv`;
      +---------+---------+
      | EXPR$0  | EXPR$1  |
      +---------+---------+
      | foobar  | bar     |
      | aa" "bc | null    |
      +---------+---------+
      2 rows selected (0.122 seconds)
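
For comparison, the result one would presumably expect from the TSV query can be sketched outside Drill. The snippet below is only an illustration using Python's standard `csv` module, not Drill's text reader; the helper name `parse_delimited` and the inline copy of `test.tsv` are assumptions made for this example.

```python
import csv
import io

# Contents of test.tsv from the report, with the two fields on each line
# separated by a real tab character (assumed; the report shows them spaced).
TSV_TEXT = 'foobar\tbar\n"aa"\t"bc"\n'


def parse_delimited(text, delimiter):
    """Parse delimited text, treating double quotes as field quoting.

    Hypothetical helper for illustration only; it does not reproduce
    Drill's text reader.
    """
    reader = csv.reader(io.StringIO(text), delimiter=delimiter, quotechar='"')
    return [row for row in reader]


rows = parse_delimited(TSV_TEXT, "\t")
for row in rows:
    print(row)
```

With quote handling applied to the tab delimiter, the second row comes out as two fields, `aa` and `bc`, matching what the CSV query returns, rather than a single field containing the quotes and a null second column.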

        Attachments

        1. DRILL-3393.patch
          3 kB
          Steven Phillips
        2. fail.tsv
          0.0 kB
          Chi Lang

            People

            • Assignee: Chun Chang (cchang@maprtech.com)
            • Reporter: Chi Lang (chilang)
            • Votes: 1
            • Watchers: 6
