Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-3393

Quotes not being recognized in tab delimited (tsv) files

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Critical
    • Resolution: Fixed
    • 1.0.0
    • 1.2.0
    • Storage - Text & CSV
    • None

    Description

      Drill doesn't seem to recognise quotes in tsv, while working fine for csv files.

      For example, given the following files

      test.tsv
      -------
      foobar bar
      "aa" "bc"
      -------

      test.csv
      ----------
      foobar,bar
      "aa","bc"
      ----------

      I get these results:

      0: jdbc:drill:zk=local> select columns[0], columns[1] from dfs.`test.csv`;
      ----------------+

      EXPR$0 EXPR$1

      ----------------+

      foobar bar
      aa bc

      ----------------+
      2 rows selected (0.259 seconds)
      0: jdbc:drill:zk=local> select columns[0], columns[1] from dfs.`test.tsv`;
      -----------------+

      EXPR$0 EXPR$1

      -----------------+

      foobar bar
      aa" "bc null

      -----------------+
      2 rows selected (0.122 seconds)

      Attachments

        1. DRILL-3393.patch
          3 kB
          Steven Phillips
        2. fail.tsv
          0.0 kB
          Chi Lang

        Activity

          People

            cchang@maprtech.com Chun Chang
            chilang Chi Lang
            Votes:
            1 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: