Uploaded image for project: 'Zeppelin'
  1. Zeppelin
  2. ZEPPELIN-2956

Downloaded CSV/TSV data will get unexpected division when the column value contains both delimiter and quotation mark.

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Critical
    • Resolution: Unresolved
    • 0.6.2, 0.7.2, 0.7.3
    • None
    • front-end
    • None
    • zeppelin 0.7.2
      jdk 1.7

    Description

      After downloading csv/tsv data from zeppelin paragraph, I opened the file through Excel and found that the data is not seperated as expected.

      Then I found that's because the colunm value contains both delimiter and quotation mark, so Excel could not recognize and seperate the value correctly.

      Here are zeppelin downloaded csv data example:

      hello,beijing,i,love,you -> can be well recognized
      hello,beijing,i,lo"v"e,you -> can be well recognized
      hello,beijing,i,lo"v,"e,you -> can not be well recognized

      I met this bug in our production environment, the column value format might be json, and json format data may contain quotation mark and comma.

      Attachments

        1. csvdata.png
          19 kB
          bruce xu

        Issue Links

          Activity

            People

              Unassigned Unassigned
              xwc3504 bruce xu
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated: