Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-3726

Drill is not properly interpreting CRLF (0d0a). CR gets read as content.

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.1.0
    • Fix Version/s: 1.8.0
    • Component/s: Storage - Text & CSV
    • Labels:
      None
    • Environment:

      Linux RHEL 6.6, OSX 10.9

      Description

      When we query the last attribute of a text file, we get missing characters. Looking at the row through Drill, a \r is included at the end of the last attribute.
      Looking in a text editor, it's not embedded into that attribute.

      I'm thinking that Drill is not interpreting CRLF (0d0a) as a new line, only the LF, resulting in the CR becoming part of the last attribute.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                arina Arina Ielchiieva
                Reporter:
                ebegoli Edmon Begoli
                Reviewer:
                Krystal
              • Votes:
                1 Vote for this issue
                Watchers:
                5 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved:

                  Time Tracking

                  Estimated:
                  Original Estimate - 120h
                  120h
                  Remaining:
                  Remaining Estimate - 120h
                  120h
                  Logged:
                  Time Spent - Not Specified
                  Not Specified