Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-3656

[C++] Allow whitespace in numeric CSV fields

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 0.11.0
    • 0.12.0
    • C++

    Description

      Pandas allows whitespace before and after numbers in CSV files, but Arrow doesn't:

      >>> s = b"a,b,c\n12 , 34 , 56\n"
      >>> pd.read_csv(io.BytesIO(s))
          a   b   c
      0  12  34  56
      >>> csv.read_csv(io.BytesIO(s)).to_pandas()
              a        b       c
      0  b'12 '  b' 34 '  b' 56'
      

      Attachments

        Issue Links

          Activity

            People

              apitrou Antoine Pitrou
              apitrou Antoine Pitrou
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 1h 40m
                  1h 40m