Details
-
New Feature
-
Status: Resolved
-
Minor
-
Resolution: Fixed
-
0.13.0, 0.14.0
Description
If two gzipped files are concatenated together, the result is a valid gzip file. However, it appears that pyarrow.csv.read_csv will only read the portion related to the first file.
If the repro script here is run, the output is:
$ python repro.py
pyarrow.csv only reads one row:
{{ x}}
0 1
pandas reads two rows:
{{ x}}
0 1
1 2
pyarrow version: 0.14.0
Attachments
Issue Links
- links to