Details
Type: Wish
Status: Resolved
Priority: Major
Resolution: Fixed
None
Description
I read a large (taxi) CSV file and specified that some columns should be dictionary-encoded. The resulting Table has ChunkedArrays with 1604 chunks. When I try to write this Table to the IPC file format (write_feather), I get an error:
Invalid: Dictionary replacement detected when writing IPC file format. Arrow IPC files only support a single dictionary for a given field accross all batches.
I can write this to Parquet and read it back in, and the roundtrip of the data is correct. We should be able to do this in IPC too.