Details
- Type: Improvement
- Status: Resolved
- Priority: Major
- Resolution: Fixed
Description
In several places in cpp/src/parquet/arrow, the FromParquetSchema function is used to construct fields from a filtered "view" of the Parquet schema. This is a hack that exists because there is no "schema tree" that maps Parquet concepts to Arrow Field objects.
One manifestation of this issue is that I was unable to implement dictionary encoded subfields in cases like list<string>, where you want the inner field to be dictionary-encoded.
Patch forthcoming.
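
To make the "schema tree" idea in the description concrete, here is a minimal sketch of what such a structure could look like. It is not the code from the patch: `SketchField`, `SchemaNode`, `MakeLeaf`, `MakeList`, `RequestDictionary`, and `ResolveType` are all hypothetical names, types are modeled as plain strings rather than real Arrow types, and only list nesting is handled. The point is only that, once each Arrow-side field is tied to its Parquet leaf column in a tree, an inner field such as the `string` in list<string> can be marked for dictionary encoding without rebuilding fields from a filtered schema.

```cpp
#include <cassert>
#include <string>
#include <utility>
#include <vector>

// Hypothetical, simplified stand-in for an Arrow field: a name plus a type
// rendered as a string. This is NOT the real arrow::Field.
struct SketchField {
  std::string name;
  std::string type;
};

// One node of the hypothetical schema tree. A leaf records which Parquet
// leaf column backs it; a nested node owns its children.
struct SchemaNode {
  SketchField field;
  int column_index = -1;         // >= 0 only for leaves
  bool read_dictionary = false;  // leaf should be read dictionary-encoded
  std::vector<SchemaNode> children;

  bool is_leaf() const { return children.empty(); }
};

// Build a leaf tied to a Parquet leaf column.
SchemaNode MakeLeaf(std::string name, std::string type, int column_index) {
  SchemaNode node;
  node.field = {std::move(name), std::move(type)};
  node.column_index = column_index;
  return node;
}

// Build a list node wrapping a single item node.
SchemaNode MakeList(std::string name, SchemaNode item) {
  SchemaNode node;
  node.field = {std::move(name), "list<" + item.field.type + ">"};
  node.children.push_back(std::move(item));
  return node;
}

// Flag the leaf backed by `column_index` for dictionary reads.
// Returns true if such a leaf exists in the subtree.
bool RequestDictionary(SchemaNode& node, int column_index) {
  if (node.is_leaf()) {
    if (node.column_index != column_index) return false;
    node.read_dictionary = true;
    return true;
  }
  for (auto& child : node.children) {
    if (RequestDictionary(child, column_index)) return true;
  }
  return false;
}

// Recompute the type of a node bottom-up, wrapping flagged leaves in
// dictionary<...>. The type strings are simplified for illustration.
std::string ResolveType(const SchemaNode& node) {
  if (node.is_leaf()) {
    return node.read_dictionary ? "dictionary<" + node.field.type + ">"
                                : node.field.type;
  }
  assert(node.children.size() == 1);  // this sketch only models lists
  return "list<" + ResolveType(node.children[0]) + ">";
}
```

With a tree built as `MakeList("tags", MakeLeaf("item", "string", 0))`, calling `RequestDictionary(root, 0)` marks only the inner leaf, and `ResolveType(root)` then reports the dictionary type nested inside the list. The column name `tags` and the column index are illustrative assumptions.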
Issue Links
- is depended upon by: ARROW-3325 [Python] Support reading Parquet binary/string columns directly as DictionaryArray (Resolved)