Details
Type: Improvement
Status: Open
Priority: Minor
Resolution: Unresolved
Description
(Apologies if I'm imprecise here; I'm still coming up to speed on the Arrow details.)
The IPC message format describes how data and metadata messages are encapsulated, but it does not require every message to include the schema.
In Go, github.com/apache/arrow/go/arrow/ipc provides NewReader() for reading IPC messages, and it accepts the WithSchema option to pass a schema to the reader. However, the implementation only uses that schema to check that the schema it reads from the IPC stream matches the one passed in. This request is for WithSchema to behave as its name suggests: when the option is provided, the reader should use that Schema to perform reads.
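To make the request concrete, here is a rough, standard-library-only sketch of the schema resolution I have in mind. It is a toy model, not the Arrow code: Schema, Stream, Option, WithSchema, resolveSchema and both error values are hypothetical stand-ins defined here for illustration, and the real reader may need to be structured quite differently.

```go
package main

import (
	"errors"
	"fmt"
)

// Schema is a toy stand-in for arrow.Schema: just an ordered list of field names.
type Schema struct {
	Fields []string
}

// Equal reports whether two schemas list the same fields in the same order.
func (s *Schema) Equal(o *Schema) bool {
	if s == nil || o == nil {
		return s == o
	}
	if len(s.Fields) != len(o.Fields) {
		return false
	}
	for i := range s.Fields {
		if s.Fields[i] != o.Fields[i] {
			return false
		}
	}
	return true
}

// config collects reader options (hypothetical, mirrors the functional-option style).
type config struct {
	schema *Schema
}

// Option configures the toy reader.
type Option func(*config)

// WithSchema supplies a schema from outside the stream.
func WithSchema(s *Schema) Option {
	return func(c *config) { c.schema = s }
}

// Stream models an IPC stream; Schema is nil when no schema message was sent.
type Stream struct {
	Schema *Schema
}

var (
	ErrNoSchema = errors.New("no schema in stream and none supplied via option")
	ErrMismatch = errors.New("stream schema does not match option-provided schema")
)

// resolveSchema picks the schema to read with.
// Behaviour described in this issue as current: validate the stream schema
// against the option. Behaviour requested: fall back to the option-provided
// schema when the stream carries none.
func resolveSchema(st Stream, opts ...Option) (*Schema, error) {
	var cfg config
	for _, opt := range opts {
		opt(&cfg)
	}
	switch {
	case st.Schema == nil && cfg.schema == nil:
		return nil, ErrNoSchema
	case st.Schema == nil:
		return cfg.schema, nil // requested: use the caller's schema for reads
	case cfg.schema != nil && !st.Schema.Equal(cfg.schema):
		return nil, ErrMismatch // existing check: the two must agree
	default:
		return st.Schema, nil
	}
}

func main() {
	want := &Schema{Fields: []string{"id", "name"}}
	got, err := resolveSchema(Stream{}, WithSchema(want))
	fmt.Println(got.Fields, err)
}
```

The sketch keeps the existing comparison for streams that do carry a schema, so callers relying on the current validation would presumably see no change.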
The one gotcha appears to be the dictionary type map, which is currently kept separately from the schema but is populated as part of the internal readSchema() setup. Completeness may warrant another option for supplying that map externally as well. Alternatively, perhaps an option-passed Schema should be documented as not supporting dictionary types?
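If the second route is taken (no dictionary types in an option-passed Schema), the reader could reject such schemas up front instead of relying on documentation alone. A minimal sketch, again with hypothetical toy types (Field, Schema, checkNoDictionaries) rather than the real Arrow ones:

```go
package main

import "fmt"

// Field is a toy stand-in for a schema field; Dictionary marks
// dictionary-encoded columns (hypothetical flag for illustration).
type Field struct {
	Name       string
	Dictionary bool
}

// Schema is a toy stand-in for arrow.Schema.
type Schema struct {
	Fields []Field
}

// checkNoDictionaries returns an error naming the first dictionary-encoded
// field, or nil when the schema is safe to use without a dictionary type map.
func checkNoDictionaries(s *Schema) error {
	for _, f := range s.Fields {
		if f.Dictionary {
			return fmt.Errorf("field %q is dictionary-encoded; not supported with an option-provided schema", f.Name)
		}
	}
	return nil
}

func main() {
	s := &Schema{Fields: []Field{{Name: "id"}, {Name: "tag", Dictionary: true}}}
	fmt.Println(checkNoDictionaries(s))
}
```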