Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
Description
In a dataset, one can have timestamp columns with different resolutions. There should be an optional to cast all timestamps to the type mentioned in the schema. A typical example could be that we store a pandas DataFrame with ns precision to Parquet files that only support us resolution in their most widespread from. Then the dataset schema and the actual file content don't match anymore.
Attachments
Issue Links
- is related to
-
ARROW-11003 [C++][Dataset] Schema evolution in Dataset scanning
- Open