Details
-
Task
-
Status: Open
-
Minor
-
Resolution: Unresolved
-
3.5.2
-
None
Description
This task involves adding a configurable flag to control the inference of time-only columns during schema detection in Spark. By default, Spark converts time-only columns to Timestamp type, which can lead to unintended behavior in certain use cases. This new flag will allow users to specify whether time-only columns should be inferred as Timestamp or as String.
Key Changes:
- Introduce a flag (e.g., inferStringTypeForTimeOnlyColumn).
- When the flag is set to true, time-only columns will be inferred as String.
- When the flag is set to false (default), time-only columns will be inferred as Timestamp.
- Update documentation to reflect the new option.
- Ensure backward compatibility by defaulting to the current behavior.
Attachments
Issue Links
- links to