Details
-
Improvement
-
Status: Resolved
-
Critical
-
Resolution: Done
-
1.1.0
-
None
-
None
Description
For the underlying storage layer it would simplify things such as schema conversions, predicate filter determination and such to record in the data type itself whether a column can be nullable. So the DataType type could look like like this:
abstract class DataType(nullable: Boolean = true)
Concrete subclasses could then override the nullable val. Mostly this could be left as the default but when types can be contained in nested types one could optimize for, e.g., arrays with elements that are nullable and those that are not.
Attachments
Issue Links
- blocks
-
SPARK-2179 Public API for DataTypes and Schema
- Resolved