Details
- Type: Improvement
- Status: Resolved
- Priority: Major
- Resolution: Fixed
- 1.4.0
- None
Description
The Catalyst-row-to-Scala-row converters in CatalystTypeConverters read column values through the generic Row.get() call rather than through type-specific accessor methods. If we refactor the internal converter interfaces slightly, we can pass the row and column number into the converter function and let it do its own type-specific field extraction, similar to what we do in UnsafeRowConverter. This blocks unit testing of the new operators that I'm developing as part of Project Tungsten, since those operators may output UnsafeRow instances, which don't support the generic get().
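A minimal sketch of the proposed interface shape, to make the idea concrete. It does not use Spark's actual classes: SketchRow, GenericSketchRow, TypedOnlySketchRow, SketchConverter and SketchConverters are all hypothetical stand-ins, and the real converter interfaces in CatalystTypeConverters may look different. The point is only that a converter which receives the row and the column number can call a type-specific accessor itself, so it keeps working on rows that reject the generic get().

```scala
// Hypothetical stand-in for a Catalyst row; not Spark's real Row/InternalRow API.
trait SketchRow {
  def isNullAt(i: Int): Boolean
  def get(i: Int): Any            // generic accessor
  def getInt(i: Int): Int         // type-specific accessors
  def getLong(i: Int): Long
  def getString(i: Int): String
}

// A row backed by boxed values that supports both styles of access.
final class GenericSketchRow(values: Array[Any]) extends SketchRow {
  def isNullAt(i: Int): Boolean = values(i) == null
  def get(i: Int): Any = values(i)
  def getInt(i: Int): Int = values(i).asInstanceOf[Int]
  def getLong(i: Int): Long = values(i).asInstanceOf[Long]
  def getString(i: Int): String = values(i).asInstanceOf[String]
}

// Assumption: mimics a row that, like the UnsafeRow described above,
// offers typed accessors but does not support the generic get().
final class TypedOnlySketchRow(values: Array[Any]) extends SketchRow {
  def isNullAt(i: Int): Boolean = values(i) == null
  def get(i: Int): Any =
    throw new UnsupportedOperationException("generic get() is not supported")
  def getInt(i: Int): Int = values(i).asInstanceOf[Int]
  def getLong(i: Int): Long = values(i).asInstanceOf[Long]
  def getString(i: Int): String = values(i).asInstanceOf[String]
}

// The converter receives the row and the column number and does its own
// type-specific extraction, instead of being handed a value from get().
trait SketchConverter {
  protected def extract(row: SketchRow, column: Int): Any
  final def toScala(row: SketchRow, column: Int): Any =
    if (row.isNullAt(column)) null else extract(row, column)
}

object IntSketchConverter extends SketchConverter {
  protected def extract(row: SketchRow, column: Int): Any = row.getInt(column)
}

object LongSketchConverter extends SketchConverter {
  protected def extract(row: SketchRow, column: Int): Any = row.getLong(column)
}

object StringSketchConverter extends SketchConverter {
  protected def extract(row: SketchRow, column: Int): Any = row.getString(column)
}

object SketchConverters {
  // Applies the i-th converter to the i-th column; never calls row.get().
  def convertRow(row: SketchRow, converters: Seq[SketchConverter]): Seq[Any] =
    converters.zipWithIndex.map { case (converter, i) => converter.toScala(row, i) }
}
```

With this shape, SketchConverters.convertRow can be called on a TypedOnlySketchRow without ever touching get(), whereas a converter that takes an already-extracted value would force its caller to call get() first.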
Issue Links
- relates to
  - SPARK-6620 Speed up toDF() and rdd() functions by constructing converters in ScalaReflection (Resolved)
- links to