Details
-
Bug
-
Status: Open
-
Minor
-
Resolution: Unresolved
-
None
-
None
-
None
Description
Sometimes Hive tables contain tabs and/or other characters that could easily be misinterpreted by a transformer as a delimiter. This can break many TRANSFORM queries.
The solution is to have a ROW FORMAT semantics that can be attached to an individual TRANSFORM instance. It would have the same semantics as table creation, but during serialization it would ensure that any formal delimiter characters that did not indicate an actual break between columns would be escaped.
At the very least, it is a bug that TRANSFORM statement deserialization does not backslash out literal tabs in the current implementation.