Details
Improvement
Status: Resolved
Major
Resolution: Fixed
1.17.0
None
None
Description
The Drill Metastore infers a table's schema while gathering statistics. Unfortunately, this creates a chicken-and-egg problem: some files need a schema from the Metastore precisely because their own schema is ambiguous, and such data will not even scan correctly without it. The classic JSON example:
{a: 10} {a: 10.1}
In these cases, the user should first define the table schema and then run the ANALYZE TABLE commands. Drill should then not attempt to change the declared type information while gathering statistics, since the actual data is ambiguous.
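To make the ambiguity concrete, here is a minimal Python sketch of the problem. It is not Drill code: the helper names (`infer_type`, `infer_schema`, `read_with_schema`) and the type labels are hypothetical and chosen only for illustration. The keys are quoted here so that the records are strict JSON. Inference driven by the data sees two different types for column `a`, while a schema declared up front gives every record the same type.

```python
import json


def infer_type(value):
    """Map a parsed JSON value to an illustrative SQL-like type label."""
    # bool must be checked before int, because bool is a subclass of int.
    if isinstance(value, bool):
        return "BOOLEAN"
    if isinstance(value, int):
        return "BIGINT"
    if isinstance(value, float):
        return "DOUBLE"
    if isinstance(value, str):
        return "VARCHAR"
    return "UNKNOWN"


def infer_schema(lines):
    """Return a mapping of column name to the set of types seen in the data."""
    seen = {}
    for line in lines:
        for column, value in json.loads(line).items():
            seen.setdefault(column, set()).add(infer_type(value))
    return seen


def read_with_schema(lines, schema):
    """Read records, casting each column to the type declared by the user."""
    casts = {"BIGINT": int, "DOUBLE": float, "VARCHAR": str}
    rows = []
    for line in lines:
        record = json.loads(line)
        rows.append(
            {col: casts[schema[col]](record[col]) for col in schema if col in record}
        )
    return rows


records = ['{"a": 10}', '{"a": 10.1}']

# Inference from the data alone: column "a" has two conflicting types.
inferred = infer_schema(records)

# Schema declared up front (hypothetical): every value is read as DOUBLE.
declared = {"a": "DOUBLE"}
rows = read_with_schema(records, declared)
```

With only the data to go on, `inferred` holds two candidate types for `a`, so any single inferred type is a guess. With the declared schema, both records are read as floating-point values, which is why the declared type should take precedence over anything inferred while gathering statistics.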