Details
Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Description
By default, Drill currently fails to read JSON files whose schema changes between records, or whose lists contain values of several different data types. These files can be read, however, if the store.json.all_text_mode option is set to true. With this option enabled, Drill reads all data from the JSON files as varchar, and the data can then be cast to an appropriate type (or filtered out where a cast to numeric is desired but some of the values in the column are strings or booleans). The option is mentioned on the wiki only on the page below, which does not describe all of the use cases of the feature. https://cwiki.apache.org/confluence/display/DRILL/Analyzing+Yelp+JSON+Data+with+Apache+Drill
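For documentation purposes, the read-as-text-then-cast pattern described above can be illustrated with a small Python sketch. This is not Drill code and does not use any Drill API; the helper names read_all_text and cast_or_none and the sample records are hypothetical, chosen only to mimic the behavior: every scalar is first read as a string, and a later cast either succeeds or yields nothing so the value can be filtered out.

```python
import json


def read_all_text(lines):
    """Parse newline-delimited JSON and turn every scalar into a string.

    Hypothetical helper that imitates the idea of an all-text read mode;
    it is an illustration, not how Drill implements the option.
    """
    def to_text(value):
        if value is None:
            return None
        if isinstance(value, bool):
            # Keep JSON spelling for booleans rather than Python's "True"/"False".
            return "true" if value else "false"
        if isinstance(value, dict):
            return {key: to_text(item) for key, item in value.items()}
        if isinstance(value, list):
            return [to_text(item) for item in value]
        return str(value)

    return [to_text(json.loads(line)) for line in lines if line.strip()]


def cast_or_none(text, caster=float):
    """Cast a text value, returning None when the cast is not possible."""
    try:
        return caster(text)
    except (TypeError, ValueError):
        return None


# Sample records (made up) whose "score" field changes type between rows.
sample_lines = [
    '{"id": 1, "score": 4.5}',
    '{"id": 2, "score": "n/a"}',
    '{"id": 3, "score": true}',
    '{"id": 4, "score": [1, "two", false]}',
]

rows = read_all_text(sample_lines)

# Cast after reading, then keep only the rows where the numeric cast worked.
numeric_rows = [
    (row["id"], cast_or_none(row["score"]))
    for row in rows
    if cast_or_none(row["score"]) is not None
]
```

In this sketch the mixed-type list in the last record survives the read as a list of strings, and only the first record remains after the numeric cast and filter.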