Details
- New Feature
- Status: Resolved
- Minor
- Resolution: Fixed
- Product Backlog
- All supported environments
Description
Hi,
Hive comes with some useful built-in UDFs to process JSON objects.
https://cwiki.apache.org/confluence/display/Hive/LanguageManual+UDF
Namely:
- get_json_object
- json_tuple
To make Impala and Hive tables and queries more interchangeable, I am proposing porting these UDFs to be part of Impala's built-in functions.
Example
Consider the following table raw_log
action | parameters |
---|---|
search | {"keyword":"hotel"} |
visit | {"url":"http://example.com"} |
...and the following query:
SELECT get_json_object(parameters, "$.keyword") AS keyword FROM raw_log WHERE action='search';
The query should return the following results:
keyword |
---|
hotel |
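
To make the expected semantics concrete, here is a minimal Python sketch of how the example above would evaluate. It is not Hive's or Impala's implementation: the helper below is a hypothetical emulation that handles only simple dotted paths such as `$.keyword` (no array indexes or wildcards), and it assumes the Hive-documented behavior of returning NULL (here `None`) when the JSON is invalid or the path does not match.

```python
import json

def get_json_object(json_text, path):
    """Hypothetical emulation of get_json_object for simple `$.a.b` paths only."""
    if json_text is None or not path.startswith("$"):
        return None
    try:
        node = json.loads(json_text)
    except (TypeError, ValueError):
        # Invalid JSON yields NULL rather than an error (assumed, per the Hive docs).
        return None
    # Walk each dotted key after the leading "$".
    for key in [k for k in path[1:].split(".") if k]:
        if not isinstance(node, dict) or key not in node:
            return None
        node = node[key]
    if node is None:
        return None
    # Strings are returned as-is; other values are re-serialized as JSON text.
    return node if isinstance(node, str) else json.dumps(node)

# The raw_log table from the example, as (action, parameters) rows.
raw_log = [
    ("search", '{"keyword":"hotel"}'),
    ("visit", '{"url":"http://example.com"}'),
]

# Equivalent of: SELECT get_json_object(parameters, "$.keyword") ... WHERE action='search'
keywords = [
    get_json_object(params, "$.keyword")
    for action, params in raw_log
    if action == "search"
]
print(keywords)  # ['hotel']
```

Rows whose JSON lacks the requested key (for example the `visit` row) would produce NULL instead of failing the query, which is what makes the function practical on heterogeneous log data.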
Issue Links
- is blocked by: IMPALA-7364 Upgrade RapidJson to the latest version (Resolved)
- is related to: IMPALA-7609 Support ANSI SQL JSON functions (Open)
There are no Sub-Tasks for this issue.