[HIVE-18780] Improve schema discovery For Druid Storage Handler - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 3.0.0
Component/s: Druid integration
Labels:
None

Description

Currently, Druid Storage adapter issues a Segment metadata Query every time the query is of type Select or Scan. Not only that but then every input split (map) will do the same as well since it is using the same Serde, this is very expensive and put a lot of pressure on the Druid Cluster. The way to fix this is to add the schema out of the calcite plan instead of serializing the query itself as part of the Hive query context.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HIVE-18780.11.patch
23/Mar/18 01:04
471 kB
Slim Bouguerra
HIVE-18780.12.patch
23/Mar/18 04:23
471 kB
Slim Bouguerra
HIVE-18780.13.patch
24/Mar/18 03:05
489 kB
Slim Bouguerra
HIVE-18780.14.patch
24/Mar/18 14:15
489 kB
Slim Bouguerra
HIVE-18780.2.patch
20/Mar/18 03:57
436 kB
Slim Bouguerra
HIVE-18780.4.patch
20/Mar/18 22:24
447 kB
Slim Bouguerra
HIVE-18780.5.patch
20/Mar/18 22:31
450 kB
Slim Bouguerra
HIVE-18780.6.patch
20/Mar/18 23:26
450 kB
Slim Bouguerra
HIVE-18780.7.patch
21/Mar/18 15:10
470 kB
Slim Bouguerra
HIVE-18780.8.patch
22/Mar/18 16:33
472 kB
Slim Bouguerra
HIVE-18780.patch
19/Mar/18 23:01
419 kB
Slim Bouguerra
HIVE-18780.patch
19/Mar/18 23:00
419 kB
Slim Bouguerra

Issue Links

contains

HIVE-14518 Support 'having' translation for Druid GroupBy queries

Closed

HIVE-18993 Use Druid Expressions

Closed

relates to

HIVE-18957 Upgrade Calcite version to 1.16.0

Closed

Activity

People

Assignee:: Slim Bouguerra

Reporter:: Slim Bouguerra

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 23/Feb/18 03:30

Updated:: 22/May/18 23:16

Resolved:: 24/Mar/18 17:33