[HIVE-12427] HiveServer2: Improve HiveServer2 JDBC/ODBC ResultSet performance - part1

Details

Type: Improvement
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: HiveServer2, JDBC, ODBC
Labels: None

Description

In the current implementation, HiveServer2 (HS2) does a lot of CPU-intensive work when returning results: it deserializes the temporary results written to disk and then serializes them into Thrift objects. This adds to the latency of fetching results via HS2. Doing that work in the task nodes instead would run it in parallel and let it scale, reducing the time it takes to retrieve large result sets.
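
The idea can be illustrated with a small toy model. The sketch below is not Hive code: the class name, the method names, and the use of plain UTF-8 bytes as a stand-in for Thrift serialization are all assumptions made for illustration. It only shows that serializing each batch where it is produced, in parallel, yields the same payloads as serializing every batch in a single server loop.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

public class TaskSideSerializationSketch {

    // Hypothetical stand-in for Thrift serialization of one batch of rows.
    public static byte[] serializeBatch(List<String> rows) {
        return String.join("\n", rows).getBytes(StandardCharsets.UTF_8);
    }

    // Models the current behavior: the server serializes every batch itself,
    // one after another, on a single thread.
    public static List<byte[]> serializeOnServer(List<List<String>> batches) {
        List<byte[]> out = new ArrayList<>();
        for (List<String> batch : batches) {
            out.add(serializeBatch(batch));
        }
        return out;
    }

    // Models the proposal: each "task" serializes its own batch in parallel,
    // and the server only forwards the already-serialized bytes.
    public static List<byte[]> serializeInTasks(List<List<String>> batches) {
        return batches.parallelStream()
                .map(TaskSideSerializationSketch::serializeBatch)
                .collect(Collectors.toList()); // encounter order is preserved
    }

    public static void main(String[] args) {
        List<List<String>> batches = List.of(
                List.of("1,a", "2,b"),
                List.of("3,c"));

        List<byte[]> serverSide = serializeOnServer(batches);
        List<byte[]> taskSide = serializeInTasks(batches);

        boolean identical = serverSide.size() == taskSide.size();
        for (int i = 0; identical && i < serverSide.size(); i++) {
            identical = Arrays.equals(serverSide.get(i), taskSide.get(i));
        }
        System.out.println("identical payloads: " + identical);
    }
}
```

In this model the server's remaining work is just handing over byte arrays, which is the part of the proposal expected to cut HS2 CPU time. Real gains would depend on the cluster and the result size.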

Issue Links

relates to

HIVE-14549 HiveServer2: Improve HiveServer2 JDBC/ODBC ResultSet performance - part2 (Open)

Sub-Tasks

1.	HiveServer2: Refactor/repackage HiveServer2's Thrift code so that it can be used in the tasks	Closed	Rohit Dholakia
2.	HiveServer2: Provide an option for HiveServer2 to stream serialized thrift results when they are available	Resolved	Vaibhav Gumashta
3.	HiveServer2: Provide an option to write serialized thrift objects in final tasks	Closed	Rohit Dholakia
4.	Empty resultset run into Exception when using Thrift Binary Serde	Closed	Ziyang Zhao
5.	bump a new api version for ThriftJDBCBinarySerde changes	Closed	Ziyang Zhao
6.	Executing join query on type Float using Thrift Serde will result in Float cast to Double error	Closed	Ziyang Zhao
7.	Update protocol version in TOpenSessionReq and TOpenSessionResp	Closed	Ziyang Zhao
8.	HiveServer2: Make the usage of server with JDBC Thrift serde enabled, backward compatible for older clients	Closed	Ziyang Zhao
9.	HiveServer2: Evaluate if ThriftJDBCBinarySerde should implement VectorizedSerde	Resolved	Ziyang Zhao
10.	HiveServer2: Performance instrumentation for HIVE-12049 (serializing thrift ResultSets in tasks)	Closed	Ziyang Zhao
11.	HiveServer2: Cleanup code which checks for ThriftJDBCSerde usage	Closed	Ziyang Zhao
12.	HiveServer2: Use user supplied fetch size to determine #rows serialized in tasks	Resolved	Norris Lee
13.	HiveServer2: enable ThriftJDBCBinarySerde use by default	Patch Available	Ziyang Zhao

People

Assignee: Unassigned

Reporter: Vaibhav Gumashta

Votes: 0

Watchers: 12

Dates

Created: 16/Nov/15 23:28

Updated: 16/Aug/16 19:33