Details
- Type: Improvement
- Status: Open
- Priority: Major
- Resolution: Unresolved
Description
The current implementation does a lot of CPU-intensive work in HiveServer2 (HS2): it deserializes the temporary results written to disk and then serializes them into Thrift objects. This adds to the latency of fetching results through HS2. Moving that work to the task nodes would parallelize and scale it, reducing the time it takes to retrieve large results.
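A minimal sketch of the proposed split, not Hive code. It assumes each task holds one partition of result rows and encodes it into a wire-ready byte blob, so the server only forwards blobs. The class `TaskSideSerializationSketch`, its methods, and the length-prefixed encoding are all hypothetical stand-ins for the real Thrift serialization; a thread pool stands in for the task nodes.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class TaskSideSerializationSketch {

    // Hypothetical wire encoding done on a "task node":
    // row count, then each row as a length-prefixed UTF-8 string.
    static byte[] serializePartition(List<String> rows) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(bytes)) {
            out.writeInt(rows.size());
            for (String row : rows) {
                byte[] utf8 = row.getBytes(StandardCharsets.UTF_8);
                out.writeInt(utf8.length);
                out.write(utf8);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bytes.toByteArray();
    }

    // Each partition is serialized by its own task, in parallel.
    // The returned blobs keep the order of the input partitions.
    static List<byte[]> serializeOnTasks(List<List<String>> partitions, int parallelism)
            throws Exception {
        ExecutorService tasks = Executors.newFixedThreadPool(parallelism);
        try {
            List<Future<byte[]>> pending = new ArrayList<>();
            for (List<String> partition : partitions) {
                Callable<byte[]> job = () -> serializePartition(partition);
                pending.add(tasks.submit(job));
            }
            List<byte[]> blobs = new ArrayList<>();
            for (Future<byte[]> result : pending) {
                blobs.add(result.get());
            }
            return blobs;
        } finally {
            tasks.shutdown();
        }
    }

    // Client-side decoding, included only to check the round trip.
    static List<String> decode(byte[] blob) {
        try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(blob))) {
            int count = in.readInt();
            List<String> rows = new ArrayList<>(count);
            for (int i = 0; i < count; i++) {
                byte[] utf8 = new byte[in.readInt()];
                in.readFully(utf8);
                rows.add(new String(utf8, StandardCharsets.UTF_8));
            }
            return rows;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) throws Exception {
        List<List<String>> partitions = List.of(List.of("r1", "r2"), List.of("r3"));
        // The "server" receives blobs that are already serialized and forwards them as-is.
        for (byte[] blob : serializeOnTasks(partitions, 2)) {
            System.out.println(decode(blob));
        }
    }
}
```

In this sketch the server-side loop does no per-row work, which is the point of the proposal: the per-row encoding cost is spread across tasks instead of being paid in a single HS2 process.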
Issue Links
- relates to HIVE-14549 HiveServer2: Improve HiveServer2 JDBC/ODBC ResultSet performance - part2 (Open)