Details
- Type: Improvement
- Status: Resolved
- Priority: Critical
- Resolution: Fixed
- Kudu_Impala
Description
Inserts into Kudu tables should be partitioned (i.e. rows hashed using the same hash partitioning as the Kudu table) and then sorted on the primary key at the table sink. This would significantly improve insert performance.
This will require a local sort (IMPALA-2521) and support from Kudu to provide the partitioning.
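The partition-then-sort idea described above can be illustrated with a minimal Python sketch. It is not Impala's or Kudu's implementation: the function `partition_and_sort` is hypothetical, and CRC32 is only a stand-in for Kudu's actual hash partitioning, which this issue says must be provided by Kudu itself.

```python
import zlib
from collections import defaultdict


def partition_and_sort(rows, num_partitions, key=lambda row: row[0]):
    """Group rows by a stand-in hash partition, then sort each group by primary key.

    Hypothetical helper for illustration only; `key` extracts the primary key
    from a row, and `num_partitions` plays the role of the table's hash buckets.
    """
    buckets = defaultdict(list)
    for row in rows:
        pk = key(row)
        # Assumption: CRC32 of the key's string form stands in for Kudu's hash.
        part = zlib.crc32(str(pk).encode("utf-8")) % num_partitions
        buckets[part].append(row)
    # The "local sort": each partition's batch is ordered by primary key
    # before it would be handed to the table sink.
    return {part: sorted(batch, key=key) for part, batch in buckets.items()}


rows = [(42, "d"), (7, "a"), (19, "c"), (3, "b")]
batches = partition_and_sort(rows, num_partitions=2)
```

Each value in `batches` holds only rows that hash to the same partition, already in primary-key order, which is the shape of input the description asks the table sink to receive.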
Issue Links
- breaks
  - IMPALA-5871 KuduPartitionExpr incorrectly handles its child types (Resolved)
  - IMPALA-5294 Kudu INSERT partitioning fails with constants (Resolved)
  - IMPALA-5611 KuduPartitionExpr holds onto memory unnecessarily (Resolved)
- is related to
  - IMPALA-2521 Introduce CLUSTERED plan hint for insert statements (Resolved)