Details
- Improvement
- Status: Resolved
- Major
- Resolution: Fixed
- 4.8.0
- None
- None
Description
The core MapReduce classes org.apache.phoenix.mapreduce.PhoenixInputSplit and org.apache.phoenix.mapreduce.PhoenixInputFormat don't provide region size or location information. Without it, the execution engine (MR, Spark, etc.) cannot schedule splits for data locality and ends up assigning them to nodes at random.
Interestingly, the phoenix-hive module has reimplemented these classes, including the node-aware functionality. We should port a subset of those changes back to the core code so that other engines can make use of them.
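For illustration only, here is a minimal sketch of what a split needs to expose for node-aware scheduling. It is not the phoenix-hive or core Phoenix implementation. The class name LocationAwareSplit and its fields are hypothetical, and the class is a dependency-free stand-in. In a real job it would extend org.apache.hadoop.mapreduce.InputSplit, whose getLength() and getLocations() methods are the ones schedulers consult.

```java
import java.util.Arrays;

/**
 * Hypothetical stand-in for a location-aware split (not Phoenix code).
 * A real split would extend org.apache.hadoop.mapreduce.InputSplit and
 * override getLength() and getLocations() in the same way.
 */
final class LocationAwareSplit {
    // Host name of the region server holding the region (assumed to be known).
    private final String regionLocation;
    // Estimated size of the region in bytes (assumed to be known).
    private final long regionSizeBytes;

    LocationAwareSplit(String regionLocation, long regionSizeBytes) {
        this.regionLocation = regionLocation;
        this.regionSizeBytes = regionSizeBytes;
    }

    /** Size hint the engine can use to order or balance splits. */
    long getLength() {
        return regionSizeBytes;
    }

    /** Preferred hosts; an empty array means "no locality preference". */
    String[] getLocations() {
        return regionLocation == null
                ? new String[0]
                : new String[] { regionLocation };
    }
}

final class SplitDemo {
    public static void main(String[] args) {
        // Example host name and size are made up for the demonstration.
        LocationAwareSplit split = new LocationAwareSplit("rs1.example.com", 1024L);
        System.out.println(split.getLength());
        System.out.println(Arrays.toString(split.getLocations()));
    }
}
```

A split that returns 0 from getLength() and an empty array from getLocations() gives the engine nothing to schedule on, which is the situation the description reports.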
Attachments
Issue Links
- causes: PHOENIX-4489 HBase Connection leak in Phoenix MR Jobs (Resolved)