Details
Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
3.1.0
None
Description
The `filterByRange` method efficiently filters a sorted RDD using lower and upper bounds on its keys. If the RDD is partitioned by a RangePartitioner, it prunes the partitions whose key range cannot overlap the bounds instead of scanning them.
This method is part of the Scala API, defined in OrderedRDDFunctions, but is not exposed in the Java API as far as I can tell. All other methods defined in OrderedRDDFunctions are exposed in JavaPairRDD, so it seems natural to expose `filterByRange` there too.
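
To illustrate why pruning helps, here is a minimal plain-Java sketch of the idea. It does not use Spark and is not Spark's implementation: `RangePruningSketch`, `Partition`, and `lastScanned` are hypothetical names, keys are plain ints, and both bounds are assumed to be inclusive.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical model of range-partition pruning; this is not Spark code.
class RangePruningSketch {

    // A partition whose keys all lie within [minKey, maxKey].
    static final class Partition {
        final int minKey;
        final int maxKey;
        final int[] keys;

        Partition(int minKey, int maxKey, int... keys) {
            this.minKey = minKey;
            this.maxKey = maxKey;
            this.keys = keys;
        }
    }

    // Number of partitions scanned by the most recent filterByRange call.
    static int lastScanned = 0;

    // Returns the keys in [lower, upper] (both inclusive), skipping every
    // partition whose key range cannot overlap the requested bounds.
    static List<Integer> filterByRange(List<Partition> partitions, int lower, int upper) {
        lastScanned = 0;
        List<Integer> result = new ArrayList<>();
        for (Partition p : partitions) {
            if (p.maxKey < lower || p.minKey > upper) {
                continue; // pruned: no key in this partition can match
            }
            lastScanned++;
            for (int k : p.keys) {
                if (k >= lower && k <= upper) {
                    result.add(k);
                }
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<Partition> parts = Arrays.asList(
            new Partition(0, 9, 1, 4, 9),
            new Partition(10, 19, 10, 15),
            new Partition(20, 29, 22, 27));

        System.out.println(filterByRange(parts, 8, 15));            // prints [9, 10, 15]
        System.out.println("partitions scanned: " + lastScanned);   // prints partitions scanned: 2
    }
}
```

In this model the third partition is never read because its smallest key is already above the upper bound. Without range information on the partitions, every partition would have to be scanned.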