Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-31518

Expose filterByRange in JavaPairRDD

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 3.1.0
    • 3.1.0
    • Spark Core
    • None

    Description

      The `filterByRange` method makes it possible to efficiently filter a sorted RDD using bounds on its keys. It prunes out partitions instead of scanning them if a RangePartitioner is available in the RDD.

      This method is part of the Scala API, defined in OrderedRDDFunctions, but is not exposed in the Java API as far as I can tell. All other methods defined in OrderedRDDFunctions are exposed in JavaPairRDD, therefore it seems natural to expose `filterByRange` there too.

      Attachments

        Activity

          People

            pintoch Antonin Delpeuch
            pintoch Antonin Delpeuch
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: