Uploaded image for project: 'Kudu'
  1. Kudu
  2. KUDU-2515

Implement Spark join optimization support

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 1.7.1
    • None
    • None
    • None

    Description

      At the time of writing, Spark is not able to properly optimize joins on Kudu tables because Kudu does not provide statistics for Spark to use to determine the optimal join strategy.

      It would be a big improvement to find some way to help Spark optimize joins between Kudu tables or between Kudu tables and Parquet-on-HDFS tables. 

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              mpercy Mike Percy
              Votes:
              1 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated: