Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-10899

Support JDBC pushdown for additional commands

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Minor
    • Resolution: Duplicate
    • 1.5.0
    • None
    • SQL
    • None

    Description

      Spark DataFrames currently support predicate push-down with JDBC sources but term predicate is used in a strict SQL meaning. It means it covers only WHERE clause. Moreover it looks like it is limited to the logical conjunction (no IN and OR I am afraid) and simple predicates.

      This creates a situation where a simple query such as "select * from table limit 100" could easily result in the database being overloaded when the table is large.

      This feature request is to expand the support for push-downs to additional SQL commands:
      -LIMIT
      -WHERE IN
      -WHERE NOT IN
      -GROUP BY

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              MrCabbage Victor May
              Votes:
              2 Vote for this issue
              Watchers:
              8 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: