Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-29966

avoid load table twice

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 3.0.0
    • 3.0.0
    • SQL
    • None

    Description

      Now resolve logic plan will load table twice which are in ResolveTables and ResolveRelations. The ResolveRelations is old code path, and ResolveTables is v2 code path, and the reason why load table twice is that ResolveTables will load table and rollback v1 table to ResolveRelations code path.
      The same scene also exists in ResolveSessionCatalog.

      It affect that execute command will cost double time than spark 2.4.

      Here is the idea that add a table version method in TableCatalog, and rules should always get table version firstly without load table.

      Attachments

        Issue Links

          Activity

            People

              imback82 Terry Kim
              ulysses XiDuo You
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: