Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-28596

Skip ColumnAccessInfo collection when not needed to speed-up compilation

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 4.1.0
    • None
    • Security Level: Public (Viewable by anyone)

    Description

      Avoid paying the perf-overhead of collecting ColumnAccessInfo via the HiveRelFieldTrimmer when it is not necessary.

      Currently, we only use the ColumnAccessInfo when we need to perform column based authorization or when it is requested explicitly by the user via the hive.stats.collect.scancols property.

      The field trimmer is an expensive operation and can become pretty slow for certain use-cases/queries so skipping some calls when it is not necessary is in general beneficial and can notably improve the compilation time for some queries.

      Attachments

        Issue Links

          Activity

            People

              zabetak Stamatis Zampetakis
              zabetak Stamatis Zampetakis
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: