Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-7450

Improve performance for ANALYZE command

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 1.17.0
    • Fix Version/s: 1.17.0
    • Component/s: None
    • Labels:

      Description

      In the scope of DRILL-7273 was introduced ANALYZE command for collecting metadata and storing it to the metastore.
      But current implementation uses too much memory and is low-performant. It uses stream aggregate for collecting metadata, but all incoming data should be sorted before producing the aggregation. Memory usage may be reduced by using hash aggregate, so sort may not be produced and it should increase performance.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                volodymyr Vova Vysotskyi
                Reporter:
                volodymyr Vova Vysotskyi
                Reviewer:
                Arina Ielchiieva
              • Votes:
                0 Vote for this issue
                Watchers:
                2 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: