IMPALA-6901

Investigate memory consumption of table metrics in the catalog

Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: Impala 3.0, Impala 2.12.0
    • Fix Version/s: None
    • Component/s: Catalog
    • Labels: None

    Description

      IMPALA-4886 introduced the concept of per-table metrics. In some cases (e.g. a catalog with 90K tables), table metrics have been reported to consume almost 30% of the catalog heap. We should perform the following actions:

      1. Measure the impact of table metrics on memory usage.
      2. Tune/optimize table metrics to reduce their memory requirements. Some quick fixes/ideas may include: a) replace/tune histogram-based metrics with simpler ones, b) eliminate metrics that are not considered particularly useful, c) store detailed metrics only for "interesting" tables (e.g. large tables that are heavily used).
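
      Idea (c) above could be sketched roughly as follows. This is only an illustration of the tiering approach, not Impala's catalog code: the class names (`TableMetricsRegistry`, `TableMetrics`, `DetailedMetrics`), the `promote_after` threshold, and the use of access count as the test for "interesting" are all assumptions made for the example. Every table keeps a cheap counter, and the heavier per-sample structure is allocated only once a table has been accessed often enough.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class DetailedMetrics:
    """Hypothetical heavyweight metrics: retains individual load-time samples."""
    load_time_samples_ms: List[float] = field(default_factory=list)


@dataclass
class TableMetrics:
    """Hypothetical per-table holder; not Impala's actual metrics class."""
    access_count: int = 0                       # cheap counter kept for every table
    detailed: Optional[DetailedMetrics] = None  # allocated only for "interesting" tables


class TableMetricsRegistry:
    """Keeps cheap metrics for all tables and detailed metrics for hot ones."""

    def __init__(self, promote_after: int) -> None:
        # Assumed policy: a table becomes "interesting" after this many accesses.
        self.promote_after = promote_after
        self._tables: Dict[str, TableMetrics] = {}

    def record_access(self, table: str, load_time_ms: float) -> None:
        metrics = self._tables.setdefault(table, TableMetrics())
        metrics.access_count += 1
        # Lazily allocate the detailed structure once the threshold is reached.
        if metrics.detailed is None and metrics.access_count >= self.promote_after:
            metrics.detailed = DetailedMetrics()
        if metrics.detailed is not None:
            metrics.detailed.load_time_samples_ms.append(load_time_ms)

    def get(self, table: str) -> Optional[TableMetrics]:
        return self._tables.get(table)

    def detailed_table_count(self) -> int:
        return sum(1 for m in self._tables.values() if m.detailed is not None)


if __name__ == "__main__":
    registry = TableMetricsRegistry(promote_after=3)
    for i in range(5):
        registry.record_access("db.hot_table", 10.0 + i)
    registry.record_access("db.cold_table", 7.0)
    print(registry.detailed_table_count())
```

      With a policy like this, the per-table cost for rarely used tables is a single counter, at the price of losing detailed samples recorded before a table is promoted. A real implementation would also need a demotion or eviction policy, which this sketch omits.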

          People

            Assignee: Unassigned
            Reporter: Dimitris Tsirogiannis (dtsirogiannis)
            Votes: 0
            Watchers: 5
