Hive / HIVE-23768

Metastore's update service wrongly strips partition column stats from the cache

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 4.0.0
    • Component/s: None

      Description

      Metastore's update service wrongly strips partition column stats from the cache in an attempt to update them. The issue may go unnoticed since missing stats do not lead to query failures.

      However, missing stats can significantly alter the query plan and thereby affect performance. They also cause flakiness: sometimes the stats are present and sometimes they are not, so the same query can end up with a different plan over time.

      Normally, elements missing from the cache shouldn't be a correctness problem, since we can always fall back to the raw stats. Unfortunately, there are many interconnections with other parts of the code (e.g., the code that obtains aggregate statistics) where this contract breaks.
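      The fallback contract described above can be illustrated with a minimal sketch. Everything in it is hypothetical: `RawStore`, `CachedStore`, `getWithFallback`, `getCacheOnly`, and the string keys are stand-ins invented for illustration and do not reflect the actual Hive metastore classes or the fix applied for this issue. It only shows why a read path that trusts the cache alone loses stats once an entry has been stripped, while a read path that falls back to the raw store does not.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;

public class StatsCacheSketch {

    /** Hypothetical stand-in for the raw (database-backed) stats store. */
    public static final class RawStore {
        private final Map<String, Long> stats = new HashMap<>();

        public void put(String partitionColumn, long distinctValues) {
            stats.put(partitionColumn, distinctValues);
        }

        public Optional<Long> get(String partitionColumn) {
            return Optional.ofNullable(stats.get(partitionColumn));
        }
    }

    /** Hypothetical cache sitting in front of the raw store. */
    public static final class CachedStore {
        private final RawStore raw;
        private final Map<String, Long> cache = new HashMap<>();

        public CachedStore(RawStore raw) {
            this.raw = raw;
        }

        /** Copies an entry from the raw store into the cache, if it exists. */
        public void populate(String key) {
            raw.get(key).ifPresent(value -> cache.put(key, value));
        }

        /** Simulates the update service stripping an entry from the cache. */
        public void evict(String key) {
            cache.remove(key);
        }

        /** Honors the contract: a cache miss falls back to the raw stats. */
        public Optional<Long> getWithFallback(String key) {
            Long cached = cache.get(key);
            return cached != null ? Optional.of(cached) : raw.get(key);
        }

        /** Breaks the contract: treats the cache as complete. */
        public Optional<Long> getCacheOnly(String key) {
            return Optional.ofNullable(cache.get(key));
        }
    }
}
```

      In this sketch, after `evict` is called for a key that still exists in the raw store, `getWithFallback` still returns the stats whereas `getCacheOnly` reports them as missing, which mirrors the kind of inconsistency the description attributes to code paths such as aggregate statistics.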

              People

              • Assignee: Stamatis Zampetakis (zabetak)
              • Reporter: Stamatis Zampetakis (zabetak)

                  Time Tracking

                  • Original Estimate: Not Specified
                  • Remaining Estimate: 0h
                  • Time Spent: 10m