Hive / HIVE-17265

Cache merged column stats from retrieved partitions

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 3.0.0
    • Fix Version/s: 3.0.0
    • Component/s: Statistics
    • Labels: None

    Description

      Currently, when we retrieve stats from the metastore for a column in a partitioned table, we execute the logic that merges the per-partition column stats multiple times.
      Even though we avoid repeated calls to the metastore when the stats cache is enabled, merging the stats for a given column can still take a long time when the table has a large number of partitions.
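
      The idea in the title can be illustrated with a minimal sketch that memoizes the merged result, so the merge over all partitions runs once per (table, column, partition set) rather than on every lookup. This is not the code from the attached patches: the class MergedColumnStatsCache, the simplified ColStats type, and the choice of cache key are all hypothetical, and the sketch ignores invalidation when partition stats change.

```java
import java.util.Collection;
import java.util.HashMap;
import java.util.Map;
import java.util.Objects;
import java.util.TreeSet;

/** Hypothetical sketch: caches merged column stats so the merge runs only once per key. */
public class MergedColumnStatsCache {

    /** Simplified stand-in for per-partition column stats (not Hive's real stats classes). */
    public static final class ColStats {
        public final long numNulls;
        public final long min;
        public final long max;

        public ColStats(long numNulls, long min, long max) {
            this.numNulls = numNulls;
            this.min = min;
            this.max = max;
        }
    }

    /** Cache key (an assumption): table, column, and the sorted set of merged partitions. */
    private static final class Key {
        private final String table;
        private final String column;
        private final TreeSet<String> partitions;

        Key(String table, String column, Collection<String> partitions) {
            this.table = table;
            this.column = column;
            this.partitions = new TreeSet<>(partitions);
        }

        @Override
        public boolean equals(Object o) {
            if (this == o) return true;
            if (!(o instanceof Key)) return false;
            Key other = (Key) o;
            return table.equals(other.table)
                && column.equals(other.column)
                && partitions.equals(other.partitions);
        }

        @Override
        public int hashCode() {
            return Objects.hash(table, column, partitions);
        }
    }

    private final Map<Key, ColStats> cache = new HashMap<>();
    private int mergeCount = 0;

    /** Returns the merged stats, running the merge only on the first request for a key. */
    public ColStats getMerged(String table, String column, Map<String, ColStats> statsByPartition) {
        Key key = new Key(table, column, statsByPartition.keySet());
        ColStats cached = cache.get(key);
        if (cached != null) {
            return cached;
        }
        ColStats merged = merge(statsByPartition.values());
        cache.put(key, merged);
        return merged;
    }

    /** The expensive step: one pass over every partition's stats. */
    private ColStats merge(Collection<ColStats> parts) {
        mergeCount++;
        long nulls = 0;
        long min = Long.MAX_VALUE;
        long max = Long.MIN_VALUE;
        for (ColStats p : parts) {
            nulls += p.numNulls;
            min = Math.min(min, p.min);
            max = Math.max(max, p.max);
        }
        return new ColStats(nulls, min, max);
    }

    /** Number of times the merge actually ran (for illustration only). */
    public int getMergeCount() {
        return mergeCount;
    }
}
```

      In this sketch, a second call to getMerged with the same table, column, and partition names returns the stored object without touching the per-partition stats again, which is the saving the description is after when the partition count is large.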

      Attachments

        1. HIVE-17265.patch
          27 kB
          jcamachorodriguez
        2. HIVE-17265.02.patch
          36 kB
          jcamachorodriguez
        3. HIVE-17265.03.patch
          37 kB
          jcamachorodriguez
        4. HIVE-17265.04.patch
          62 kB
          jcamachorodriguez
        5. HIVE-17265.05.patch
          60 kB
          jcamachorodriguez

            People

              Assignee: Jesús Camacho Rodríguez (jcamacho)
              Reporter: Jesús Camacho Rodríguez (jcamacho)