Hive
  1. Hive
  2. HIVE-2026

Parallelize UpdateInputAccessTimeHook

    Details

    • Type: Improvement Improvement
    • Status: Open
    • Priority: Major Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None
    • Labels:
      None

      Description

      UpdateInputAccessTimeHook is usually used as a pre-execution hook to update the metastore's lastAccessTime field of input partition/table. If a query touches a large number of partitions, this hooks takes a long time to execute. One approach is to make the hook itself to run in a separate thread. But it is hard to guarantee backward compatibility in semantics in case of exceptions encountered in the hook execution. This task takes another approach to parallelize the hook itself (update multiple partitions concurrently), but execute each pre-hook in sequential order.

      1. HIVE-2026_2.patch
        22 kB
        MIS
      2. HIVE-2026.patch
        22 kB
        Ning Zhang

        Activity

        Ning Zhang created issue -
        Ning Zhang made changes -
        Field Original Value New Value
        Attachment HIVE-2026.patch [ 12472515 ]
        Ning Zhang made changes -
        Status Open [ 1 ] Patch Available [ 10002 ]
        MIS made changes -
        Attachment HIVE-2026_2.patch [ 12472651 ]
        Namit Jain made changes -
        Status Patch Available [ 10002 ] Open [ 1 ]

          People

          • Assignee:
            Ning Zhang
            Reporter:
            Ning Zhang
          • Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

            • Created:
              Updated:

              Development