Uploaded image for project: 'Apache Hudi'
  1. Apache Hudi
  2. HUDI-25

Faster Incremental queries on Hoodie #492

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Unresolved
    • None
    • 0.5.1
    • hive

    Description

      Hive Incremental queries on Hoodie currently suffer a limitation of listing all partitions when a datestr is not present (lists .hoodie and the partitions) and end up throwing away a lot of the files (since `_hoodie_commit_time` column values filters out those files) . This can be very expensive and can impact query planning time and sometime causes timeouts as well if the table is large. The original issue is tracked here - https://github.com/uber/hudi/issues/492

      Attachments

        Issue Links

          Activity

            People

              bhavanisudha Bhavani Sudha
              vinoth Vinoth Chandar
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 20m
                  20m