Uploaded image for project: 'Apache Hudi'
  1. Apache Hudi
  2. HUDI-8037

Partition query for transformed value incorrectly prunes valid partitions

    XMLWordPrintableJSON

Details

    Description

      With timestamp keygen you can have a partition column with timestamps, but then use the keygen so it will create partitions based on days so that all records that have a timestamp on 7-31-2024 will go to the same parititon even though the values in the partition column differ by hours and minutes etc.

      This causes a problem with partition pruning. lets say you query "select * from table where partition < 7-31-2024 at 7am and partition > 7-31-2024 at 6am ". Since the file structure has the partition of just 7-31-2024, that will be interpreted as 7-31-2024 at 12am. So the partition will be pruned from the search space.

      Attachments

        Issue Links

          Activity

            People

              jonvex Jonathan Vexler
              jonvex Jonathan Vexler
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated: