Uploaded image for project: 'Apache Hudi'
  1. Apache Hudi
  2. HUDI-6828

After Insert overwrite into a non-empty partition/table with empty datasets, the subsequent query results are incorrect

    XMLWordPrintableJSON

Details

    Description

      After Insert overwrite into a non-empty partition/table with empty datasets, the subsequent query results are incorrect
      For example, current dataset of partition dt=2021-07-17 is

      id name dt
      1 'a' '2021-07-17'

      Using the following SQL to overwrite partition dt=2021-07-17

      insert overwrite table targetT partition(dt='2021-07-17') select id, name, price from sourceT where  dt=2021-07-17 and product='test'
      

      If the input dataset is empty. We expect after insert overwrite, the partition of dt=2021-07-17 is empty.
      However, the query result is still the old data.

      Attachments

        Issue Links

          Activity

            People

              jingzhang Jing Zhang
              jingzhang Jing Zhang
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: