Uploaded image for project: 'Apache Hudi'
  1. Apache Hudi
  2. HUDI-3338

Use custom relation instead of HadoopFsRelation

    XMLWordPrintableJSON

Details

    Description

      For HUDI-3204, COW table and MOR table in read_optimized query mode should return the 'yyyy-MM-dd' format of origin `data_date`, not yyyy/MM/dd''.

      And the reason for that is because Hudi use HadoopFsRelation for the snapshot query mode of cow and the read_optimized query mode of mor.

      Spark HadoopFsRelation will append the partition value of the real partition path. However, different from the normal table, Hudi will persist the partition value in the parquet file. So we just need read the partition value from the parquet file, not leave it to spark.

      So we should not use `HadoopFsRelation` any more, and implement Hudi own `Relation` to deal with it.

      Attachments

        Issue Links

          Activity

            People

              biyan900116@gmail.com Yann Byron
              biyan900116@gmail.com Yann Byron
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: