Details
-
Improvement
-
Status: Closed
-
Critical
-
Resolution: Fixed
-
None
Description
For HUDI-3204, COW table and MOR table in read_optimized query mode should return the 'yyyy-MM-dd' format of origin `data_date`, not yyyy/MM/dd''.
And the reason for that is because Hudi use HadoopFsRelation for the snapshot query mode of cow and the read_optimized query mode of mor.
Spark HadoopFsRelation will append the partition value of the real partition path. However, different from the normal table, Hudi will persist the partition value in the parquet file. So we just need read the partition value from the parquet file, not leave it to spark.
So we should not use `HadoopFsRelation` any more, and implement Hudi own `Relation` to deal with it.
Attachments
Issue Links
- relates to
-
HUDI-3204 Allow original partition column value to be retrieved when using TimestampBasedKeyGen
- Open
- links to