Details
-
Task
-
Status: Closed
-
Blocker
-
Resolution: Fixed
-
None
-
None
Description
While benchmarking querying raw Parquet tables against Hudi tables, i've run the test against the same (Hudi) table:
- In one query path i'm reading it as just a raw Parquet table
- In another, i'm reading it as Hudi RO (read_optimized) table
Surprisingly enough, those 2 diverge in the # of files being read:
Raw Parquet
Hudi