Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Won't Do
-
3.1.0, 3.0.0
-
None
-
None
Description
While debugging
TestCrudCompactorOnTez#testCompactionWithSchemaEvolutionAndBuckets(), it has come to my attention, that even though before compaction, the file name of the single bucket in the delta directories is bucket_00001, in the new base, the name of the new single bucket file is bucket_00000. At the same time, the bucket value in the ROW__ID of the records remain the same and suggest that the bucket id is 1.
So the bucket id and the file name do not match. This could lead to problems.
The test itself does not reveal this issue, although I think that the tests should check this, too. At the same time, the tests assume the exact bucket id value in cases where it cannot be predicted and fail, even though the bucket it does not change after the compaction, so the check should really pass.