Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Cannot Reproduce
-
2.1.1
-
None
-
None
Description
We currently facing issue, that when we call checkpoint on dataframe, it creates partitions in checkpoint dir, but some of them are empty. So we having exceptions reading dataframe back.
Do you have any idea how to avoid it?
it creates 200 partitions.Some are empty. I used repartition(1) before checkpoint. But it is not good wordaround. Do we have anyway , to populate all partitions with data, or avoid empty files?
Pasted snapshot.