Details
-
Improvement
-
Status: Open
-
Minor
-
Resolution: Unresolved
-
None
-
None
-
None
-
None
Description
Progress should be updated for every read of an input
reads an input, writes an output, nor updates its status string
I think ever loop should simply be calling progress(). If during a major compaction there are a lot of deleted values, long gaps of time can occur without a progress update and the job may be timed out by YARN.
I'm not 100% sure this is happening, but just something I wanted to point out.