Details
Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Reviewed
Description
Currently, we have no way of knowing how many rows the join and file sink operators have produced while the process is running.
This makes tasks very difficult to debug. It would be very useful to dump some stats, such as per-operator row counts, while the process (mapper/reducer) is running.
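
To make the request concrete, here is a rough sketch of the kind of counter that could sit in front of a join or file sink and report progress while the task is still running. It is not taken from the actual patch: the class name `RowCountingSink`, the `write_row` callable, and the `log_every` interval are all hypothetical and only illustrate the idea.

```python
import logging

logger = logging.getLogger("operator_stats")


class RowCountingSink:
    """Hypothetical wrapper that counts forwarded rows and logs progress."""

    def __init__(self, name, write_row, log_every=1000):
        self.name = name            # label for log lines, e.g. "join" or "file_sink"
        self.write_row = write_row  # callable that actually emits the row
        self.log_every = log_every  # number of rows between progress messages
        self.rows = 0

    def process(self, row):
        self.write_row(row)
        self.rows += 1
        # Report progress while the task is still running.
        if self.rows % self.log_every == 0:
            logger.info("%s: forwarded %d rows so far", self.name, self.rows)

    def close(self):
        # Log the final count once, when the task finishes.
        logger.info("%s: forwarded %d rows in total", self.name, self.rows)
        return self.rows
```

With something like this in place, the task log would show a running row count per operator, so a stuck or skewed mapper/reducer could be spotted before the job finishes.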