Details
-
Sub-task
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
-
None
Description
Spark has an OutputMetrics it uses to expose records / bytes written. We currently don't integrate with it and the Spark UI shows a blank value for output records / bytes. We have our own customer accumulators instead (like HIVE_RECORDS_OUT).
Spark exposes the OutputMetrics object inside individual tasks via the TaskContext.get() method. We can use this method to access the OutputMetrics object and update it.