Details
-
Improvement
-
Status: Resolved
-
P2
-
Resolution: Duplicate
-
None
-
None
Description
[copying in-person / email discussion at Strata Singapore to JIRA]
The FlinkBatchTransformTranslators use transform.getName() [1] – this is the "SDK name" for the transform.
The "user name" for the transform is not available here, it is in fact on the TransformHierarchy.Node as node.getFullName() [2].
getFullName() is used some in Flink, but not when setting step names.
I drafted a quick commit that sort of propagates the user names to the web UI (but only for DataSource, and still too verbose: https://github.com/dhalperi/incubator-beam/commit/a2f1fb06b22a85ec738e4f2a604c9a129891916c)
Before this change, the "ReadLines" step showed up as: "DataSource (at Read(CompressedSource) (org.apache.beam.runners.flink.translation.wrappers.SourceInputFormat))"
With this change, it shows up as "DataSource (at ReadLines/Read (org.apache.beam.runners.flink.translation.wrappers.SourceInputFormat))"
which I think is closer. [I'd still like it to JUST be "ReadLines/Read" e.g.].
Thoughts?
[1] https://github.com/apache/incubator-beam/blob/master/runners/flink/runner/src/main/java/org/apache/beam/runners/flink/translation/FlinkBatchTransformTranslators.java#L129
[2] https://github.com/apache/incubator-beam/blob/master/sdks/java/core/src/main/java/org/apache/beam/sdk/runners/TransformHierarchy.java#L252