Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-48597

Distinguish the streaming nodes from the text representation of logical plan

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 4.0.0
    • 4.0.0
    • SQL

    Description

      We had a hard time to figure out whether the nodes are streaming or not, when we debugged the issue https://issues.apache.org/jira/browse/SPARK-47305 .

      Plan text for logical plan does not show the property of isStreaming, hence we had to speculate the value based on other context. In addition, even though the type of leaf node is explicitly meant to be streaming which enables us to track down the isStreaming for certain subtree, the plan could be very long and it’s a non-trivial effort to trace down to the leaf nodes. Also, if the leaf nodes are skipped on the representation due to the size, there is no way to get the information of isStreaming.

      We propose to introduce a marker of the representation for streaming, which will be shown in the text logical plan. There is no concept of "isStreaming" in physical plan, so the change only needs to happen in logical plan.

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            kabhwan Jungtaek Lim
            kabhwan Jungtaek Lim
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment