Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-45558

Introduce a metadata file for streaming stateful operator

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    Description

      The information to store in the metadata file:

      • operator name (no need to be unique among stateful operators in the query)
      • state store name
      • numColumnsPrefixKey: > 0 if prefix scan is enabled, 0 otherwise

      The body of metadata file will be in json format. The metadata file will be versioned just as other streaming metadata file to be future proof.

      The metadata file will improve expose more information about the state store, improves debugability and facilitate the development of state related feature such as reading and writing state and state repartitioning.

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Chaoqin Chaoqin Li
            Chaoqin Chaoqin Li
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment