FLINK-3634: Fix documentation for DataSetUtils.zipWithUniqueId()


    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 1.1.0
    • Fix Version/s: 1.0.1, 1.1.0
    • Component/s: Documentation
    • Labels: None

      Description

      Under FLINK-2590 the assignment and testing of unique IDs was improved, but the documentation still appears to reference the old implementation.

      With parallelism=1 there is no difference between zipWithUniqueId and zipWithIndex. With greater parallelism, the results of zipWithUniqueId depend on the partitioning.

      The documentation should demonstrate a possible result that is different from the incremental sequence of zipWithIndex while noting that results are dependent on the parallelism and partitioning.
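
      The kind of result the documentation could show can be sketched in plain Java without Flink. This is a simulation only: it assumes each subtask forms an ID as (per-subtask counter << bits) + subtask index, where bits is the number of bits needed to represent parallelism - 1. That scheme, and the names ZipIdSketch, bitsFor, uniqueIds and consecutiveIndices, are assumptions made for illustration and are not taken from this issue.

```java
import java.util.ArrayList;
import java.util.List;

/** Plain-Java simulation of ID assignment; hypothetical names, no Flink dependency. */
class ZipIdSketch {

    /** Bits needed to represent the largest subtask index (parallelism - 1). */
    static int bitsFor(int parallelism) {
        return 64 - Long.numberOfLeadingZeros(parallelism - 1L);
    }

    /**
     * Assumed zipWithUniqueId-style scheme: every subtask counts its own
     * elements from 0 and emits (counter << bits) + subtaskIndex.
     * partitionSizes[i] is the number of elements held by subtask i.
     */
    static List<Long> uniqueIds(int[] partitionSizes) {
        int bits = bitsFor(partitionSizes.length);
        List<Long> ids = new ArrayList<>();
        for (int subtask = 0; subtask < partitionSizes.length; subtask++) {
            for (long counter = 0; counter < partitionSizes[subtask]; counter++) {
                ids.add((counter << bits) + subtask);
            }
        }
        return ids;
    }

    /** zipWithIndex-style result: consecutive indices across all partitions. */
    static List<Long> consecutiveIndices(int[] partitionSizes) {
        List<Long> ids = new ArrayList<>();
        long next = 0;
        for (int size : partitionSizes) {
            for (int i = 0; i < size; i++) {
                ids.add(next++);
            }
        }
        return ids;
    }

    public static void main(String[] args) {
        System.out.println(uniqueIds(new int[] {8}));          // [0, 1, 2, 3, 4, 5, 6, 7]
        System.out.println(uniqueIds(new int[] {2, 6}));       // [0, 2, 1, 3, 5, 7, 9, 11]
        System.out.println(uniqueIds(new int[] {6, 2}));       // [0, 2, 4, 6, 8, 10, 1, 3]
        System.out.println(consecutiveIndices(new int[] {2, 6})); // [0, 1, 2, 3, 4, 5, 6, 7]
    }
}
```

      Under these assumptions, a single partition of eight elements yields the same sequence as consecutive indexing, while the same eight elements split 2/6 or 6/2 across two subtasks yield different, non-consecutive ID sets. The IDs stay unique in every case, but their values depend on both the parallelism and how elements are partitioned.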

    People

    • Assignee: Greg Hogan (greghogan)
    • Reporter: Greg Hogan (greghogan)