Uploaded image for project: 'Apache Nemo'
  1. Apache Nemo
  2. NEMO-237

Refactor ParentTaskDataFetcher to emit streaming data and watermark

Log workAgile BoardRank to TopRank to BottomAttach filesAttach ScreenshotBulk Copy AttachmentsBulk Move AttachmentsVotersStop watchingWatchersCreate sub-taskConvert to sub-taskLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 0.1

    Description

      Currently, ParentTaskDataFetcher is designed for batch jobs. 

      It retrieves multiple customized Iterators from {{InputReader, each of which has blocking hasNext()}} operation. This will block fetching data from multiple streams if a data stream does not emit data. 

      We should refactor ParentTaskDataFetcherInputReader and related classes to retrieve streaming data and watermark without blocking. 

      Attachments

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned Assign to me
            taegeonum Tae-Geon Um
            Votes:
            0 Vote for this issue
            Watchers:
            4 Stop watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment