Uploaded image for project: 'Tajo (Retired)'
  1. Tajo (Retired)
  2. TAJO-982

Improve Fetcher to get multiple shuffle outputs through a request

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: In Progress
    • Major
    • Resolution: Unresolved
    • None
    • 0.13.0
    • Data Shuffle
    • None

    Description

      Currently, Fetcher only can request at most a fetch for one shuffle output at a time. The implementation can cause performance degradation even though intermediate data is actually small.

      For example, If an input data set of the first stage is big and the intermediate data is very small, QueryMaster will choose a few of nodes for next execution block. Since each node keeps limited threads for fetch, it will take a long time for the nodes for next stage to fetch all intermediate.

      If Fetcher can get multiple shuffle outputs through a request, it would solve the slowness which occurs in some cases.

      Attachments

        Activity

          People

            jihoonson Jihoon Son
            hyunsik Hyunsik Choi
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: