Uploaded image for project: 'Apache Tez'
  1. Apache Tez
  2. TEZ-3245

Race condition between addKnowInput and clearAndGetOnepartition of InputHost

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.9.0
    • 0.9.0
    • None
    • None

    Description

      In ShuffleManager::addKnownInput, InputHost::addKnowInput is invoked without locking. At the same time, InputHost::clearAndGetOnePartition can be invoked with lock(ShuffleManager.RunShuffleCallable::callInternal => ShuffleManager::constructFetcherForHost => InputHosst::clearAndGetOnePartition). The method clearAndGetOnePartition does two things: drain input attempt of a host and then delete the entry for this host. If addKnownInput happens between these two operation, the added input will be deleted without being handled.

      Attachments

        1. TEZ-3245.1.patch
          1 kB
          Zhiyuan Yang

        Issue Links

          Activity

            People

              zhiyuany Zhiyuan Yang
              zhiyuany Zhiyuan Yang
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: