Hive / HIVE-13303

spill to YARN directories, not tmp, when available


Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.1.0
    • Component/s: None
    • Labels: None

    Description

      RowContainer::setupWriter, HybridHashTableContainer::spillPartition, (KeyValueContainer|ObjectContainer)::setupOutput, and VectorMapJoinRowBytesContainer::setupOutputFileStreams create their files in tmp. Other code may do this too; these are the ones I see on the execution path. When there are multiple YARN output directories and multiple tasks running on a machine, it's better to spill to the YARN directories instead. The only open question is cleanup.
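
      To illustrate the idea only (this is not the attached patch), here is a minimal Java sketch of picking a spill location from a comma-separated list of YARN local directories and falling back to java.io.tmpdir when none is usable. The class SpillDirs and its methods are hypothetical names. The sketch assumes the caller obtains the directory list itself, for example from the LOCAL_DIRS environment variable that YARN sets for a container. Cleanup is handled here with deleteOnExit, which is just one possible answer to the cleanup question.

```java
import java.io.File;
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ThreadLocalRandom;

// Hypothetical helper; not part of Hive.
final class SpillDirs {

    /** Parses a comma-separated list and keeps only existing, writable directories. */
    static List<Path> usableDirs(String commaSeparated) {
        List<Path> dirs = new ArrayList<>();
        if (commaSeparated == null) {
            return dirs;
        }
        for (String entry : commaSeparated.split(",")) {
            String trimmed = entry.trim();
            if (trimmed.isEmpty()) {
                continue;
            }
            Path p = Paths.get(trimmed);
            if (Files.isDirectory(p) && Files.isWritable(p)) {
                dirs.add(p);
            }
        }
        return dirs;
    }

    /** Picks one usable directory at random to spread I/O; falls back to java.io.tmpdir. */
    static Path chooseSpillDir(String yarnLocalDirs) {
        List<Path> dirs = usableDirs(yarnLocalDirs);
        if (dirs.isEmpty()) {
            return Paths.get(System.getProperty("java.io.tmpdir"));
        }
        return dirs.get(ThreadLocalRandom.current().nextInt(dirs.size()));
    }

    /** Creates a spill file in the chosen directory and registers it for deletion at JVM exit. */
    static File createSpillFile(String yarnLocalDirs, String prefix) throws IOException {
        Path dir = chooseSpillDir(yarnLocalDirs);
        File f = Files.createTempFile(dir, prefix, ".tmp").toFile();
        f.deleteOnExit();
        return f;
    }
}
```

      A caller could use it as SpillDirs.createSpillFile(System.getenv("LOCAL_DIRS"), "hive-spill"). Choosing a directory at random is a simple way to spread spill I/O across disks when several tasks share a machine; a real implementation might prefer round-robin or free-space checks.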

      Attachments

        1. HIVE-13303.patch
          31 kB
          Sergey Shelukhin

            People

              Assignee: Sergey Shelukhin (sershe)
              Reporter: Sergey Shelukhin (sershe)
              Votes: 0
              Watchers: 4
