Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-131

insert overwrite directory leaves behind uncommitted/tmp files from failed tasks

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Critical
    • Resolution: Fixed
    • None
    • 0.3.0
    • Query Processor
    • None
    • Reviewed
    • Hide
      HIVE-131. Remove uncommitted files from failed tasks. (Joydeep Sen Sarma via zshao)
      Show
      HIVE-131 . Remove uncommitted files from failed tasks. (Joydeep Sen Sarma via zshao)

    Description

      _tmp files are getting left behind on insert overwrite directory:

      /user/jssarma/ctst1/40422_m_000195_0.deflate <r 3> 13285 2008-12-07 01:47 rw-r-r- jssarma supergroup
      /user/jssarma/ctst1/40422_m_000196_0.deflate <r 3> 3055 2008-12-07 01:46 rw-r-r- jssarma supergroup
      /user/jssarma/ctst1/_tmp.40422_m_000033_0 <r 3> 0 2008-12-07 01:53 rw-r-r- jssarma supergroup
      /user/jssarma/ctst1/_tmp.40422_m_000037_1 <r 3> 0 2008-12-07 01:53 rw-r-r- jssarma supergroup

      this happened with speculative execution. the code looks good (in fact in this case many speculative tasks were launched - and only a couple caused problems). Almost seems like these files did not appear in the namespace until after the map-reduce job finished and the movetask did a listing of the output dir ..

      Attachments

        1. hive-131.patch.2
          15 kB
          Joydeep Sen Sarma
        2. HIVE-131.patch.1
          14 kB
          Joydeep Sen Sarma

        Issue Links

          Activity

            People

              jsensarma Joydeep Sen Sarma
              jsensarma Joydeep Sen Sarma
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: