Hive / HIVE-1355

Hive should use NullOutputFormat for hadoop jobs

Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.6.0
    • Component/s: Query Processor
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      Hive doesn't depend on the Hadoop job output folder; it produces output exclusively via side-effect folders. We should use an OutputFormat that can request that Hadoop skip job setup/cleanup. This could be NullOutputFormat (unless there are objections in Hadoop to changing NullOutputFormat's behavior).

      As a small side effect, this also avoids some totally unnecessary file creates and deletes in HDFS.
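
      The idea in the description can be illustrated with a toy model. This is only a sketch, not the attached patch and not Hadoop's API: the `OutputFormat` interface, its `needsJobOutputDir()` method, and the `runJob` helper are all hypothetical names invented for illustration. It shows how a framework could skip the output-directory create and delete steps when the output format declares that it writes nothing itself.

```java
import java.util.ArrayList;
import java.util.List;

public class NullOutputSketch {
    /** Hypothetical stand-in for an output format; NOT Hadoop's interface. */
    interface OutputFormat {
        /** True if the framework must create and clean up a job output dir. */
        boolean needsJobOutputDir();
    }

    /** Models a file-based format that relies on the job output folder. */
    static class FileOutput implements OutputFormat {
        public boolean needsJobOutputDir() { return true; }
    }

    /** Models a null format: tasks write only to side-effect folders. */
    static class NullOutput implements OutputFormat {
        public boolean needsJobOutputDir() { return false; }
    }

    /** Returns the steps the toy framework would perform for one job. */
    static List<String> runJob(OutputFormat format) {
        List<String> ops = new ArrayList<>();
        if (format.needsJobOutputDir()) {
            ops.add("setup: create job output dir");
        }
        ops.add("run tasks");
        if (format.needsJobOutputDir()) {
            ops.add("cleanup: delete job output dir");
        }
        return ops;
    }

    public static void main(String[] args) {
        System.out.println(runJob(new FileOutput()));
        System.out.println(runJob(new NullOutput()));
    }
}
```

      In Hadoop's old `mapred` API, an output format is selected on the job with `JobConf.setOutputFormat(...)`, and `org.apache.hadoop.mapred.lib.NullOutputFormat` is the class the description refers to. Whether selecting it actually skips setup/cleanup depends on the Hadoop-side behavior that the description proposes changing.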

      Attachments

        1. 1355.1.patch
          11 kB
          Joydeep Sen Sarma

        Activity

          People

            Assignee: Joydeep Sen Sarma (jsensarma)
            Reporter: Joydeep Sen Sarma (jsensarma)