Hive / HIVE-3489

Use OutputCommitter to remove the requirement that the client stay alive for long-running queries

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved

    Description

      Depending on the query, the Hive client does a fair amount of work after the query finishes. This has several side effects that would be nice to avoid. These work items include:

      • Moving temporary files and directories to their final destination.
      • Creating empty files where required.
      • Updating the metastore with metadata.
      • Others?

      If we start using OutputCommitter, clients can fire and forget: they no longer need to stay alive for the entire time the query is executing. In addition, because this processing would happen in the cluster, the client becomes more lightweight.
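
      A minimal sketch of the idea, not Hive's actual code. It uses the local filesystem instead of HDFS so that it runs standalone, and a hypothetical JobCommitter interface that only mirrors the job-level hooks of Hadoop's OutputCommitter (setupJob, commitJob, abortJob). The class names, the placeholder file name and the metadata callback are all assumptions made for illustration.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Comparator;
import java.util.List;
import java.util.function.Consumer;
import java.util.stream.Collectors;
import java.util.stream.Stream;

/** Illustrative only: commit-time work done by a committer instead of the client. */
final class CommitSketch {

    /** Hypothetical stand-in for the job-level hooks of Hadoop's OutputCommitter. */
    interface JobCommitter {
        void setupJob() throws IOException;
        void commitJob() throws IOException;
        void abortJob() throws IOException;
    }

    /** Stages output in a temporary directory and publishes it on commit. */
    static final class MoveOnCommit implements JobCommitter {
        private final Path tempDir;
        private final Path finalDir;
        private final Consumer<Path> metadataUpdate; // stand-in for a metastore call

        MoveOnCommit(Path tempDir, Path finalDir, Consumer<Path> metadataUpdate) {
            this.tempDir = tempDir;
            this.finalDir = finalDir;
            this.metadataUpdate = metadataUpdate;
        }

        @Override
        public void setupJob() throws IOException {
            Files.createDirectories(tempDir);
        }

        @Override
        public void commitJob() throws IOException {
            // Create an empty file if the job produced no output at all.
            boolean empty;
            try (Stream<Path> entries = Files.list(tempDir)) {
                empty = !entries.findAny().isPresent();
            }
            if (empty) {
                Files.createFile(tempDir.resolve("000000_0")); // placeholder name
            }
            // Move the staged output to its final destination (must not exist yet).
            Path parent = finalDir.toAbsolutePath().getParent();
            if (parent != null) {
                Files.createDirectories(parent);
            }
            Files.move(tempDir, finalDir);
            // Publish metadata only after the data is in place.
            metadataUpdate.accept(finalDir);
        }

        @Override
        public void abortJob() throws IOException {
            deleteRecursively(tempDir);
        }
    }

    /** Deletes a directory tree, children before parents. */
    static void deleteRecursively(Path root) throws IOException {
        if (!Files.exists(root)) {
            return;
        }
        List<Path> paths;
        try (Stream<Path> walk = Files.walk(root)) {
            paths = walk.sorted(Comparator.reverseOrder()).collect(Collectors.toList());
        }
        for (Path p : paths) {
            Files.delete(p);
        }
    }

    public static void main(String[] args) throws IOException {
        Path base = Files.createTempDirectory("commit-sketch");
        Path staging = base.resolve("_tmp");
        Path target = base.resolve("table_dir");
        JobCommitter committer = new MoveOnCommit(staging, target,
                dir -> System.out.println("would update metastore for " + dir));
        committer.setupJob();
        Files.write(staging.resolve("part-0"), "row1\n".getBytes(StandardCharsets.UTF_8));
        committer.commitJob();
        System.out.println("published: " + Files.exists(target.resolve("part-0")));
    }
}
```

      In a real implementation these hooks would be invoked by the MapReduce framework inside the cluster, so nothing in commitJob depends on the submitting client still being alive.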

            People

              Assignee: Unassigned
              Reporter: Ashutosh Chauhan (ashutoshc)
              Votes: 0
              Watchers: 6
