  Hive / HIVE-15435

HiveCLI: two jobs are submitted consecutively when tez is used

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 2.1.1
    • Fix Version/s: None
    • Component/s: CLI, Hive, Tez
    • Labels: None
    • Environment: Hadoop 2.7.3, Hive 2.1.1, Tez 0.8.4

    Description

      Since Hive 2.1.1 (Hive 2.1.0 was fine), HiveCLI behaves unexpectedly when submitting a job with Tez as the execution engine.

      Running a query such as:

      hive (shfs3453)> set hive.execution.engine=tez;
      hive (shfs3453)> set tez.queue.name=HQ_OLPS;
      hive (shfs3453)> SELECT COUNT(year) FROM chicagocrimes GROUP BY year;
      

      The query submits two jobs: one to the configured queue (HQ_OLPS) and one to the default queue. Both finish successfully, but only the second job processes the data.
      A screenshot is attached (rm_tez_query.png).
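
To confirm the duplicate submission without relying on the screenshot, the applications reported by the YARN ResourceManager REST endpoint `/ws/v1/cluster/apps` can be grouped by queue. The following is only a minimal sketch: the helper name `tez_apps_by_queue` and the application ids in the sample payload are hypothetical, and the payload is a hand-written stand-in for a real response, not output captured from the affected cluster.

```python
from collections import defaultdict


def tez_apps_by_queue(rm_response):
    """Group Tez application ids by queue.

    `rm_response` is the parsed JSON body of the ResourceManager
    /ws/v1/cluster/apps endpoint. The helper name is hypothetical.
    """
    # The ResourceManager returns {"apps": null} when no application matches.
    apps = (rm_response.get("apps") or {}).get("app") or []
    grouped = defaultdict(list)
    for app in apps:
        # Keep only Tez applications; other application types are ignored.
        if app.get("applicationType") == "TEZ":
            grouped[app.get("queue")].append(app.get("id"))
    return dict(grouped)


# Hand-written sample payload with made-up application ids (assumption).
sample = {
    "apps": {
        "app": [
            {"id": "application_0000000000000_0001",
             "queue": "HQ_OLPS", "applicationType": "TEZ"},
            {"id": "application_0000000000000_0002",
             "queue": "default", "applicationType": "TEZ"},
            {"id": "application_0000000000000_0003",
             "queue": "default", "applicationType": "MAPREDUCE"},
        ]
    }
}

for queue, ids in sorted(tez_apps_by_queue(sample).items()):
    print(queue, ids)
```

If a single HiveCLI query yields a Tez application in both `HQ_OLPS` and `default`, that matches the behavior described above.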

      This behavior is not observed when using Beeline.

      Attachments

        1. rm_tez_query.png
          192 kB
          Alexandre Linte
        2. pom.xml
          48 kB
          Alexandre Linte

          People

            Assignee: Unassigned
            Reporter: Alexandre Linte (BigDataOrange)
            Votes: 0
            Watchers: 4
