Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-15435

HiveCLI: two jobs are submitted consecutively when tez is used

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 2.1.1
    • Fix Version/s: None
    • Component/s: CLI, Hive, Tez
    • Labels:
      None
    • Environment:

      Hadoop 2.7.3, Hive 2.1.1, Tez 0.8.4

      Description

      Since Hive 2.1.1 (Hive 2.1.0 was OK), the HiveCLI has a strange behavior when submitting a job with Tez as the exection engine.

      Doing a query like:

      hive (shfs3453)> set hive.execution.engine=tez;
      hive (shfs3453)> set tez.queue.name=HQ_OLPS;
      hive (shfs3453)> SELECT COUNT(year) FROM chicagocrimes GROUP BY year;
      

      The query will submit two jobs: one submitted in the queue defined, one submitted in the queue default. Both finish successfully. But only the second job processes the data.
      You can find attached a screenshot.

      This behavior is not visible when using beeline.

        Attachments

        1. pom.xml
          48 kB
          Alexandre Linte
        2. rm_tez_query.png
          192 kB
          Alexandre Linte

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              BigDataOrange Alexandre Linte
            • Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

              • Created:
                Updated: