Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-15017

Random job failures with MapReduce and Tez

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Critical
    • Resolution: Unresolved
    • 2.1.0
    • None
    • Hive
    • None
    • Hadoop 2.7.2, Hive 2.1.0

    Description

      Since Hive 2.1.0, we are facing a blocking issue on our cluster. All the jobs are failing randomly on mapreduce and tez as well.

      In both case, we don't have any ERROR or WARN message in the logs. You can find attached:

      • hive cli output errors
      • yarn logs for a tez and mapreduce job
      • nodemanager logs (mr only, we have the same logs with tez)

      Note: This issue doesn't exist with Pig jobs (mr + tez), Spark jobs (mr), so this cannot be an Hadoop / Yarn issue.

      Attachments

        1. debug_yarn_container_mr_job_datanode05.log
          171 kB
          Alexandre Linte
        2. debug_yarn_container_mr_job_datanode03.log
          175 kB
          Alexandre Linte
        3. hive-site.xml
          39 kB
          Alexandre Linte
        4. yarn_container_tez_job_datanode06.txt
          15 kB
          Alexandre Linte
        5. yarn_container_tez_job_datanode05.txt
          9 kB
          Alexandre Linte
        6. nodemanager_logs_mr_job.txt
          1 kB
          Alexandre Linte
        7. yarn_syslog_mr_job.txt
          45 kB
          Alexandre Linte
        8. yarn_syslog_tez_job.txt
          42 kB
          Alexandre Linte
        9. hive_cli_mr.txt
          2 kB
          Alexandre Linte
        10. hive_cli_tez.txt
          4 kB
          Alexandre Linte

        Activity

          People

            Unassigned Unassigned
            BigDataOrange Alexandre Linte
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: