Ambari / AMBARI-2116

Add parameters to improve HIVE performance

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version: 1.2.3
    • Fix Version: 1.2.4
    • Component: ambari-server
    • Labels: None

    Description

      This bug captures the set of Hive and Hadoop configurations, and their default values, that are required for better/correct performance.

      Here are the settings Ambari should use for hive-site.xml:

      set hive.enforce.bucketing=true;
      set hive.enforce.sorting=true;
      set hive.map.aggr=true;
      set hive.optimize.bucketmapjoin=true;
      set hive.optimize.bucketmapjoin.sortedmerge=true;
      set hive.mapred.reduce.tasks.speculative.execution=false;
      set hive.auto.convert.join=true;
      set hive.auto.convert.sortmerge.join=true;
      set hive.auto.convert.sortmerge.join.noconditionaltask=true;
      set hive.auto.convert.join.noconditionaltask=true;
      set hive.auto.convert.join.noconditionaltask.size=1000000000;
      set hive.optimize.reducededuplication.min.reducer=1;
      set hive.optimize.mapjoin.mapreduce=true;
      set hive.mapjoin.bucket.cache.size=10000;
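
The settings above are written as Hive `set` statements, while hive-site.xml stores each setting as a `<property>` element with `<name>` and `<value>` children inside `<configuration>`. The following is a minimal sketch of that translation, not the approach taken in the attached patches. The function names `parse_set_statements` and `to_hive_site_xml` are hypothetical and chosen only for illustration.

```python
from xml.sax.saxutils import escape

# The settings listed in this issue, copied verbatim.
SET_LINES = """\
set hive.enforce.bucketing=true;
set hive.enforce.sorting=true;
set hive.map.aggr=true;
set hive.optimize.bucketmapjoin=true;
set hive.optimize.bucketmapjoin.sortedmerge=true;
set hive.mapred.reduce.tasks.speculative.execution=false;
set hive.auto.convert.join=true;
set hive.auto.convert.sortmerge.join=true;
set hive.auto.convert.sortmerge.join.noconditionaltask=true;
set hive.auto.convert.join.noconditionaltask=true;
set hive.auto.convert.join.noconditionaltask.size=1000000000;
set hive.optimize.reducededuplication.min.reducer=1;
set hive.optimize.mapjoin.mapreduce=true;
set hive.mapjoin.bucket.cache.size=10000;
"""


def parse_set_statements(text):
    """Parse 'set key=value;' lines into an ordered list of (key, value) pairs."""
    pairs = []
    for raw in text.splitlines():
        line = raw.strip().rstrip(";").strip()
        if not line:
            continue  # skip blank lines
        if not line.lower().startswith("set "):
            raise ValueError(f"not a set statement: {raw!r}")
        key, sep, value = line[4:].partition("=")
        if not sep:
            raise ValueError(f"missing '=' in: {raw!r}")
        pairs.append((key.strip(), value.strip()))
    return pairs


def to_hive_site_xml(pairs):
    """Render (key, value) pairs as a Hadoop-style <configuration> document."""
    lines = ["<configuration>"]
    for key, value in pairs:
        lines.append("  <property>")
        lines.append(f"    <name>{escape(key)}</name>")
        lines.append(f"    <value>{escape(value)}</value>")
        lines.append("  </property>")
    lines.append("</configuration>")
    return "\n".join(lines)


if __name__ == "__main__":
    print(to_hive_site_xml(parse_set_statements(SET_LINES)))
```

Running the script prints a `<configuration>` block containing one `<property>` per setting, in the order listed above. Whether each property is honored depends on the Hive version in use.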

      Attachments

        1. AMBARI-2116.patch
          2 kB
          Siddharth Wagle
        2. AMBARI-2116-1.patch
          2 kB
          Siddharth Wagle
        3. AMBARI-2116-2.patch
          8 kB
          Siddharth Wagle
        4. AMBARI-2116-3.patch
          2 kB
          Siddharth Wagle

          People

            Assignee: Siddharth Wagle (swagle)
            Reporter: Siddharth Wagle (swagle)
            Votes: 0
            Watchers: 3