Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-20656

Sensible defaults: Map aggregation memory configs are too aggressive

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 3.2.0, 4.0.0
    • 4.0.0-alpha-1
    • None
    • None

    Description

      The defaults for the following configs seems to be too aggressive. In java this can easily lead to several full GC pauses whose memory cannot be reclaimed.

      HIVEMAPAGGRHASHMEMORY("hive.map.aggr.hash.percentmemory", (float) 0.99,
          "Portion of total memory to be used by map-side group aggregation hash table"),
      HIVEMAPAGGRMEMORYTHRESHOLD("hive.map.aggr.hash.force.flush.memory.threshold", (float) 0.9,
          "The max memory to be used by map-side group aggregation hash table.\n" +
          "If the memory usage is higher than this number, force to flush data"),

       

      We can be little bit conservative for these configs to avoid getting into GC pause. 

      Attachments

        1. HIVE-20656.3.patch
          13 kB
          Prasanth Jayachandran
        2. HIVE-20656.2.patch
          13 kB
          Prasanth Jayachandran
        3. HIVE-20656.1.patch
          2 kB
          Prasanth Jayachandran

        Issue Links

          Activity

            People

              prasanth_j Prasanth Jayachandran
              prasanth_j Prasanth Jayachandran
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: