Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-19937

Intern fields in MapWork on deserialization

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 4.0.0, 4.0.0-alpha-1
    • Spark
    • None

    Description

      When fixing HIVE-16395, we decided that each new Spark task should clone the JobConf object to prevent any ConcurrentModificationException from being thrown. However, setting this variable comes at a cost of storing a duplicate JobConf object for each Spark task. These objects can take up a significant amount of memory, we should intern them so that Spark tasks running in the same JVM don't store duplicate copies.

      Attachments

        1. HIVE-19937.1.patch
          1 kB
          Sahil Takiar
        2. HIVE-19937.2.patch
          7 kB
          Sahil Takiar
        3. HIVE-19937.3.patch
          13 kB
          Sahil Takiar
        4. HIVE-19937.4.patch
          15 kB
          Sahil Takiar
        5. HIVE-19937.5.patch
          15 kB
          Sahil Takiar
        6. HIVE-19937.6.patch
          15 kB
          Sahil Takiar
        7. post-patch-report.html
          2.38 MB
          Sahil Takiar
        8. report.html
          1.34 MB
          Sahil Takiar

        Activity

          People

            stakiar Sahil Takiar
            stakiar Sahil Takiar
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: