Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-16865 Handle replication bootstrap of large databases
  3. HIVE-16897

repl load does not lead to excessive memory consumption for multiple functions from same binary jar

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 3.0.0
    • None
    • HiveServer2
    • None

    Description

      as part of function replication we currently keep a separate copy of the binary jar associated with the function ( this should be same on the primary warehouse also since each hdfs jar location given during creation of function will download the resource in a separate resource location thus leading to the same jar being included in class path multiple times)

      this will lead to excessive space used to keep all jars in classpath, solve this by identifying the common binary jar ( using checksum from primary on replica) and not creating multiple copies thus preventing excessive memory usage.

      Attachments

        Activity

          People

            anishek Anishek Agarwal
            anishek Anishek Agarwal
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: