HIVE-16894 (Hive; sub-task of HIVE-16865, Handle replication bootstrap of large databases)

Multi-threaded execution of bootstrap dump of tables / functions

Details

    • Type: Sub-task
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 3.0.0
    • Fix Version/s: None
    • Component/s: HiveServer2
    • Labels: None

    Description

      After HIVE-16893 is completed, the bootstrap process will dump a single table at a time. This will be very time-consuming and will not make optimal use of the available resources. Since there is no dependency between the dumps of different tables, we should be able to run them in parallel.

      A bootstrap dump at the database level performs:

      • bootstrap of all tables (in scope for this JIRA)
        • bootstrap of all partitions in a table
      • bootstrap of all functions (in scope for this JIRA)
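
      The idea above can be illustrated with a minimal sketch that submits one independent task per table and per function to a fixed-size thread pool. This is not Hive's actual implementation: the class name, dumpTable, dumpFunction, dumpAll and the thread-count parameter are all hypothetical placeholders, and the real dump logic is replaced by a string result.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Illustrative sketch only; none of these names come from the Hive code base.
final class ParallelBootstrapDumpSketch {

    // Hypothetical stand-in for dumping one table (including its partitions).
    static String dumpTable(String table) {
        return "dumped table " + table;
    }

    // Hypothetical stand-in for dumping one function.
    static String dumpFunction(String function) {
        return "dumped function " + function;
    }

    // Runs every table and function dump as an independent task on a fixed pool.
    // Results are returned in submission order: tables first, then functions.
    static List<String> dumpAll(List<String> tables, List<String> functions, int threads)
            throws InterruptedException, ExecutionException {
        List<Callable<String>> tasks = new ArrayList<>();
        for (String table : tables) {
            tasks.add(() -> dumpTable(table));
        }
        for (String function : functions) {
            tasks.add(() -> dumpFunction(function));
        }

        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<String> results = new ArrayList<>();
            // invokeAll blocks until all tasks have finished and keeps task order.
            for (Future<String> future : pool.invokeAll(tasks)) {
                // get() rethrows a failed dump as an ExecutionException.
                results.add(future.get());
            }
            return results;
        } finally {
            pool.shutdown();
        }
    }
}
```

      Because the tasks share no state, the pool size is the only knob in this sketch. Whether a single failed dump should abort the whole bootstrap, as it does here, is a design choice the ticket does not specify.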

          People

            anishek Anishek Agarwal