Hive / HIVE-16865

Handle replication bootstrap of large databases

Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 3.0.0
    • Fix Version/s: None
    • Component/s: HiveServer2
    • Labels: None

    Description

      For larger databases, make sure that replication bootstrap can be handled.

      • Assume a large database can have close to a million tables, or a few tables with a few hundred thousand partitions.
      • Function replication: if a primary warehouse defines a large number of custom functions, and a single binary file (jar) incorporates most of them, loading these functions on the replica warehouse might be a problem. The same jar on the primary is copied over once per function, so each function ends up with its own local copy of the jar. Loading all of these copies might lead to excessive memory usage.
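
      The jar duplication described in the second point could, for example, be avoided by keying jar copies on their content rather than on the function that references them. The following is only a minimal sketch of that idea and not Hive's implementation: the function names, the cache directory layout, and the use of SHA-256 are all assumptions made for illustration.

```python
import hashlib
import os
import shutil
import tempfile


def file_sha256(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def dedupe_function_jars(function_jars, cache_dir):
    """Map each function name to one shared jar copy per distinct content.

    function_jars: dict of function name -> path of the jar copied for it.
    cache_dir: hypothetical directory holding one copy per distinct jar.
    """
    os.makedirs(cache_dir, exist_ok=True)
    resolved = {}
    for name, jar_path in function_jars.items():
        checksum = file_sha256(jar_path)
        shared_path = os.path.join(cache_dir, checksum + ".jar")
        # Copy only the first time this content is seen.
        if not os.path.exists(shared_path):
            shutil.copyfile(jar_path, shared_path)
        resolved[name] = shared_path
    return resolved


def demo():
    """Build three per-function jar copies, two with identical content."""
    with tempfile.TemporaryDirectory() as root:
        jars = {}
        samples = [("fn_a", b"udfs-v1"), ("fn_b", b"udfs-v1"), ("fn_c", b"other")]
        for name, payload in samples:
            path = os.path.join(root, name + ".jar")
            with open(path, "wb") as handle:
                handle.write(payload)
            jars[name] = path
        resolved = dedupe_function_jars(jars, os.path.join(root, "cache"))
        # Number of functions, number of distinct jars actually kept.
        return len(resolved), len(set(resolved.values()))
```

      In this sketch, functions backed by byte-identical jars resolve to the same local path, so a loader would only need to load each distinct jar once regardless of how many functions reference it.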

          People

            Assignee: Anishek Agarwal (anishek)
            Reporter: Anishek Agarwal (anishek)
