Uploaded image for project: 'Atlas'
  1. Atlas
  2. ATLAS-510 High availability of Atlas
  3. ATLAS-511

Ability to run multiple instances of Atlas Server with automatic failover to one active server

Agile BoardAttach filesAttach ScreenshotVotersWatch issueWatchersLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 0.7-incubating
    • None
    • None

    Description

      One of the most important components that only supports active-standby mode currently is the Atlas server which hosts the API / UI for Atlas. As described in the HA Documentation, we currently are limited to running only one instance of the Atlas server behind a proxy service. If the running instance goes down, a manual process is required to bring up another instance.

      In this JIRA, we propose to have an ability to run multiple Atlas server instances. However, as a first step, only one of them will be actively processing requests. To have a consistent terminology, let us call that server the master. Any requests sent to the other servers will be redirected to the master.

      When the master suffers a partition, one of the other servers must automatically become the master and start processing requests. What this mode brings us over the current system is the ability to automatically failover the Atlas server instance without any manual intervention. Note that this can be arguably called an active/active setup

      ATLAS-488 raised to support multiple active Atlas server instances. While that would be ideal, we have to learn more about the underlying system behavior before we can get there, and hopefully we can take smaller steps to improve the system systematically. The method proposed here is similar to what is adopted in many other Hadoop components including HDFS NameNode, HBase HMaster etc.

      Attachments

        1. ATLAS-511-3.patch
          149 kB
          Hemanth Yamijala
        2. ATLAS-511-2.patch
          154 kB
          Hemanth Yamijala
        3. ATLAS-511-1.patch
          111 kB
          Hemanth Yamijala
        4. ATLAS-511.patch
          89 kB
          Hemanth Yamijala
        5. HADesign.pdf
          131 kB
          Hemanth Yamijala

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            yhemanth Hemanth Yamijala
            yhemanth Hemanth Yamijala
            Votes:
            0 Vote for this issue
            Watchers:
            8 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment