Apache Ozone / HDDS-698

Support Topology Awareness for Ozone

Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.0.0
    • Component/s: SCM
    • Labels: None

    Description

      This is an umbrella JIRA to add topology-aware support for Ozone Pipelines, Containers and Blocks. HDFS has long provided rack/nodegroup awareness for reliability and high-performance data access. Ozone needs a similar mechanism, one that can be more flexible for cloud scenarios.
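
      To illustrate what rack awareness means in practice, here is a minimal sketch, not Ozone's actual implementation. It assumes each datanode has a slash-separated location path such as "/rack1/dn1" and measures distance as the number of hops to the closest common ancestor and back. The names `distance` and `sort_by_distance` are hypothetical and chosen only for this example.

```python
from typing import List


def distance(a: str, b: str) -> int:
    """Hops between two leaf locations, e.g. "/rack1/dn1" and "/rack2/dn3".

    Assumption for this sketch: a location is a slash-separated path from
    the topology root down to the node.
    """
    pa = [p for p in a.split("/") if p]
    pb = [p for p in b.split("/") if p]
    # Count how many leading path components the two locations share.
    common = 0
    for x, y in zip(pa, pb):
        if x != y:
            break
        common += 1
    # Climb from each node up to the closest common ancestor.
    return (len(pa) - common) + (len(pb) - common)


def sort_by_distance(client: str, replicas: List[str]) -> List[str]:
    """Order replica locations so the closest one to the client comes first."""
    return sorted(replicas, key=lambda r: distance(client, r))
```

      With this measure, a replica on the client's own node has distance 0, one on the same rack has distance 2, and one on another rack has distance 4, so a topology-aware read would try them in that order.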

      Attachments

        1. HDDS-698.000.patch
          159 kB
          Sammi Chen
        2. network-topology-default.xml
          3 kB
          Sammi Chen
        3. network-topology-nodegroup.xml
          3 kB
          Sammi Chen

        Issue Links

          1. Detect Ozone Network topology (Sub-task, Resolved, Sammi Chen)
          2. Support YAML format network topology cluster definition (Sub-task, Resolved, Junjie Chen; time spent: 3h)
          3. Support rack awared node placement policy based on network topology (Sub-task, Resolved, Sammi Chen)
          4. Fix random test failure TestSCMContainerPlacementRackAware (Sub-task, Resolved, Sammi Chen; time spent: 1h 10m)
          5. Add "scmcli printTopology" shell command to print datanode topology (Sub-task, Resolved, Sammi Chen; time spent: 1h 10m)
          6. Support configurable container placement policy through "ozone.scm.container.placement.classname" (Sub-task, Resolved, Sammi Chen; time spent: 2h 50m)
          7. Missing test resources of integration-test project in target directory after compile (Sub-task, Resolved, Sammi Chen; time spent: 1.5h)
          8. Add datanode to network topology cluster during node register (Sub-task, Resolved, Sammi Chen; time spent: 7.5h)
          9. Allow Ozone RPC client to read with topology awareness (Sub-task, Resolved, Sammi Chen; time spent: 4h 50m)
          10. NPE thrown while trying to find DN closest to client (Sub-task, Resolved, Sammi Chen; time spent: 1.5h)
          11. Make Topology Aware Replication/Read non-default for ozone 0.4.1 (Sub-task, Resolved, Xiaoyu Yao; time spent: 40m)
          12. Add option to "ozone scmcli printTopology" to order the output according to topology layer (Sub-task, Resolved, Xiaoyu Yao; time spent: 1h 10m)
          13. ReplicationManager fail to find proper node topology based on Datanode details from heartbeat (Sub-task, Resolved, Sammi Chen; time spent: 4h)
          14. Change topology sorting related logs in Pipeline from INFO to DEBUG (Sub-task, Resolved, Junjie Chen; time spent: 1.5h)
          15. Turn on topology aware read in TestFailureHandlingByClient (Sub-task, Resolved, Sammi Chen; time spent: 1h)
          16. Suppress WARN log from NetworkTopology#getDistanceCost (Sub-task, Resolved, Xiaoyu Yao; time spent: 40m)
          17. Use "ozone.network.topology.aware.read" to control both RPC client and server side logic (Sub-task, Resolved, Sammi Chen; time spent: 2h 20m)
          18. TestKeyManagerImpl.testLookupKeyWithLocation is failing (Sub-task, Resolved, Xiaoyu Yao; time spent: 40m)
          19. Support listPipelines by filters in scmcli (Sub-task, Resolved, Li Cheng; time spent: 40m)
          20. Add metrics in rack aware container placement policy (Sub-task, Resolved, Sammi Chen; time spent: 3.5h)
          21. Support multiple excluded scopes when choosing datanodes in NetworkTopology (Sub-task, Resolved, Sammi Chen; time spent: 5h)
          22. Choose datanode for pipeline creation based on network topology (Sub-task, Resolved, Sammi Chen)
          23. Replication manager should detect and correct containers which don't meet the replication policy (Sub-task, Resolved, Stephen O'Donnell)
          24. Refactor recon missing containers task to detect under, over and mis-replicated containers (Sub-task, Resolved, Stephen O'Donnell)
          25. Extend network topology acceptance test to read data when datanodes are stopped (Sub-task, Resolved, Stephen O'Donnell; time spent: 20m)
          26. Enable test added in HDDS-3084 when blocking issues are resolved (Sub-task, Resolved, Stephen O'Donnell; time spent: 10m)
          27. Test Topology Aware Job scheduling with Ozone Topology (Sub-task, Resolved, Stephen O'Donnell)

            People

              Sammi Chen (Sammi)
              Xiaoyu Yao (xyao)
              Votes: 1
              Watchers: 21

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 45h 10m