[HDDS-9343] Shift sortDatanodes logic to OM - ASF JIRA

Motivation behind this change:

When OM has to call SCM on every read, OM's read-path load is passed on to SCM, so SCM must scale to the same read throughput as OM. The extra call to SCM on every read also hurts OM's own read performance.
Instead, a more efficient approach would be to perform sorting within OM itself, eliminating the need to rely on SCM for every read.

Steps to achieve this:

I. Add API in SCM:

The logic that sorts the datanodes in container pipelines currently lives in SCM (SCMBlockProtocolServer).
SCM holds a configuration regarding the layout of the datanodes (similar to that of Hadoop) as part of an .xml file (NodeSchemaManager#init) - Configuration key: ozone.scm.network.topology.schema.file, value: network-topology-default.xml
The first step would be to add an API for SCM to serve this .xml file as part of an RPC call to OM.
This gives OM enough information to sort the datanodes itself, using the client location and the layout information.
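
A minimal sketch of what step I could look like, with the RPC layer left out. TopologySchemaProtocol, getTopologySchemaXml and OmTopologySchemaFetcher are hypothetical names chosen for illustration and are not taken from the Ozone code base; the only behaviour shown is that OM asks SCM for the raw schema text and rejects it if it is not well-formed XML.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

/** Hypothetical SCM-side call; the interface and method name are assumptions. */
interface TopologySchemaProtocol {
  /** Returns the raw contents of the configured topology schema file. */
  String getTopologySchemaXml();
}

/** Hypothetical OM-side helper that fetches the schema and sanity-checks it. */
final class OmTopologySchemaFetcher {
  private final TopologySchemaProtocol scm;

  OmTopologySchemaFetcher(TopologySchemaProtocol scm) {
    this.scm = scm;
  }

  /** Fetches the schema from SCM and returns the name of its root element. */
  String fetchRootElementName() {
    String xml = scm.getTopologySchemaXml();
    try {
      Document doc = DocumentBuilderFactory.newInstance()
          .newDocumentBuilder()
          .parse(new InputSource(new StringReader(xml)));
      return doc.getDocumentElement().getNodeName();
    } catch (Exception e) {
      // Malformed or truncated XML ends up here; OM should not cache it.
      throw new IllegalStateException("SCM returned an unparsable topology schema", e);
    }
  }
}
```

In a real implementation the interface would presumably be backed by an RPC client rather than called in-process.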

II. OM Cache and Refresh:

OM will need to cache the layout information and periodically refresh it.
The periodic refreshes would involve refetching the updated layout information from SCM.
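
The cache-and-refresh idea can be sketched roughly as follows. RefreshingTopologyCache is a hypothetical class, the Supplier stands in for the call to SCM, and the refresh period is whatever the caller passes in; none of these names come from the actual patch.

```java
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicReference;
import java.util.function.Supplier;

/** Hypothetical OM-side cache; T is whatever layout object OM decides to keep. */
final class RefreshingTopologyCache<T> implements AutoCloseable {
  private final Supplier<T> fetchFromScm;
  private final AtomicReference<T> current = new AtomicReference<>();
  private final ScheduledExecutorService scheduler =
      Executors.newSingleThreadScheduledExecutor(r -> {
        Thread t = new Thread(r, "topology-refresh");
        t.setDaemon(true); // do not keep the process alive just for refreshes
        return t;
      });

  RefreshingTopologyCache(Supplier<T> fetchFromScm, long period, TimeUnit unit) {
    this.fetchFromScm = fetchFromScm;
    refresh(); // initial load, so reads never see an empty cache
    scheduler.scheduleWithFixedDelay(this::refreshQuietly, period, period, unit);
  }

  /** Read path: returns the cached layout without contacting SCM. */
  T get() {
    return current.get();
  }

  /** Refetches the layout from SCM immediately and replaces the cached copy. */
  void refresh() {
    current.set(fetchFromScm.get());
  }

  private void refreshQuietly() {
    try {
      refresh();
    } catch (RuntimeException e) {
      // Keep serving the last good layout if SCM is temporarily unreachable.
    }
  }

  @Override
  public void close() {
    scheduler.shutdownNow();
  }
}
```

Reads only touch the AtomicReference, so the read path makes no call to SCM; only the background task (or an explicit refresh() call) does.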

III. Refactor NetworkTopology + Sort:

The NetworkTopology calculation can be refactored and moved to a common location so that both OM and SCM can utilize it.
OM could then call NetworkTopology#sortByDistanceCost on the resulting NetworkTopology object.
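
To illustrate the kind of ordering such a sort produces, here is a simplified stand-in. It is not NetworkTopology#sortByDistanceCost: it assumes locations are plain paths such as /rack1/node1, counts one unit of cost per hop up to the closest common ancestor, and ignores any per-layer costs from the schema. TopologySortSketch and its methods are hypothetical.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

/** Simplified, hypothetical distance sort over path-style locations. */
final class TopologySortSketch {
  private TopologySortSketch() { }

  /** Hops from each location up to their closest common ancestor, summed. */
  static int distance(String a, String b) {
    String[] pa = split(a);
    String[] pb = split(b);
    int common = 0;
    while (common < pa.length && common < pb.length
        && pa[common].equals(pb[common])) {
      common++;
    }
    return (pa.length - common) + (pb.length - common);
  }

  /** Returns a copy of nodes ordered from nearest to farthest from the reader. */
  static List<String> sortByDistance(String reader, List<String> nodes) {
    List<String> sorted = new ArrayList<>(nodes);
    // List.sort is stable, so nodes at equal distance keep their input order.
    sorted.sort(Comparator.comparingInt(n -> distance(reader, n)));
    return sorted;
  }

  private static String[] split(String location) {
    String trimmed = location.startsWith("/") ? location.substring(1) : location;
    return trimmed.isEmpty() ? new String[0] : trimmed.split("/");
  }
}
```

For a reader at /rack1/node1 and the nodes /rack2/node3, /rack1/node2 and /rack1/node1, the distances are 4, 2 and 0, so the sorted order is /rack1/node1, /rack1/node2, /rack2/node3.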

is related to

HDDS-9272 Performance improvement for OM's sort datanodes

relates to

HDDS-9674 Read from non-datanode host does not consider topology

links to

GitHub Pull Request #5391

#	Sub-task	Status	Assignee
1.	Introduce new API and cache refresh for serving network topology schema to OM	Resolved	Tanvi Penumudy
2.	Refactor sortDatanodes to OM	Resolved	Tanvi Penumudy
3.	Reorder initialization of ScmTopologyClient in OM	Resolved	Tanvi Penumudy
4.	Avoid loading network topology layer schema file for every read	Resolved	Tanvi Penumudy
5.	Revise NetworkTopology-based use cases to leverage custom equals() method	Resolved	Tanvi Penumudy
6.	OM admin CLI to enable force fetching network topology tree from SCM	Patch Available	Tanvi Penumudy
7.	Implement granular metrics for OM sortDatanodes	Patch Available	Tanvi Penumudy