Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
Description
A container can move and change the Datanode details in it's lifecycle post close of a container.
When a key needs to be read by the client, OM queries SCM to update the location information for a container. This adds additional latency per request and load on SCM which is directly proportional to the read load on OM.
This Jira tracks the design and implementation of improving read performance by addressing this code path.
Attachments
Attachments
1.
|
Introduce container location cache in ScmClient | Resolved | Duong | ||
2.
|
Implement GetKeyInfo API | Resolved | Duong | ||
3.
|
Integrate the GetKeyInfo API to key read flows | Resolved | Duong | ||
4.
|
Experiment the location cache size | Resolved | Duong | ||
5.
|
Add container location cache metrics. | Resolved | Duong | ||
6.
|
Normalize DatanodeDetails to reduce cache memory footprint. | Resolved | Duong | ||
7.
|
S3G: Avoid calling getS3Volume by using GetKeyInfo API. | Resolved | Duong | ||
8.
|
Integrate the GetKeyInfo API to OFS | Resolved | Duong | ||
9.
|
Use container cache in Key listing API. | Resolved | Duong |