[HDFS-16540] Data locality is lost when DataNode pod restarts in kubernetes - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 3.3.2
Fix Version/s: 3.4.0
Component/s: namenode
Labels:
- pull-request-available

Hadoop Flags:

Reviewed

Description

We have HBase RegionServer and Hdfs DataNode running in one pod. When the pod restarts, we found that data locality is lost after we do a major compaction of hbase regions. After some debugging, we found that upon pod restarts, its ip changes. In DatanodeManager, maps like networktopology are updated with the new info. host2DatanodeMap is not updated accordingly. When hdfs client with the new ip tries to find a local DataNode, it fails.

Attachments

Issue Links

is related to

HDFS-17188 Data loss in our production clusters due to missing HDFS-16540

Open

links to

GitHub Pull Request #4170

GitHub Pull Request #4246

GitHub Pull Request #4326

Activity

People

Assignee:: Huaxiang Sun

Reporter:: Huaxiang Sun

Votes:: 0 Vote for this issue

Watchers:: 10 Start watching this issue

Dates

Created:: 13/Apr/22 18:08

Updated:: 29/Sep/23 17:48

Resolved:: 16/May/22 04:33

Time Tracking

Estimated:

Not Specified

Remaining:

Logged:

8h 20m