Description
This JIRA introduces a new HDFS federation balance tool that balances data across federation namespaces. It uses DistCp to copy data from the source path to the target path.
The process is:
1. Use DistCp and snapshot diffs to sync data between the source (src) and the destination (dst) until they are identical.
2. Update the mount table in the Router if RBF mode is specified.
3. Handle the src data: move it to trash, delete it, or skip it (leave it in place).
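The three steps above can be illustrated with a small in-memory simulation. This is only a sketch and not the fedbalance implementation: namespaces and the mount table are modelled as plain dictionaries, and every name here (`TrashOption`, `snapshot_diff`, `sync_until_same`, `balance`) is hypothetical and chosen for illustration.

```python
from enum import Enum


class TrashOption(Enum):
    """Hypothetical enum for the three ways of handling src data (step 3)."""
    TRASH = "trash"
    DELETE = "delete"
    SKIP = "skip"


def snapshot_diff(src, dst):
    """Return the set of paths that differ between src and dst."""
    changed = {p for p in src if dst.get(p) != src[p]}
    removed = {p for p in dst if p not in src}
    return changed | removed


def sync_until_same(src, dst, max_rounds=10):
    """Step 1: apply diffs repeatedly until src and dst are identical.

    Returns the number of copy rounds that were needed.
    """
    rounds = 0
    while True:
        diff = snapshot_diff(src, dst)
        if not diff:
            return rounds
        if rounds >= max_rounds:
            raise RuntimeError("src and dst did not converge")
        for path in diff:
            if path in src:
                dst[path] = src[path]   # created or modified at src
            else:
                del dst[path]           # deleted at src
        rounds += 1


def balance(src, dst, mount_table, mount_point, dst_location,
            rbf_mode, trash_option, trash):
    """Run the three steps in order on in-memory stand-ins."""
    rounds = sync_until_same(src, dst)            # step 1
    if rbf_mode:
        mount_table[mount_point] = dst_location   # step 2
    if trash_option is TrashOption.TRASH:         # step 3
        trash.update(src)
        src.clear()
    elif trash_option is TrashOption.DELETE:
        src.clear()
    # TrashOption.SKIP leaves the src data untouched.
    return rounds
```

In this toy model the source never changes during the copy, so a single round is enough. On a live cluster the source may keep receiving writes between rounds, which is presumably why the sync step is described as repeating until both sides match.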
The design of the fedbalance tool comes from the discussion in HDFS-15087.
Attachments
Issue Links
- Blocked
  - HDFS-16087 RBF balance process is stuck at DisableWrite stage (Resolved)
  - HDFS-16116 Fix Hadoop federationBanance markdown bug. (Resolved)
- is related to
  - HDFS-15087 RBF: Balance/Rename across federation namespaces (Open)
  - HDFS-15747 RBF: Rename across sub-namespaces. (Open)
  - HDFS-13123 RBF: Add a balancer tool to move data across subcluster (Patch Available)
  - HDFS-15662 Complete the javadoc of hadoop-federation-balance. (Patch Available)
Sub-Tasks
| # | Summary | Status | Assignee |
|---|---------|--------|----------|
| 1 | RBF: Implement BalanceProcedureScheduler basic framework | Resolved | Jinglun |
| 2 | FedBalance tool implementation | Resolved | Jinglun |
| 3 | Add documentation for fedbalance tool | Resolved | Jinglun |
| 4 | Add separated config file hdfs-fedbalance-default.xml for fedbalance tool | Resolved | Jinglun |
| 5 | Javadocs fails for hadoop-federation-balance | Resolved | Xieming Li |
| 6 | Add diff threshold to FedBalance | Resolved | Jinglun |
| 7 | RBF: Router fails to start due to NoClassDefFoundError for hadoop-federation-balance | Resolved | Takanobu Asanuma |
| 8 | Minor fixes in HDFS Fedbalance document | Resolved | Ashutosh Gupta |