Hadoop HDFS / HDFS-7337

Configurable and pluggable erasure codec and policy

    Details

    • Target Version/s:
    • Hadoop Flags:
      Incompatible change
    • Release Note:
      This allows users to:
      * develop and plug in their own erasure codecs and coders. Plugins are loaded automatically from Hadoop jars, and the corresponding codecs and coders are registered for runtime use.
      * define their own erasure coding policies through an XML file and a CLI command. The added policies are persisted into the fsimage.

      Description

      According to HDFS-7285 and the design, this issue covers support for multiple erasure codecs via a pluggable approach. It allows multiple codec schemas with different coding algorithms and parameters to be defined and configured. The resulting codec schemas can then be specified for different file folders via a command-line tool. While designing and implementing this pluggable framework, a concrete codec (Reed-Solomon) will also be implemented by default to prove the framework is useful and workable. A separate JIRA could be opened for the RS codec implementation.

      Note that HDFS-7353 will focus on the very low-level codec API and implementation to make concrete vendor libraries transparent to the upper layer. This JIRA focuses on the higher-level parts that interact with configuration, schemas, etc.
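      To make the pluggable idea concrete, below is a minimal, hypothetical sketch of how codec plugins could be discovered from jars on the classpath using Java's standard ServiceLoader mechanism. The ErasureCodecProvider interface and CodecPluginRegistry class are assumptions made up for this illustration, not the actual Hadoop API.

        import java.util.HashMap;
        import java.util.Map;
        import java.util.ServiceLoader;

        // Hypothetical plugin contract; names are illustrative only.
        interface ErasureCodecProvider {
          String getCodecName();   // e.g. "rs", "xor"
        }

        // Discovers providers from any jar on the classpath via
        // META-INF/services entries for ErasureCodecProvider, indexed by codec name.
        public class CodecPluginRegistry {
          private final Map<String, ErasureCodecProvider> codecs = new HashMap<>();

          public CodecPluginRegistry() {
            for (ErasureCodecProvider p : ServiceLoader.load(ErasureCodecProvider.class)) {
              codecs.put(p.getCodecName(), p);
            }
          }

          public ErasureCodecProvider getCodec(String name) {
            return codecs.get(name);
          }
        }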

      1. PluggableErasureCodec v4.pdf
        154 kB
        SammiChen
      2. PluggableErasureCodec-v3.pdf
        363 kB
        Kai Zheng
      3. PluggableErasureCodec-v2.pdf
        432 kB
        Kai Zheng
      4. PluggableErasureCodec.pdf
        409 kB
        Kai Zheng
      5. HDFS-7337-prototype-v3.zip
        49 kB
        Kai Zheng
      6. HDFS-7337-prototype-v2.zip
        34 kB
        Kai Zheng
      7. HDFS-7337-prototype-v1.patch
        14 kB
        Kai Zheng

        Issue Links

          Activity

          drankye Kai Zheng added a comment -

          To better position this issue and avoid possible confusion with storage policy, I rephrased the title a bit.

          zhz Zhe Zhang added a comment -

          Kai Zheng Sorry about the confusion. I actually created this JIRA for marking individual files or directories to be erasure coded (or not). I'll create another JIRA for that purpose, since we do need this pluggable codec schema support anyway.

          drankye Kai Zheng added a comment -

          Yes, agree. Please go ahead. Thanks.

          drankye Kai Zheng added a comment -

          Also opened HDFS-7353 to focus on the very low-level codec API and implementation to make concrete vendor libraries transparent to the upper layer. This JIRA focuses on the higher-level parts that interact with configuration, schemas, etc.

          drankye Kai Zheng added a comment -

          Zhe, let me consider these issues together and think about how to define and implement such a configurable and pluggable codec plus schema. I will give my thoughts here for discussion. Assigned to me.

          drankye Kai Zheng added a comment -

          Initial prototype patch for quick ideas.

          drankye Kai Zheng added a comment -

          Zhe Zhang, would you take a look at the quick prototype patch and see if it works? Thanks. It contains the codec definition relevant to ECManager and the coder definition relevant to ECWorker. I have checked the coder definition with Bo and it roughly works.

          I would also appreciate anyone else's feedback. Thanks.

          drankye Kai Zheng added a comment -

          The points made in the prototype patch are (a rough sketch of these abstractions follows the list):

          • Multiple erasure codecs can be configured and referenced by their names;
          • Multiple erasure codec instances or schemas can be defined with various options in a schema file, and can be specified via their distinct names;
          • ErasureCodec takes care of two aspects, ECSchema for NameNode/ECManager, and ErasureCoder for DataNode/ECWorker;
          • ECSchema is loaded from configuration and can also be persisted in compact form to be passed to DataNode if desired;
          • ErasureCodec is also responsible for calculating the BlockGroup, given the required original data blocks and the parity blocks to be computed;
          • ErasureCoder can be initialized with options from the schema and performs basic encoding/decoding of ECChunks;
          • ErasureCoder can be implemented using the Jerasure library or the Intel ISA library. The concrete coder should only be created on the DataNode side, so the corresponding libraries are only required on DataNodes. The NameNode doesn't need to create coders;
          • RS codec and LRC codec with corresponding coders are to be supported, as they're representative cases for such an API definition;
          • RS and LRC coder implementations will be provided by default using Intel ISA library.
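          To make the division of labor above concrete, here is a rough, illustrative-only sketch of how these abstractions could relate to each other. All type names, fields and signatures below are assumptions derived from the bullet points, not the committed API.

            import java.nio.ByteBuffer;

            // Minimal stand-ins for the entities mentioned above; illustrative only.
            class ECSchema { String codecName; int dataBlocks; int parityBlocks; int chunkSize; }
            class ECChunk { ByteBuffer buffer; }
            class ECBlock { long blockId; boolean parity; }
            class BlockGroup { ECBlock[] dataBlocks; ECBlock[] parityBlocks; }

            // Forms a BlockGroup from the original data blocks and the parity blocks
            // to be computed (used on the NameNode/ECManager side).
            interface BlockGrouper {
              BlockGroup makeBlockGroup(ECBlock[] dataBlocks, ECBlock[] parityBlocks);
            }

            // Performs the actual encoding/decoding of ECChunks
            // (used on the DataNode/ECWorker side, backed by Jerasure, ISA, etc.).
            interface ErasureCoder {
              void initialize(ECSchema schema);
              void encode(ECChunk[] dataChunks, ECChunk[] parityChunks);
              void decode(ECChunk[] inputs, int[] erasedIndexes, ECChunk[] outputs);
            }

            // The top-level construct tying the two aspects together for one code.
            interface ErasureCodec {
              ECSchema getSchema();
              BlockGrouper createBlockGrouper();
              ErasureCoder createErasureCoder();
            }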
          drankye Kai Zheng added a comment -

          To save time, I composed this doc to illustrate the erasure codec framework for review. Your feedback is welcome.

          drankye Kai Zheng added a comment -

          Updated the prototype code, listed as follows.
          hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/ec:

          ./ECSchema.java
          ./BlockGroup.java
          ./ECBlock.java
          ./ECChunk.java
          ./ECConfiguration.java
          ./SchemaLoader.java
          ./SubBlockGroup.java

          ./grouper/BlockGrouper.java
          ./grouper/LRCBlockGrouper.java
          ./grouper/RSBlockGrouper.java

          ./codec/ErasureCodec.java
          ./codec/IsaLRCErasureCodec.java
          ./codec/IsaRSErasureCodec.java
          ./codec/JavaRSErasureCodec.java
          ./codec/JerasureRSErasureCodec.java
          ./codec/LRCErasureCodec.java
          ./codec/RSErasureCodec.java

          ./coder/AbstractErasureCoder.java
          ./coder/ErasureCoder.java
          ./coder/IsaLRCErasureCoder.java
          ./coder/IsaRSErasureCoder.java
          ./coder/JavaRSErasureCoder.java
          ./coder/JerasureRSErasureCoder.java
          ./coder/LRCErasureCoder.java
          ./coder/RSErasureCoder.java
          ./coder/util/GaloisField.java

          ./rawcoder/AbstractRawErasureCoder.java
          ./rawcoder/impl/IsaReedSolomonDecoder.java
          ./rawcoder/impl/IsaReedSolomonEncoder.java
          ./rawcoder/IsaRSRawErasureCoder.java
          ./rawcoder/JavaRSRawErasureCoder.java
          ./rawcoder/JavaXORRawErasureCoder.java
          ./rawcoder/RawErasureCoder.java

          drankye Kai Zheng added a comment -

          Updated the prototype code:
          1. Refined the APIs: ErasureCoder handles BlockGroup, so ECWorker does not need to understand it;
          2. Included unit test code as an example of how ErasureCodec/ErasureCoder can be used.

          zhz Zhe Zhang added a comment -

          Great work Kai Zheng! I went over the design and have the following comments:

          1. I like the idea of creating an ec package under org.apache.hadoop.hdfs. It is a good place to host all codec classes.
          2. I think the ec package should focus on codec calculation based on a packet unit. Below is how I think the functions should be logically divided:
            • The ErasureCodec interface should simply provide encode and decode functions that take a byte[][] and produce another byte[][]. It should be unaware of blocks. For example, I imagine our encode function should look similar to Jerasure's (https://github.com/tsuraan/Jerasure/blob/master/Manual.pdf):
               void jerasure_matrix_encode(k, m, w, matrix, data_ptrs, coding_ptrs, size)
            • BlockGroups should be formed by ECManager. In doing so it calls the encode and decode functions from ErasureCodec
          3. Logically, BlockGroup is applicable even without EC, because striping can be done without EC. So an alternative is to put it in the protocol package.
          4. I don't think we should reference the schema through a name (since it wastes space and is fragile). We should look at other configurable policies (e.g., block placement algorithm) and see how they are loaded. IIRC a factory class is used.
          5. It's great that we are considering LRC in advance. However, with LEGAL-211 pending, I suggest we keep BlockGroup simpler for now. For example, it can contain only dataBlocks and parityBlocks. When we implement LRC we can subclass or extend it.
          6. I guess ECBlock is for testing purpose? An erasure coded block should have all properties of a regular block. I think we can just add a couple of flags to the Block class.
          7. It's not quite clear to me why we need ErasureCoderCallback. Is it for async codec calculation? If codec calculations are done on small packets, I think sync operations are fine.

          Thanks!

          drankye Kai Zheng added a comment -

          We're discussing the design and code offline. When that's finished, we'll see how we're aligned and I will update here.

          andrew.wang Andrew Wang added a comment -

          Hey Kai, thanks for getting us started here. I gave this a quick look, had a few comments:

          • Could you generate normal plaintext diffs rather than a zip? We might also want to reorganize things into existing packages. The rawcoder stuff could go somewhere in hadoop-common for instance. We could move the block grouper classes into blockmanagement. etc.
          • I see mixed tabs and spaces, we do spaces only in Hadoop.
          • Since the LRC stuff is still up in the air, could we defer everything related to that to a later JIRA?
          • In RSBlockGrouper, using ExtendedBlockId is overkill, since the bpid is the same for everything

          Configuration

          • The XML file approach seems potentially error-prone. IIUC, after a set of parameters is assigned to a schema name, the parameters should never be changed. We thus also need to keep the XML file in sync between the NN, DN, and client. The client part is especially troublesome. Are we planning to put this into the editlog/fsimage down the road, like how we do storage policies?
          • Also, I think we want to separate out the type of erasure coding from the implementation. The schema definition from the PDF encodes both together, e.g. JerasureRS. While it's not possible to change the RS part, the user might want to swap out Jerasure for ISAL, which should be allowed. This is sort of like how we did things for encryption; we define a CipherSuite (i.e. AES-CTR) and then the user can choose among the multiple pluggable implementations for that cipher.

          BlockGroup:

          • Zhe told me this is a placeholder class, but a few comments nonetheless.
          • Can we just set the two fields in the constructor? They should also be final.
          • Since the schema encodes the layout, does SubBlockGroup need to encode both data and parity? Do we even need SubBlockGroup? Seems like a single array and a schema (a concrete object, which also encodes the RS or LRC parameters) tells you the layout, which is sufficient. This will save some memory.
          drankye Kai Zheng added a comment -

          Hi Andrew, thanks for your great and detailed feedback. Sorry for my late response. I will address them soon.

          drankye Kai Zheng added a comment -

          Hi Zhe, let me address your comments. We have discussed quite a bit offline, so I'm here to summarize and clarify further. If I missed anything, please comment. Thanks.

          I like the idea of creating an ec package under org.apache.hadoop.hdfs. It is a good place to host all codec classes.

          Glad you like it. I guess we could put all the central EC constructs and facilities here that aren't specific to the client, namenode, or datanode. Currently the codec-related classes are the best examples.

          I think the ec package should focus on codec calculation based on a packet unit. Below is how I think the functions should be logically divided:

          In this work RawErasureCoder focuses on calculation over a packet unit or chunk. It won't be much; it simply implements how to encode/decode a group of chunks. Why not stop there, and why ask for a higher-level construct like ErasureCodec? As I explained to you, it is better to have a central place to maintain all the codec-specific logic. The effect is that a customer only needs to plug in at one place (ErasureCodec) instead of many places, which avoids possible inconsistency; to add support for a new erasure code algorithm, implementing an ErasureCodec is all that's needed, and we don't have to modify many places in ECManager, ECWorker and ECClient.

          So the question becomes which aspects are covered, and how, when supporting a new code algorithm: 1) how to calculate over a group of bytes, units or chunks, which is covered by ErasureCoder and RawErasureCoder; 2) how to lay out/order the group of chunks, which is covered by BlockGrouper. Both aspects are abstracted and can be extended per code algorithm or codec. So when adding support for a new code, it's expected that one would: 1) add a new ErasureCoder; 2) add a new BlockGrouper; 3) add a new ErasureCodec using the former two; 4) update hdfs-site.xml (or whatever place) to register the new ErasureCodec with a name (a minimal illustration of this step follows this paragraph). Then a customer would simply configure/create a new EC schema by referencing the new codec name; using that schema an EC file system zone can be created, and so on.

          Given that all the code-specific logic is extracted into the ErasureCodec construct, how is it called by, and how does it interact with, ECManager, ECWorker and ECClient? It all starts by assuming a schema is known by whatever means. Using the schema the ErasureCodec can be instantiated; using the codec instance the BlockGrouper can be created and utilized by ECManager to create a BlockGroup given the necessary information; and the ErasureCoder can be created and then utilized by ECWorker or ECClient to perform encoding/decoding over a group of chunks.
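          As a minimal illustration of step 4 above, something along these lines could register a new codec implementation under a name that schemas can then reference. The configuration key and class name below are hypothetical placeholders for this sketch, not actual Hadoop properties.

            import org.apache.hadoop.conf.Configuration;

            public class RegisterNewCodecExample {
              public static void main(String[] args) {
                Configuration conf = new Configuration();
                // Hypothetical key: map the codec name "mycode" to its implementation
                // class, so that an EC schema can later reference the codec by name.
                conf.set("hdfs.ec.codec.mycode", "org.example.ec.MyErasureCodec");
              }
            }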

          Sure, it won't be that easy. Zhe actually pointed out a hurdle: a codec would have to be hard-coded in order to be efficiently maintained/associated by an inode, so adding support for a new code may also involve changing code in some places outside of the codec framework. I will investigate such chances. Still, it would be ideal to avoid changing or adding code in many places besides the new codec itself.

          To demonstrate how the codec framework works, as Zhe suggested, we will come up with more than one codec so that we can compare and see more clearly. Currently only the RS codec is implemented, with a test case and sample; we're working on another one using the XOR code, though it may never be used in production.

          drankye Kai Zheng added a comment -

          Continued.

          Logically, BlockGroup is applicable even without EC, because striping can be done without EC. So an alternative is to put it in the protocol package.

          Good point! I agree we need to decouple. EC does need something like ECBlockGroup that can be derived from BlockGroup. Maybe we need a better name for it.

          I don't think we should reference the schema through a name (since it wastes space and is fragile).

          I agree and will investigate further.

          It's great that we are considering LRC in advance. However, with LEGAL-211 pending, I suggest we keep BlockGroup simpler for now. For example, it can contain only dataBlocks and parityBlocks. When we implement LRC we can subclass or extend it.

          Good point. Let me see how it can be simplified. Basically you're right: only data blocks and parity blocks are needed, I guess, in whatever code algorithm. ECManager only needs to provide an array of data blocks as sources and an array of parity blocks as placeholders, in addition to a block group id, to create a BlockGroup. How these blocks are organized/ordered is specific to the codec and can be hidden from the outside. So the SubBlockGroup stuff is really only for the codec framework. Sure, I will make it internal to avoid polluting the public API.

          drankye Kai Zheng added a comment -

          Continuing to address Zhe's comments above.

          I guess ECBlock is for testing purpose? An erasure coded block should have all properties of a regular block. I think we can just add a couple of flags to the Block class.

          You're right in a way; the ECBlock class isn't finalized, as the whole bundle of code was attached just for us to look at and discuss. I'm working on this and will decouple ECBlock from the HDFS block. It's possible because the codec framework already has a nice arrangement that delegates how to pull/extract chunks (ECChunk) from an ECBlock. It's the caller's (ECWorker or ECClient) responsibility to handle how to extract/collect byte chunks from an actual HDFS block. Once decoupled, the ECBlock or similar would be very lightweight and wouldn't need so many fields at all. I will have new code for us to discuss further.

          It's not quite clear to me why we need ErasureCoderCallback. Is it for async codec calculation? If codec calculations are done on small packets, I think sync operations are fine.

          The ErasureCoderCallback may be better named to avoid such confusion. It's not about sync vs. async. It's basically for the codec caller (ECWorker or ECClient) to handle how to get chunks from blocks; the codec will call it to pull chunks from blocks, so it can be regarded as a data source provider. In the ECWorker transforming case, many chunks can be pulled from the blocks being transformed, so the enclosed byte-level encode() or decode() in the raw coder is called repeatedly in a while loop. In the ECClient striping EC case, it's similar, until the application finishes writing/reading data for a BlockGroup. A rough sketch of the idea follows.
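          The following is a hypothetical sketch only; the interface and method names are assumptions made for illustration and may differ from the prototype code.

            import java.nio.ByteBuffer;

            // Minimal stand-ins for the block/chunk handles used below; illustrative only.
            class ECBlock { long blockId; }
            class ECChunk { ByteBuffer buffer; }

            // The caller (ECWorker or ECClient) implements this to feed chunks pulled
            // from actual HDFS blocks into the coder, group by group, until the blocks
            // are drained; the coder drives the loop and calls back for each group.
            interface ErasureCoderCallback {
              boolean hasNextInputs();
              ECChunk[] getNextInputChunks(ECBlock[] inputBlocks);
              void onCodedChunks(ECChunk[] inputChunks, ECChunk[] outputChunks); // consume output
            }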

          drankye Kai Zheng added a comment -

          Hi Andrew Wang, thanks for your comments and sorry for my late response.

          Could you generate normal plaintext diffs rather than a zip? We might also want to reorganize things into existing packages. The rawcoder stuff could go somewhere in hadoop-common for instance. We could move the block grouper classes into blockmanagement. etc.

          Yes, I will provide diff or patch format when attaching the new revision.
          I have also discussed with Uma, Zhe and Weihua how to organize the bundle of new code. It looks like we all agree to move the rawcoder classes to hadoop-common. About the block grouper in this codec code: it's not about block placement, but only about codec-specific logic. As discussed above (and Zhe also agreed), we need to support pluggable modules for how to form a block group for an EC code algorithm. The block grouper here is for that, and is taken care of by the high-level construct ErasureCodec. Please kindly review my comments to Zhe above and let me know if I'm not going in the right direction anywhere.

          I see mixed tabs and spaces, we do spaces only in Hadoop.

          Sorry for the mess. I will absolutely clean up and follow the style guidelines when I break this down and submit patches for the sub-tasks.

          Since the LRC stuff is still up in the air, could we defer everything related to that to a later JIRA?

          I agree. I added the LRC* stuff just to make sure I keep codes like LRC in mind, so that the codec framework is general enough and we won't have to redesign it when we consider supporting such code algorithms. I won't submit any LRC-related formal patches before the legal question is settled.

          In RSBlockGrouper, using ExtendedBlockId is overkill, since the bpid is the same for everything

          I'm happy to know that about bpid. Thanks.

          Will address the remaining comments later.

          drankye Kai Zheng added a comment -

          Continued

          The XML file approach seems potentially error-prone. ... Are we planning to put this into the editlog/fsimage down the road, like how we do storage policies?

          Yes I agree and will follow the approach you suggested.

          I think we want to separate out the type of erasure coding from the implementation....

          Good suggestion. It makes sense to swap among implementations using different erasure coding libraries (Java, ISA and Jerasure) for a given codec. It's easy to allow this, since in the current code ErasureCoder and RawErasureCoder are already separated; what's needed is just to allow changing the RawErasureCoder for an ErasureCodec via configuration. A rough illustration follows.
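          For example, the raw coder implementation behind a codec could be selected by a configuration entry along these lines. The key and class names are hypothetical and only meant to illustrate separating the code type (RS) from the implementation (ISA-based vs. pure Java).

            import org.apache.hadoop.conf.Configuration;

            public class RawCoderSelectionExample {
              public static void main(String[] args) {
                Configuration conf = new Configuration();
                // Same "rs" codec; only the backing RawErasureCoder implementation changes
                // (hypothetical key and classes).
                conf.set("io.erasurecode.codec.rs.rawcoder",
                    "org.example.ec.rawcoder.IsaRSRawErasureCoder");
                // Or fall back to a pure-Java implementation:
                // conf.set("io.erasurecode.codec.rs.rawcoder",
                //     "org.example.ec.rawcoder.JavaRSRawErasureCoder");
              }
            }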

          BlockGroup

          As discussed with Zhe and noted in my comments above, I need to hide internal details like SubBlockGroup that only the codec is interested in, rather than exposing them. I need to provide two factory methods or constructors for the two cases of creating a BlockGroup: 1) in non-striping mode, given an array of existing data blocks along with the block group id; 2) in the striping case, only the block group id is needed, because the data blocks are all new and yet to be allocated.

          Since the schema encodes the layout,...

          In the current design, as I understand it, the schema records configuration globally for all files in an EC zone. A BlockGroup object can be regarded as an instance of the schema for an inode or file, recording how the blocks in the group, including data blocks and parity blocks, are organized and ordered. In effect, one copy of the codec-specific configuration (e.g. k=6, m=3, chunk_size=16MB) in the schema, plus some number of BlockGroup instances, needs to be persisted in the fsimage/editlog. It looks like Zhe has good ideas about how to persist block groups efficiently: not all the fields that appear in BlockGroup need to be persisted, since they can be restored or derived from minimal persistent information. Sorry for the confusion if my previous discussions gave you that impression. I do need to update the design and code to clarify all of this. I will catch up soon next week.

          zhz Zhe Zhang added a comment -

          Thanks Kai for the deeper discussion.

          So the question becomes which aspects are covered, and how, when supporting a new code algorithm: 1) how to calculate over a group of bytes, units or chunks, which is covered by ErasureCoder and RawErasureCoder; 2) how to lay out/order the group of chunks, which is covered by BlockGrouper.

          I think this is a good summary of what's included in the current patch. Actually I think the patch will be more trackable if we separate these 2 features. The arithmetic part is primarily used by client/DN (HDFS-7545 and HDFS-7344). The grouper component will be used by NN (HDFS-7339). I suggest we keep this JIRA for configurable/pluggable arithmetic codec calculation, and create another JIRA for configurable/pluggable block layout. This way they can be reviewed and committed more quickly.

          As Kai also echoed above, we should first create a working prototype with simplest schema, then add at least one other schema, and finally figure out how to abstract out the common logic between different schemas.

          In my understanding, the simplest working prototype would be a striping client (HDFS-7545) asking the NN (HDFS-7339) to allocate and persist block groups, using the arithmetic codec provided in this JIRA (HDFS-7337) to calculate Reed-Solomon parity data, and successfully writing an EC file.

          In this flow, all the NN needs from the schema is the numbers of data and parity blocks. I think these 2 numbers can be embedded as an XAttr. We should also assume a pair of default values which are used in the absence of configured XAttrs. A rough example of the XAttr idea follows.
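          The sketch below is illustrative only; the XAttr name and value encoding are made-up assumptions, not what was decided or implemented.

            import java.nio.charset.StandardCharsets;
            import org.apache.hadoop.conf.Configuration;
            import org.apache.hadoop.fs.FileSystem;
            import org.apache.hadoop.fs.Path;

            public class EcXAttrExample {
              public static void main(String[] args) throws Exception {
                FileSystem fs = FileSystem.get(new Configuration());
                // Store the data/parity block counts for an EC zone as an XAttr on the
                // zone root directory; name and value format are hypothetical.
                fs.setXAttr(new Path("/ec/zone1"), "user.hdfs.ec.schema",
                    "numDataBlocks=6,numParityBlocks=3".getBytes(StandardCharsets.UTF_8));
              }
            }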

          drankye Kai Zheng added a comment -

          Good idea, Zhe. We have already created the suggested JIRAs. I will update the code according to the feedback and break it down into smaller patches to submit.

          vinayrpet Vinayakumar B added a comment -

          While trying to understand the implementation I ran TestJavaRSErasureCodec#testCodec, but it failed. Encoded and decoded data doesn't match.

          drankye Kai Zheng added a comment -

          Thanks Vinay for your review. I will check the test when I submit a formal patch for that part.

          drankye Kai Zheng added a comment -

          As discussed, most of this source code will be moved to the hadoop-common side, but I'm not sure if it's OK to still use these JIRA entries that start with HDFS instead of HADOOP.

          Would anyone help confirm this? It would be great if we don't have to change them; that seems reasonable because the work is for HDFS, although for other considerations the code is better off over there.

          andrew.wang Andrew Wang added a comment -

          I don't think it's necessary to move to HADOOP. If anything, I find it conceptually easier if everything related to erasure encoding stayed a subtask of HDFS-7285.

          szetszwo Tsz Wo Nicholas Sze added a comment -

          Do you expect that the erasure code package will be used outside hdfs? If not, we could put everything under hdfs for the moment.

          drankye Kai Zheng added a comment -

          Thanks Andrew Wang for the clarification. I agree.

          drankye Kai Zheng added a comment -

          Hi Tsz Wo Nicholas Sze,

          It would be great if the erasure codec work can be used in other contexts too; in any case it's better not to couple it tightly with HDFS. Having the code stay on the hadoop-common side would also avoid a lot of basic bootstrap work when supporting and incorporating native libraries, as compression, encryption, etc. do.

          drankye Kai Zheng added a comment -

          Updated the document according to the major discussions, online and offline, with Andrew Wang, Zhe Zhang, Li Bo, and Uma Maheswara Rao G. Thanks for your great ideas, which I have incorporated here!

          zhz Zhe Zhang added a comment -

          Thanks Kai for the update! The design looks good to me overall.

          I also took the chance to look at ErasureCodec and ECSchema again. IIUC, ErasureCodec is like a factory or a utility class, which creates ErasureCoder and BlockGrouper based on ECSchema.

          If that's the case, I think we can leverage the pattern of BlockStoragePolicySuite. Something like:

          public static ECSchemaSuite createDefaultSuite() {
              final ECSchema[] schemas =
                  new ECSchema[2];
              final byte RS63 = HdfsConstants.RS63_EC_SCHEMA_ID;
              policies[RS63] = new ECSchema(RS63,
                  HdfsConstants.RS63_EC_SCHEMA_NAME,
                  HdfsConstants.RS_EC_ALGORITHM_ID,
                  6, 3, chunkSize);
              final byte XOR21 = HdfsConstants.XOR21_EC_SCHEMA_ID;
              policies[XOR21] = new ECSchema(XOR21,
                  HdfsConstants.XOR21_EC_SCHEMA_NAME,
                  HdfsConstants.XOR_EC_ALGORITHM_ID,
                  2, 1, chunkSize);
            }
          

          Then NN can just pass around the schema ID when communicating with DN and client, which is much smaller than an ErasureCodec object.

          zhz Zhe Zhang added a comment -

          s/policies/schemas/g

          drankye Kai Zheng added a comment -

          Thanks Zhe Zhang for the review and thoughts.

          ErasureCodec is like a factory or a utility class, which creates ErasureCoder and BlockGrouper based on ECSchema

          ErasureCodec would be the high-level construct in the framework that covers all the potential erasure-code-specific aspects, including but not necessarily limited to ErasureCoder and BlockGrouper, and that allows a new code to be implemented and deployed as a whole. All the underlying code-specific logic can be hooked in via the codec and is only accessible through the codec. I understand there will be more to think about; this is generally one of the major goals for the framework.

          I think we can leverage the pattern of BlockStoragePolicySuite

          It's a good pattern. ErasureCodec follows another good pattern, CompressionCodec.

          Something like:...your illustration codes...

          I understand we need to hard-code a default schema for the system. What we have discussed and been doing is allowing EC schemas to be predefined in an external file (currently XML, as we regularly use in the project). For easy reference, a unique schema name (string) and codec name (string) are used. Do you have any concerns about this approach?

          Then NN can just pass around the schema ID when communicating with DN and client, which is much smaller than an ErasureCodec object.

          Yes, similarly the idea is to pass around the schema NAME between any pair of NN, DN and client. It doesn't mean passing an ErasureCodec object. Is there a confusing sentence I need to clarify in the doc? All the ErasureCodecs are loaded through core-site configuration or service locators, and kept in a map with the codec name as the key. Given the codec name, a codec is fetched from the map. The codec object doesn't need to be passed around; the codec name is. I guess you mean the schema object. In the f2f meetup discussion with Jing Zhao, we mentioned that it may be necessary to pass around the schema object. If we don't want to hard-code all the schemas, then we need to pass the schema object, I guess.

          zhz Zhe Zhang added a comment -

          Thanks Kai for the explanation! Now I have a much clearer understanding of the codec design.

          ErasureCodec would be the high-level construct in the framework ...

          I agree with the high level goal. The reason I think ErasureCodec seems like a utility class is that (at least in the current HADOOP-11643 / HADOOP-11645 code) it is pretty much stateless. It creates ErasureCoder and BlockGrouper based on the given schema type. But as you said we might extend its functionalities in the future. So we can revisit this point later.

          It's a good pattern. ErasureCodec follows another good pattern, CompressionCodec.

          My statement was a little confusing. I wasn't suggesting leveraging BlockStoragePolicySuite to build the ErasureCodec class. I was suggesting we build a similar schema suite class to store all schemas.

          All the {{ErasureCodec}}s are loaded through core-site configuration or service locators, and kept in a map keyed by codec name.

          Agreed. Actually the ECSchemaSuite idea I proposed above is doing the same thing: besides a few hard-coded schemas, it can also parse the XML and load more schemas in the suite. If we don't use something like a schema suite, where should we maintain this map? I see HADOOP-11664 loads schemas from XML. Is there another JIRA handling the management of loaded schemas? If not maybe we can consider ECSchemaSuite? It has a simple task of mapping an ID (either a byte or a String as you proposed) to the ECSchema object.
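          As an illustration of this idea, here is a minimal sketch of such a suite, with hypothetical names standing in for the real ECSchema:

  import java.util.HashMap;
  import java.util.Map;

  /** Sketch of a schema suite: a few hard-coded entries plus whatever is loaded from XML. */
  class EcSchemaSuite {
    /** Hypothetical stand-in for the real ECSchema: codec name plus k and m. */
    static class SchemaInfo {
      final String codecName;
      final int dataUnits;
      final int parityUnits;
      SchemaInfo(String codecName, int dataUnits, int parityUnits) {
        this.codecName = codecName;
        this.dataUnits = dataUnits;
        this.parityUnits = parityUnits;
      }
    }

    private final Map<String, SchemaInfo> schemas = new HashMap<>();

    EcSchemaSuite() {
      schemas.put("RS-6-3", new SchemaInfo("rs", 6, 3));   // hard-coded default
    }

    /** Entries parsed from the predefined XML file would be added here. */
    void add(String id, SchemaInfo schema) {
      schemas.put(id, schema);
    }

    SchemaInfo get(String id) {
      return schemas.get(id);
    }
  }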

          If we don't want to hard-code all the schemas, then we need to pass the schema object, I guess.

          Agreed. Actually, even if we hard-code all schemas, it's still dangerous to pass only the schema ID. The DN might be on a different version of Hadoop than the NN. However, when storing per-dir or per-file schemas, we should only store the IDs.

          drankye Kai Zheng added a comment -

          Thanks Zhe Zhang for the detailed clarification. It's great that we're largely aligned!

          Is there another JIRA handling the management of loaded schemas? If not maybe we can consider ECSchemaSuite?

          I got your point about ECSchemaSuite. HDFS-7866 is the JIRA that does the job you're describing. I do have some rough code for it, where ECSchemaManager is the core part. ECSchemaManager basically wraps a map and contains all the ACTIVE schemas, synced between NN metadata and the predefined ones. On one side it lets dfsadmin request a reload of the predefined schemas (from HADOOP-11664) in an authorization-controlled way; on the other side it serves client requests for the schema list and detailed definitions. I will rethink that code and see whether ECSchemaSuite works better, or borrow its benefits. By the way, HDFS-7859 is used to persist schemas in NN metadata. Anything more for us to fill the gap?

          Actually, even if we hard-code all schemas, it's still dangerous to pass only the schema ID

          I agree. Thanks for the confirmation. With the schema object passed around and available on the DN and client, we can perform schema-driven encoding and decoding, which will be much safer and more flexible.

          Currently I'm working from the bottom up, and hopefully it won't take too long to reach the NN and get all the work hooked together.

          zhz Zhe Zhang added a comment -

          Thanks for the pointers to HDFS-7859 and HDFS-7866. Yes, I believe they are along the same lines as the ECSchemaSuite in the above discussion.

          zhz Zhe Zhang added a comment -

          I had an offline discussion with Kai about simplifying the code structure.

          Basically, with the abstraction provided by ECBlockGroup, we can have a unified codec function signature. This will allow us to get rid of separate ErasureEncoder and ErasureDecoder interfaces.
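          As a rough illustration of the idea, the unified signature might look like the sketch below; the type and method names are hypothetical, not the committed API:

  /** Placeholder for illustration; the real ECBlockGroup holds the data and parity blocks. */
  class BlockGroup { }

  /** Sketch: one entry point instead of separate ErasureEncoder / ErasureDecoder interfaces. */
  interface BlockGroupCoder {
    /**
     * Performs the coding work for the group: generating the parity blocks when
     * encoding, or reconstructing the blocks marked as erased when decoding.
     */
    void code(BlockGroup group);
  }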

          drankye Kai Zheng added a comment -

          Thanks Zhe for the very good thought and the new JIRA HADOOP-11740 to work on it.

          drankye Kai Zheng added a comment -

          As inspired by the discussion in HDFS-7344 with Tsz Wo Nicholas Sze, a codec understands, and should give hints to the NN about, how erased block(s) should be prioritized for recovery. For example, in RS(6,3), 1 erased block is less urgent than 2 or 3 erased blocks. Will update the patch in HADOOP-11645 to reflect this thinking.

          drankye Kai Zheng added a comment -

          Updated the design doc.

          drankye Kai Zheng added a comment -

          I updated the schema & codec doc. This revision entirely rewrites the section about EC Schema, and reflects the latest discussions in various related issues, mainly with Vinayakumar B, Tsz Wo Nicholas Sze, Zhe Zhang, Uma Maheswara Rao G and Rakesh R. Thanks for your further thoughts, comments and questions!

          vinayrpet Vinayakumar B added a comment -

          Thanks for the update Kai Zheng.
          The updated section clearly explains how EC schemas can be made configurable.

          One improvement I am thinking of:
          Instead of maintaining an XML file on the server side, which has to be updated before issuing a command to load from it, let's make the admin's life simpler.

          • We can have explicit commands to add/delete schemas from the shell itself.
          • For adding new schemas (this does not apply to deletion):
            1. If the schema is simple with few parameters, it can be given as an argument to the command in a predefined structure (probably comma-separated).
            2. Otherwise, the command can load the schemas from an XML file maintained locally on the admin side and send them via RPC to the NameNode. In this case the argument will be the file name to load. This way the admin can create the XML and execute the command from any machine using admin authentication.
          • For deletion of schemas, a separate command via a separate RPC, with just the schema name(s) as a parameter, can do the job. Of course, a schema should be deleted only if it is not in use.

          So this way, XML is used just as a friendly way of specifying schemas; the NameNode doesn't need to depend on it.

          Any thoughts?

          drankye Kai Zheng added a comment -

          Thanks Vinayakumar B for the comments, suggestions and more options.

          Before deciding which way to go, I thought it would make sense to first answer the following questions:

          • What possible erasure codes or codecs would we have, now and in the future? XOR, RS, HitchHiker, LRC, and more: typical codes from broad industry experience.
          • What kinds of schema parameters would each possible erasure codec have?

          Let's slow down and let me find some time for further investigation. With these questions well answered, it should not be hard to tell which way sounds better: creating schemas on the command line or through a schema definition file.

          drankye Kai Zheng added a comment -

          For the LRC code, take a typical example, 8+3+2: 8 source blocks (X1, X2, X3, X4, Y1, Y2, Y3, Y4), 2 local parity blocks (L1, L2) and 3 global parity blocks (R1, R2, R3). We need to configure the 8 (how many source blocks) and the 3 (how many global parity blocks); we could avoid configuring the 2 by assuming two local parity groups, each with one local parity block. So a schema like LRC-k8-m3, similar to the RS one RS-k6-m3, should work.

          drankye Kai Zheng added a comment -

          For the Hitchhiker code, Rashmi Vinayak and jack liuquan: for the supported modes, how should they be configured, and what are the key/optional parameters that matter in a deployment? Could you help comment on this? Thanks a lot.

          jack_liuquan jack liuquan added a comment -

          Hi Kai,
          As far as I know, the Hitchhiker code can be configured the same way as the RS code. Using the system-defined schemas RS(6,3) and RS(10,4) is OK.
          The Hitchhiker codec can also be configured as you show in PluggableErasureCodec-v3.pdf.

          rashmikv Rashmi Vinayak added a comment -

          Hi Kai,

          For each of the modes in Hitchhiker, configuration would be identical to RS (as jack liuquan also mentioned). So we could have schemas like HH-k10-m4 just as in RS.

          andrew.wang Andrew Wang added a comment -

          Hi Kai Zheng, this JIRA resurfaced due to the discussion on HDFS-8059 and also the need to persist this information in the fsimage/editlog. I was hoping we could clarify the configuration language for an EC codec and schema.

          I read through the v3 design doc, and please let me know if you think the following would work:

          Schema (stored on the EC zone or INode as a PB so we can evolve it; a sketch follows this list):

          • Codec enum (e.g. RS, LRC, etc), which would also have a friendly human-readable name. The enum is good for efficiency and so the user can only pick from supported codecs.
          • List of k,v pairs for configuration. This could be used for k, m, and any other arbitrary parameters needed by the codec. Very general.
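          To illustrate the shape of such a schema record, here is a Java rendering (not the actual protobuf definition; the names are hypothetical):

  import java.util.Map;

  /** Sketch of the proposed schema: a codec identifier plus free-form options. */
  class EcSchemaRecord {
    /** A wildcard value could be added later for fully pluggable codecs. */
    enum Codec { RS, RS_LEGACY, XOR }

    Codec codec;                   // enum for efficiency and validation
    Map<String, String> options;   // e.g. {"k": "6", "m": "3", "cellsize": "65536"}
  }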

          Client-side:

          • Client would validate the codec of a file against the codecs supported in its own software version. This way, if we add a new codec type, we can restrict old clients from reading it.
          • In client's hdfs-site.xml, we can configure a codec implementation for every codec. This would look something like e.g. dfs.client.ec.codec.reed-solomon.impl = org.apache.hadoop....isal, saying to use ISA-L for reed-solomon.

          This is just to get us going for phase 1. We'd be restricting users to choosing from a list of known-good codecs, while they could still provide their own codec implementations as long as they implement the interfaces.

          When we get to the point of fully-pluggable codecs, we can add a special "wildcard" enum value to support this, and then potentially add new fields to the PB if required. This will require another HDFS upgrade before we can support full pluggability, but it sounds like we still need to figure out interfaces for things like block placement and recovery logic anyway.

          drankye Kai Zheng added a comment -

          Thanks Andrew Wang for the thoughts on moving this forward a bit. They sound good to me. Some points for further discussion:

          ...also the need to persist this information in the fsimage/editlog, ...

          Did you mean the schema? If so, it looks like a point we all agree on. Per the discussion in HDFS-7859 and related issues, we planned to do it in a follow-on, along with support for multiple schemas. For now we only support one system-defined schema, RS(6, 3).

          Codec enum (e.g. RS, LRC, etc), ...When we get to the point of fully-pluggable codecs, we can add a special "wildcard" enum value to support this

          Good to have the enum for built-in codecs for now and the wildcard for customized additional ones in the future.

          In client's hdfs-site.xml, we can configure a codec implementation for every codec. This would look something like...

          In the existing code we're using the following format for a similar purpose. Please confirm whether it looks good.

            /** Raw coder factory for the RS codec. */
            public static final String IO_ERASURECODE_CODEC_RS_RAWCODER_KEY =
                "io.erasurecode.codec.rs.rawcoder";
          
            /** Raw coder factory for the XOR codec. */
            public static final String IO_ERASURECODE_CODEC_XOR_RAWCODER_KEY =
                "io.erasurecode.codec.xor.rawcoder";
          

          The related code resides in CodecUtil, which reads the above configurations. Please check it if necessary.
          When we're clear on what needs to be done for this phase, I'll open an issue to get it done separately. Thanks.

          wqijun wqijun added a comment -

          Hi Kai,

          We would like to add a new RS coder to the EC framework, such as an RS(8,4) coder. More importantly, we want to leverage a C library to accelerate this coder, not only for the Intel chip but also for the IBM POWER chip. We are not sure which branch our work should be based on, 7337 or 7285. Should we add a new JIRA branch? In addition, I have downloaded the latest Hadoop trunk version and found there are native ISA-L acceleration files but no coder class for ISA-L. Do you plan to add these coder classes to Hadoop trunk? Thanks a lot!

          drankye Kai Zheng added a comment -

          Hi Qijun,

          Thanks for your questions. I think it's fine to open a JIRA for your work on the new RS coder, based on trunk. Regarding the extra 8+4 schema, it's easy to add once we implement multiple-schema support. The ISA-L coder is under way and isn't finished yet.

          One question: why would the newly proposed RS coder be coupled with the specific 8+4 schema? Sure, we can discuss this separately on your new JIRA.

          drankye Kai Zheng added a comment -

          Andrew Wang, Zhe Zhang, Rakesh R or anybody

          Trying not to overcomplicate things: based on the existing code we already have, the goal here now seems easier to target.

          In ErasureCodingPolicyManager we have these built-in EC policies:

            private static final int DEFAULT_CELLSIZE = 64 * 1024;
            private static final ErasureCodingPolicy SYS_POLICY1 =
                new ErasureCodingPolicy(ErasureCodeConstants.RS_6_3_SCHEMA,
                    DEFAULT_CELLSIZE, HdfsConstants.RS_6_3_POLICY_ID);
            private static final ErasureCodingPolicy SYS_POLICY2 =
                new ErasureCodingPolicy(ErasureCodeConstants.RS_3_2_SCHEMA,
                    DEFAULT_CELLSIZE, HdfsConstants.RS_3_2_POLICY_ID);
            private static final ErasureCodingPolicy SYS_POLICY3 =
                new ErasureCodingPolicy(ErasureCodeConstants.RS_6_3_LEGACY_SCHEMA,
                    DEFAULT_CELLSIZE, HdfsConstants.RS_6_3_LEGACY_POLICY_ID);
            private static final ErasureCodingPolicy SYS_POLICY4 =
                new ErasureCodingPolicy(ErasureCodeConstants.XOR_2_1_SCHEMA,
                    DEFAULT_CELLSIZE, HdfsConstants.XOR_2_1_POLICY_ID);
            private static final ErasureCodingPolicy SYS_POLICY5 =
                new ErasureCodingPolicy(ErasureCodeConstants.RS_10_4_SCHEMA,
                    DEFAULT_CELLSIZE, HdfsConstants.RS_10_4_POLICY_ID);
          

          In ErasureCodeConstants we have these schemas used by the above policies:

            public static final String RS_CODEC_NAME = "rs";
            public static final String RS_LEGACY_CODEC_NAME = "rs-legacy";
            public static final String XOR_CODEC_NAME = "xor";
            public static final String HHXOR_CODEC_NAME = "hhxor";
          
            public static final ECSchema RS_6_3_SCHEMA = new ECSchema(
                RS_CODEC_NAME, 6, 3);
          
            public static final ECSchema RS_3_2_SCHEMA = new ECSchema(
                RS_CODEC_NAME, 3, 2);
          
            public static final ECSchema RS_6_3_LEGACY_SCHEMA = new ECSchema(
                RS_LEGACY_CODEC_NAME, 6, 3);
          
            public static final ECSchema XOR_2_1_SCHEMA = new ECSchema(
                XOR_CODEC_NAME, 2, 1);
          
            public static final ECSchema RS_10_4_SCHEMA = new ECSchema(
                RS_CODEC_NAME, 10, 4);
          

          HDFS-11314 allows enforcing the set of enabled EC policies on the NameNode as follows:

           <property>
            <name>dfs.namenode.ec.policies.enabled</name>
            <value>RS-6-3-64k, RS-10-4-128k</value>
            <description>Comma-delimited list of enabled erasure coding policies.
              The NameNode will enforce this when setting an erasure coding policy
              on a directory.
            </description>
          </property>
          

          For a codec, the raw coder implementation to use can be configured as follows, taking the rs codec as an example:

          <property>
            <name>io.erasurecode.codec.rs.rawcoder</name>
            <value>org.apache.hadoop.io.erasurecode.rawcoder.RSRawErasureCoderFactory</value>
            <description>
              Raw coder implementation for the rs codec. The default value is a
              pure Java implementation. There is also a native implementation. Its value
              is org.apache.hadoop.io.erasurecode.rawcoder.NativeRSRawErasureCoderFactory.
            </description>
          </property>
          

          So given the above, what is lacking and needed now is a mechanism (say, writing an XML file) to let admin users define their own EC schemas and policies on the NameNode side. The reasons to do this:

          • Users want to try a different codec;
          • Users want to use different codec parameters, for the RS codec, say 10+4 rather than 6+3;
          • Users want to try a cell size other than 64k.

          Yes, it's nice to have. I've heard of people wanting to try things other than the built-in ones available in the code. If it doesn't sound too heavyweight, we can work on it and make it in the release cycle.
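          For illustration, such an XML file might look roughly like the sketch below; the element names and layout are hypothetical here, since the exact format would be settled in the follow-on JIRAs:

  <ecpolicies>
    <schemas>
      <schema id="RSk12m4">
        <codec>rs</codec>
        <k>12</k>
        <m>4</m>
      </schema>
    </schemas>
    <policies>
      <policy>
        <schema>RSk12m4</schema>
        <cellsize>131072</cellsize>
      </policy>
    </policies>
  </ecpolicies>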

          Comments?

          andrew.wang Andrew Wang added a comment -

          Thanks for reviving this Kai Zheng!

          Will we allow users to define new EC codecs? If so, we could exercise it by refactoring out rs-legacy to be a pluggable policy as a test, or HDFS-9345 to implement a dummy codec. It looks like the ECSchemaProto and ErasureCodingPolicyProto support this kind of pluggability already, so we can transmit this info to the client.

          I think it'd be good to keep this info in the normal NN metadata if possible. Without the policy information, you essentially have data loss, and from an operational POV, admins might not be used to backing up an extra XML file.

          andrew.wang Andrew Wang added a comment -

          From looking at the protos, one other question I had is about the overhead of these protos when using the hardcoded policies. There are a bunch of strings and ints, which can be kind of heavy since they're added to each HdfsFileStatus. Should we make the built-in ones identified by purely an ID, with these fully specified protos used for the pluggable policies?

          drankye Kai Zheng added a comment -

          Thanks Andrew Wang for the response!

          Will we allow users to define new EC codecs?

          Ah yes, it's a good point to refactor out the rs-legacy codec: not listing the related policies as built-in, but rather adding them back as pluggable. I think this is what you meant, right? Totally makes sense if so.

          I think it'd be good to keep this info in the normal NN metadata if possible.

          Yes, this is needed if we allow customized codecs and policies. I'll revisit the attached design doc and provide a new revision, making sure something like this is well considered. Basically, by writing an XML file, admin users can define their own codecs, schemas and policies. An admin command is needed to trigger loading it, populating the user policies list managed by ErasureCodingPolicyManager. The list will be persisted into the fsimage. Another command is provided to allow removal of a policy, if the policy isn't referenced by the dfs.namenode.ec.policies.enabled property or used at all (is it hard to know whether a policy is used by a file/folder?).

          Should we make the built-in ones identified by purely an ID, with these fully specified protos used for the pluggable policies?

          This sounds like it could be considered separately because, for either built-in policies or plugged-in policies, the full meta info is maintained either in the code or persisted in the fsimage, so identifying them purely by an ID should work fine. If you agree, we could refactor the code you mentioned above separately.

          Thanks!

          andrew.wang Andrew Wang added a comment -

          Hi Kai,

          Ah yes, it's a good point to refactor out the rs-legacy codec: not listing the related policies as built-in, but rather adding them back as pluggable. I think this is what you meant, right? Totally makes sense if so.

          Yep, exactly

          by writing an XML file, admin users can define their own codecs, schemas and policies.

          I gave the v3 doc a quick look; it sounds like the XML file is basically an input for a "refresh" command, and is unnecessary after it's loaded since the information is persisted to the NN metadata.

          It might be simpler for admins if we still do this over an RPC interface. Rather than specifying all the ECSchema info as arguments, the CLI tool can take the XML file as input. The CLI tool can also perform basic validation, and prompt the user when doing possibly destructive operations like removing a schema.

          I like this a bit better since the admin doesn't need to be SSH'd into the NameNode, know where to put the XML file, or know which NN is active. It might also simplify error reporting for malformed requests, since it'll be returned on the CLI rather than in a log file.

          andrew.wang Andrew Wang added a comment -

          Hit submit too soon; I also filed HDFS-11565 to cover the HdfsFileStatus optimizations. Will leave the existing protos for use by pluggable schemas.

          drankye Kai Zheng added a comment -

          Thanks Andrew!

          It might be simpler for admins if we still do this over an RPC interface. Rather than specifying all the ECSchema info as arguments, the CLI tool can take the XML file as input. The CLI tool can also perform basic validation, and prompt the user when doing possibly destructive operations like removing a schema.

          It's a great suggestion and, as you said, it sounds much better. We use an XML file to define codecs, schemas and policies, and then have the CLI parse and validate it, send it over RPC, and load the entries on the NameNode side. One thing left: do we need a sample XML file in the configuration folder for admins to reference?

          Do you think we should allow removing a schema/policy by this XML means? IMO, the XML file is only for new entries. An extra CLI command could be provided for removal. When doing a removal, the codec/schema/policy name would be used to identify the entry to remove. No update is supported, since admins can remove and then add.

          Glad we're much closer now. Hope we can revive the work soon.

          andrew.wang Andrew Wang added a comment -

          Hi Kai, glad the suggestion was helpful,

          Do you think we should allow removing a schema/policy by this XML means? IMO, the XML file is only for new entries. An extra CLI command could be provided for removal. When doing a removal, the codec/schema/policy name would be used to identify the entry to remove. No update is supported, since admins can remove and then add.

          Agree, sounds good to me. Thanks again for driving this!

          andrew.wang Andrew Wang added a comment -

          Adding the incompatible flag and raising the priority so we know we need this before beta; it's possibly one of the bigger EC JIRAs left too.

          drankye Kai Zheng added a comment -

          Agreed, Andrew. Let's speed up on this, and I'll break it down so we can parallelize.

          drankye Kai Zheng added a comment -

          Unassigned this because more than one person will help with the work.

          Sammi SammiChen added a comment -

          For policy removal, one suggestion is lazy removal. That is, when a user runs the CLI command to remove a policy A, policy A will only be marked as removed in the NN and will no longer be available to apply to new dirs/files. Then, at the next NN restart, if no dir/file is using the policy, policy A can be deleted permanently from the system. With this approach, we leverage the NN restart process, which walks the entire namespace, to find out whether any file uses this specific policy.

          Sammi SammiChen added a comment -

          Updated the design document to reflect the up-to-date design.

          drankye Kai Zheng added a comment -

          Thanks SammiChen for moving this forward.

          The PluggableErasureCodec-v4 design reflects all the latest discussions (mainly with Andrew Wang, SammiChen and Wei-Chiu Chuang) and related work. Important ones are:
          HADOOP-13200 Implement customizable and configurable erasure coders;
          HDFS-11314 Enforce set of enabled EC policies on the NameNode;
          HDFS-11605 Allow user to customize new erasure code policies.

          To close the gap for the upcoming 3.0 ALPHA3 release, we're also actively working on these:
          HDFS-11794 Add ec sub command -listCodec to show currently supported ec codecs
          HDFS-11606 Add CLI cmd to remove an erasure code policy
          HDFS-7859 Erasure Coding: Persist erasure coding policies in NameNode

          As we're close and will make the changes to the NameNode fsimage/editlog in HDFS-7859, it would be great if more folks could take a look at the latest design doc and raise any concerns. In particular, could I ping Tsz Wo Nicholas Sze, Jing Zhao, Zhe Zhang and Vinayakumar B? Thanks for your time!

          eddyxu Lei (Eddy) Xu added a comment -

          Thanks a lot for the new design doc; it looks great, SammiChen.

          For policy removal, one suggestion is lazy removal. That is, when a user runs the CLI command to remove a policy A, policy A will only be marked as removed in the NN and will no longer be available to apply to new dirs/files. Then, at the next NN restart, if no dir/file is using the policy, policy A can be deleted permanently from the system.

          Do you know what the overhead of this check is when the NN restarts? Will it introduce a noticeable slowdown to the NN startup process?

          • Enable policy: while it supports adding/removing policies using the CLI, why does it not support enabling/using a policy via the CLI? This limitation makes the user experience inconsistent.
          • For the configuration keys of codecs, why do we need ".rawcoders" as a suffix for each one?

          The order is defined by a list of coder names separated by commas.

          Do you mean the implementations of the same algorithm? Btw, is there a system-wide default codec / configuration to use?

          Sammi SammiChen added a comment -

          Hi Lei (Eddy) Xu, thanks for reviewing the design doc!

          Do you know what the overhead of this check is when the NN restarts? Will it introduce a noticeable slowdown to the NN startup process?

          My current idea is that when the NN restarts, it will load all file and directory inodes and check whether each file is a striped file or each directory has an EC policy applied. So we can leverage this process: if the file is a striped file or the directory is an EC directory, add one extra step to put its EC policy ID into a global map. Once we have all the EC policy IDs in use, we can decide whether a user-removed EC policy can ultimately be deleted or not. I think one extra simple step will not introduce a noticeable slowdown to the NN startup process.
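          A minimal sketch of that extra step, with hypothetical class and method names (the real logic would live in the inode-loading path):

  import java.util.HashSet;
  import java.util.Set;

  /** Sketch: collect the EC policy IDs actually referenced while inodes are loaded. */
  class EcPolicyUsageTracker {
    private final Set<Byte> usedPolicyIds = new HashSet<>();

    /** Called once per loaded file/directory inode that carries an EC policy ID. */
    void recordUsage(byte ecPolicyId) {
      usedPolicyIds.add(ecPolicyId);
    }

    /** After loading finishes, a policy marked as removed can be purged only if unreferenced. */
    boolean canPurge(byte removedPolicyId) {
      return !usedPolicyIds.contains(removedPolicyId);
    }
  }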
          We have also thought about an alternative solution: user-removed EC policies would no longer be visible to users through any NN API, but would not actually be deleted from the system, leaving some stale information in the system.

          Enable policy: while it supports adding/removing policies using the CLI, why does it not support enabling/using a policy via the CLI? This limitation makes the user experience inconsistent.

          It has a history. At first, there were only built-in EC policies, such as RS(6,3) and RS(10,4). Then an improvement was made to keep users from applying an EC policy that is not feasible for their cluster; for example, if the cluster has only 6 datanodes, RS(10,4) is not feasible at all. The improvement was made through the 'dfs.namenode.ec.policies.enabled' property, which requires admin privilege to set the enabled policies and a cluster restart, out of caution. Then came user-defined EC policies, so we think 'dfs.namenode.ec.policies.enabled' can be leveraged to enable or disable user-defined policies, also out of caution.
          Meanwhile, I have discussed this question with Kai. Adding an extra API is also an alternative, but I want to hear more from you all.

          For the configuration keys of codecs, why do we need ".rawcoders" as a suffix for each one? Do you mean the implementations of the same algorithm?

          Yes, they are different implementations of the same algorithm. For example, for the RS algorithm there are the pure Java coder, the ISA-L coder and the HDFS RAID coder. And because the configuration keys are used to define the order of the different raw coders, we use .rawcoders as the suffix.
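          For example, such an ordered-coder configuration might look like the following; the exact key and coder names here are illustrative, since they were still being finalized under HADOOP-13200:

  <property>
    <name>io.erasurecode.codec.rs.rawcoders</name>
    <value>rs_native,rs_java</value>
    <description>
      Comma-separated coder names for the rs codec, tried in order; the first
      implementation available on the node is used.
    </description>
  </property>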

          is there a system-wide default codec / configuration to use?

          Yes. There are several system-wide codecs to use, including the RS codec, the RS legacy codec and the XOR codec.

          drankye Kai Zheng added a comment -

          Lei (Eddy) Xu, do the above comments address your questions? Any further thoughts? Thanks for taking the time to look at this aspect; very helpful. Andrew Wang, any concern if we proceed with a CLI cmd to enable/disable an EC policy? I mean, as discussed above, as an alternative to the configuration property we introduced before.

          eddyxu Lei (Eddy) Xu added a comment -

          Hi, Kai Zheng and SammiChen

          Thanks a lot for the reply. The explanation helps a lot.

          There are several system-wide codecs available, including the RS codec, the RS legacy codec and the XOR codec.

          Is there a way to choose a system-wide default codec? So that after the cluster is initialized, users and admins can simply mark a zone / directory as "erasure coded", instead of choosing from several different codecs, each of which has its own trade-offs that the user / admin needs to understand?

          while it supports adding / removing policies using the CLI, it does not support enabling / using a policy via the CLI?

          My concern is that, even if the admin is able to add a policy via the API dynamically, it still requires the admin to reboot the NN, or ssh into the NN / change conf files and reload NN confs, to enable the policy. It makes the workflow complicated. I think using the API / CLI and ssh-ing into the NN / changing conf files should be two different sets of operations. If possible, it is more consistent to do the EC policy management entirely in one approach, or in both. The current design does half of the management in each approach.

          Thanks.
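          (For concreteness, the admin experience being asked about might look like the sketch below. It assumes the standard hdfs ec subcommands and that a system-wide default policy is configured so that -policy can be omitted; the default-policy behaviour is exactly what is being discussed here, so treat this as illustrative.)

              # Explicitly choose a policy for a directory ...
              hdfs ec -setPolicy -path /data/warehouse -policy RS-6-3-64k

              # ... or, with a system-wide default policy in place, omit the policy name entirely
              hdfs ec -setPolicy -path /data/warehouse
              hdfs ec -getPolicy -path /data/warehouse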

          andrew.wang Andrew Wang added a comment -

          Hi folks, thanks for the discussion,

          Is there a way to choose a system-wide default codec? So that after the cluster is initialized, users and admins can simply mark a zone / directory as "erasure coded", instead of choosing from several different codecs, each of which has its own trade-offs that the user / admin needs to understand?

          We had a system default policy originally, but then moved away from it. I'm open to bringing it back if we believe that there's typically only one policy in a cluster. I think this is likely true.

          My concern is that, even if the admin is able to add a policy via the API dynamically, it still requires the admin to reboot the NN, or ssh into the NN / change conf files and reload NN confs, to enable the policy. It makes the workflow complicated.

          Yea, this is true. I can envision how this would work with just CLI commands: add/remove/enable/disable. I don't know how we'd do this with just config, since we want the safety of persisting things in the fsimage.

          So, shall we do it all via API?
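          (To make the all-CLI option concrete, a policy lifecycle along these lines might look like the sketch below. The add/remove/set/get subcommands already exist in the hdfs ec tool; the enable/disable verbs are the addition being discussed, so all of this, including the RS-6-3-1024k policy name, is illustrative rather than final syntax.)

              # Hypothetical end-to-end lifecycle of a user-defined policy, all via CLI:
              hdfs ec -addPolicies -policyFile my_ec_policies.xml      # policy persisted in the fsimage
              hdfs ec -enablePolicy -policy RS-6-3-1024k               # no NN restart or conf file edit
              hdfs ec -setPolicy -path /data/cold -policy RS-6-3-1024k
              hdfs ec -disablePolicy -policy RS-6-3-1024k              # stop new use of the policy
              hdfs ec -removePolicy -policy RS-6-3-1024k               # only once no file uses it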

          Sammi SammiChen added a comment - - edited

          Thanks Lei (Eddy) Xu and Andrew Wang for the discussion and feedback! Agree that a CLI command to enable/disable an erasure coding policy will be very helpful to end users. HDFS-11870 has been created to track this. I will move on with the implementation.

          danielpol Daniel Pol added a comment -

          Maybe you can help me with some EC info. This is with alpha4. Not sure if this is the correct JIRA to comment under.
          I’ve built a custom EC policy and managed to use it properly (RS-5-2-256k, just because I have only 7 datanodes in my setup and want lower overhead while still supporting 2 failures). It generates data as expected and works fine.

          The problem I have is that when I restart HDFS it all gets lost, and there are 2 possible cases:
          1. If I have it enabled in dfs.namenode.ec.policies.enabled, the namenode won’t start up, as it’s not part of the “standard” EC policies. The error logged is:
          2017-07-13 18:47:37,339 ERROR namenode.NameNode (NameNode.java:main(1709)) - Failed to start namenode.
          java.lang.IllegalArgumentException: EC policy 'RS-5-2-256k' specified at dfs.namenode.ec.policies.enabled is not a valid policy. Please choose from list of available policies: [RS-6-3-64k, RS-3-2-64k, RS-LEGACY-6-3-64k, XOR-2-1-64k, RS-10-4-64k, ]
          at org.apache.hadoop.hdfs.server.namenode.ErasureCodingPolicyManager.init(ErasureCodingPolicyManager.java:123)
          at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.<init>(FSNamesystem.java:868)
          at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFromDisk(FSNamesystem.java:693)
          at org.apache.hadoop.hdfs.server.namenode.NameNode.loadNamesystem(NameNode.java:664)
          at org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java:726)
          at org.apache.hadoop.hdfs.server.namenode.NameNode.<init>(NameNode.java:949)
          at org.apache.hadoop.hdfs.server.namenode.NameNode.<init>(NameNode.java:928)
          at org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1637)
          at org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:1704)

          2. If I don’t have it enabled, the namenode starts up but doesn’t leave safe mode, because it doesn’t know how to read the blocks (without knowing the custom EC policy).

          Right now, I end up deleting all EC data before restarts (or manually leaving safe mode and deleting the EC data). This delays my testing quite a bit since I have to recreate the data all the time.

          I’m not 100% sure whether this is a bug or just something I don’t know how to configure properly. Maybe there’s some undocumented setting I’m not aware of that can point to my custom EC policy XML file at startup.
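          (For context, a user-defined policy such as RS-5-2-256k is typically introduced with a small policy file plus one CLI call, roughly as sketched below. The element names follow the user_ec_policies.xml template shipped with Hadoop 3.x and may differ slightly between releases; as Kai explains next, at this point the added policy was not yet persisted across a NameNode restart, which is what Daniel is running into.)

              <!-- my_ec_policies.xml: hypothetical file defining an RS schema with k=5, m=2 -->
              <configuration>
                <layoutversion>1</layoutversion>
                <schemas>
                  <schema id="RSk5m2">
                    <codec>rs</codec>
                    <k>5</k>
                    <m>2</m>
                    <options></options>
                  </schema>
                </schemas>
                <policies>
                  <policy>
                    <schema>RSk5m2</schema>
                    <cellsize>262144</cellsize>  <!-- 256 KB cells, yielding the name RS-5-2-256k -->
                  </policy>
                </policies>
              </configuration>

              # Register the policy with the NameNode:
              hdfs ec -addPolicies -policyFile my_ec_policies.xml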

          drankye Kai Zheng added a comment -

          Hi Daniel Pol,

          Yes, you're asking in the right place. Please note this isn't finished yet; the last step is to persist the configured EC policies into the fsimage, so that in your case they'll be recovered when HDFS restarts. The work is tracked in HDFS-7859, so please stay tuned. It's likely to be solved in the beta 1 release.

          danielpol Daniel Pol added a comment -

          Thanks for letting me know; I'll wait for beta1 for this feature. I'll continue testing with my custom EC policy in the meantime.

          drankye Kai Zheng added a comment -

          Thanks for the testing and contribution! Note that, apart from user-defined policies not yet being persisted and the default system policy still needing to be supported, the other functionality discussed here is already done and should be fine. If you find anything that doesn't work, please let us know or file bugs. Thanks!

          andrew.wang Andrew Wang added a comment -

          Hey folks, are we on track for completing this by beta1 in ~three weeks? We want metadata and API changes to be complete by then.

          Sammi SammiChen added a comment -

          Hi Andrew Wang, the remaining task is to persist erasure coding policies to the fsimage. We are on track to complete it before beta1, and review help is very much needed.

          andrew.wang Andrew Wang added a comment -

          I think we've resolved the scope targeted for beta1, shall we close this umbrella JIRA and move out the remaining subtasks?

          drankye Kai Zheng added a comment -

          Good idea, Andrew. Let's do that.

          drankye Kai Zheng added a comment -

          Resolved this issue with the release note added. The remaining issues here were moved out, as they're not targeted for 3.0.

          andrew.wang Andrew Wang added a comment -

          Thanks Kai!

          andrew.wang Andrew Wang added a comment -

          Assigning this to Sammi for release notes process.

          Sammi SammiChen added a comment -

          Hi Andrew Wang, the release note is ready. Is there anything I need to do besides that?

          andrew.wang Andrew Wang added a comment -

          Nothing additional to do, just wanted to make sure all fixed JIRAs have an assignee.

          xiaochen Xiao Chen added a comment -

          Hi Kai Zheng and SammiChen,
          Thanks a lot for working on this cool feature!
          Is there a jira for persisting EC policy to NN metadata? I searched the linked jiras but didn't find one.

          rakeshr Rakesh R added a comment -

          Is there a jira for persisting EC policy to NN metadata?

          HDFS-7859; hopefully this JIRA will give you more info.

          Sammi SammiChen added a comment -

          Hi Xiao Chen, HDFS-7859 is for persisting EC policies in the protobuf fsimage, and HDFS-12395 is for supporting the EC policy APIs in the edit log.

          Thanks Rakesh R!

          xiaochen Xiao Chen added a comment -

          Got it, will look into those. Thanks for the prompt response!


            People

            • Assignee:
              Sammi SammiChen
              Reporter:
              zhz Zhe Zhang
            • Votes:
              2
              Watchers:
              26
