Details
- Type: Sub-task
- Status: Resolved
- Priority: Major
- Resolution: Fixed
- Affects Version/s: None
- Fix Version/s: None
- Component/s: None
- Hadoop Flags: Reviewed
Description
The existing Java RS coders, based on the GaloisField implementation, have several drawbacks and limitations:
- The decoder unnecessarily computes units that are not actually erased (HADOOP-11871);
- The decoder requires the inputs to the decode API in parity units + data units order (HADOOP-12040); a data-units-first layout is sketched below, after the description;
- The concrete coding algorithms and coding matrix need to support or align with the native erasure coders, so that Java coders and native coders can be easily swapped in and out, transparently to HDFS (HADOOP-12010);
- It is unnecessarily flexible and incurs some overhead: since HDFS erasure coding is an entirely byte-based data system, no symbol size other than 256 needs to be considered (see the GF(2^8) sketch right after this list).
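To make the byte-based point concrete, here is a minimal sketch of table-driven GF(2^8) multiplication, the kind of arithmetic a symbol-size-256 RS coder builds on. It is illustrative only: the class name and the choice of 0x11D as the field polynomial are assumptions for this sketch, not taken from the Hadoop code base.
{code:java}
/** Minimal, illustrative GF(2^8) arithmetic via log/exp tables.
 *  The field polynomial 0x11D is a common choice; this is a sketch,
 *  not the actual Hadoop implementation. */
public final class GF256Sketch {
  private static final int[] EXP = new int[512];
  private static final int[] LOG = new int[256];

  static {
    int x = 1;
    for (int i = 0; i < 255; i++) {
      EXP[i] = x;
      LOG[x] = i;
      x <<= 1;                  // multiply by the generator element 2
      if ((x & 0x100) != 0) {
        x ^= 0x11D;             // reduce modulo the field polynomial
      }
    }
    // Duplicate the table so mul() can skip a modulo operation.
    for (int i = 255; i < 512; i++) {
      EXP[i] = EXP[i - 255];
    }
  }

  /** Product of two field elements (bytes treated as GF(2^8) symbols). */
  static int mul(int a, int b) {
    if (a == 0 || b == 0) {
      return 0;
    }
    return EXP[LOG[a] + LOG[b]];
  }

  public static void main(String[] args) {
    // Spot-check: 2 * 128 overflows to 0x100, reduced by 0x11D to 29.
    System.out.println(mul(2, 128));   // prints 29
  }
}
{code}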
This calls for implementing another RS coder in pure Java, in addition to the existing GaloisField-based coder from HDFS-RAID. The new Java RS coder will be favored and used by default, resolving the related issues. The old HDFS-RAID-originated coder will remain for comparison and for converting old data from HDFS-RAID systems.
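For the inputs-order point (HADOOP-12040), the following hedged sketch shows a data-units-first layout for the decode inputs, with erased units passed as null. The class name is an assumption, and the decode call at the end is commented out with an assumed signature; neither is quoted from the Hadoop API.
{code:java}
import java.nio.ByteBuffer;
import java.util.Arrays;

// Illustrative only: demonstrates a data-units-first inputs layout for
// decoding, with erased units passed as null. The class name and the
// commented decode signature are assumptions, not the Hadoop API.
public class DecodeInputsOrderSketch {
  public static void main(String[] args) {
    final int numDataUnits = 6, numParityUnits = 3, chunkSize = 1024;
    ByteBuffer[] inputs = new ByteBuffer[numDataUnits + numParityUnits];

    // Data-first layout: indexes 0..5 hold data units, 6..8 parity units.
    for (int i = 0; i < inputs.length; i++) {
      inputs[i] = ByteBuffer.allocate(chunkSize); // stand-in for real chunks
    }

    // Suppose data unit 1 and parity unit 1 were lost: pass them as null
    // and list their positions so the decoder rebuilds exactly those units.
    int[] erasedIndexes = {1, numDataUnits + 1};
    inputs[1] = null;
    inputs[numDataUnits + 1] = null;

    ByteBuffer[] outputs = new ByteBuffer[erasedIndexes.length];
    for (int i = 0; i < outputs.length; i++) {
      outputs[i] = ByteBuffer.allocate(chunkSize);
    }
    // decoder.decode(inputs, erasedIndexes, outputs);  // hypothetical call
    System.out.println("erased positions: " + Arrays.toString(erasedIndexes));
  }
}
{code}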
Attachments
Issue Links
- depends upon
  - HADOOP-12040 Adjust inputs order for the decode API in raw erasure coder (Resolved)
  - HADOOP-12047 Indicate preference not to affect input buffers during coding in erasure coder (Resolved)
  - HADOOP-12327 Initialize output buffers with ZERO bytes in erasure coder (Resolved)
- is related to
  - HADOOP-12808 Rename the RS coder from HDFS-RAID as legacy (Resolved)