
Key: HADOOP-5793
Type: New Feature
Status: Open
Priority: Minor
Assignee: Unassigned
Reporter: elhoim gibor
Votes: 0
Watchers: 9
Hadoop Common

High speed compression algorithm like BMDiff

Created: 08/May/09 01:11 PM   Updated: 08/May/09 01:11 PM
Component/s: None
Affects Version/s: None
Fix Version/s: None

Time Tracking:
Not Specified


Description
Add a high-speed compression algorithm like BMDiff.
It reportedly reaches about 100 MB/s for writes and about 1000 MB/s for reads, and compressed 2.1 billion web pages from 45.1 TB to 4.2 TB.
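
For a rough sense of scale, the figures quoted above can be turned into a compression ratio and an average page size. This is only an arithmetic sketch: the function names are made up for illustration, and it assumes decimal terabytes (1 TB = 10^12 bytes), which the description does not specify.

```python
# Arithmetic sketch based on the figures quoted in this issue.
# Assumption: 1 TB = 10**12 bytes (the issue does not say which unit is meant).
# All function names here are hypothetical helpers, not part of Hadoop.

BYTES_PER_TB = 10**12


def compression_ratio(original_tb, compressed_tb):
    """Return original size divided by compressed size."""
    return original_tb / compressed_tb


def space_saving(original_tb, compressed_tb):
    """Return the fraction of space saved by compression (0.0 to 1.0)."""
    return 1.0 - compressed_tb / original_tb


def bytes_per_page(total_tb, pages):
    """Return the average number of bytes per page."""
    return total_tb * BYTES_PER_TB / pages


if __name__ == "__main__":
    original_tb = 45.1   # uncompressed size from the description
    compressed_tb = 4.2  # compressed size from the description
    pages = 2.1e9        # number of web pages from the description

    print("ratio: %.2f:1" % compression_ratio(original_tb, compressed_tb))
    print("space saved: %.1f%%" % (100 * space_saving(original_tb, compressed_tb)))
    print("avg page, raw: %.0f bytes" % bytes_per_page(original_tb, pages))
    print("avg page, compressed: %.0f bytes" % bytes_per_page(compressed_tb, pages))
```

Under these assumptions the ratio works out to a little under 11:1, with an average page of roughly 21 KB shrinking to roughly 2 KB.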

Reference:
http://norfolk.cs.washington.edu/htbin-post/unrestricted/colloq/details.cgi?id=437
Jeff Dean's 2005 talk on Google's architecture (around 46:00).

http://feedblog.org/2008/10/12/google-bigtable-compression-zippy-and-bmdiff/

http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=755678

A reference implementation exists in Hypertable.



There are no comments yet on this issue.