Hadoop Common
  1. Hadoop Common
  2. HADOOP-5793

High speed compression algorithm like BMDiff

    Details

    • Type: New Feature New Feature
    • Status: Open
    • Priority: Minor Minor
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None
    • Labels:
      None

      Description

      Add a high speed compression algorithm like BMDiff.
      It gives speeds ~100MB/s for writes and ~1000MB/s for reads, compressing 2.1billions web pages from 45.1TB in 4.2TB

      Reference:
      http://norfolk.cs.washington.edu/htbin-post/unrestricted/colloq/details.cgi?id=437
      2005 Jeff Dean talk about google architecture - around 46:00.

      http://feedblog.org/2008/10/12/google-bigtable-compression-zippy-and-bmdiff/

      http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=755678

      A reference implementation exists in HyperTable.

        Issue Links

          Activity

          Gavin made changes -
          Link This issue is depended upon by HBASE-2655 [ HBASE-2655 ]
          Gavin made changes -
          Link This issue blocks HBASE-2655 [ HBASE-2655 ]
          Michele Catasta made changes -
          Assignee Michele Catasta [ pirroh ]
          Michele Catasta made changes -
          Field Original Value New Value
          Link This issue blocks HBASE-2655 [ HBASE-2655 ]
          elhoim gibor created issue -

            People

            • Assignee:
              Michele Catasta
              Reporter:
              elhoim gibor
            • Votes:
              0 Vote for this issue
              Watchers:
              24 Start watching this issue

              Dates

              • Created:
                Updated:

                Development