Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-430

Performance: Scanners and getRow return maps with duplicate data

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • None
    • 0.2.0
    • None
    • None

    Description

      Right now, whenever we get back multiple cells worth of data at a time, we do so in a map of HStoreKey->byte[]. This means that there is a duplicated Text row and long timestamp at the very least between every cell. This is quite a bit wasted. It also means we have to do a lot of translation every time.

      We could create a new Writable that contains just one row, one timestamp, and a map of Text->byte[].

      Attachments

        1. 430.patch
          31 kB
          Bryan Duxbury
        2. 430-v2.patch
          32 kB
          Bryan Duxbury
        3. 430-v3.patch
          37 kB
          Bryan Duxbury
        4. 430-v4.patch
          36 kB
          Bryan Duxbury
        5. 430-v5.patch
          36 kB
          Bryan Duxbury

        Issue Links

          Activity

            People

              bryanduxbury Bryan Duxbury
              bryanduxbury Bryan Duxbury
              Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: