Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-9102

HFile block pre-loading for large sequential scan

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 0.89-fb
    • Fix Version/s: None
    • Component/s: None
    • Labels:
      None

      Description

      The current HBase scan model cannot take full advantage of the aggrediate disk throughput, especially for the large sequential scan cases. And for the large sequential scan, it is easy to predict what the next block to read in advance so that it can pre-load and decompress/decoded these data blocks from HDFS into block cache right before the current read point.

      Therefore, this jira is to optimized the large sequential scan performance by pre-loading the HFile blocks into the block cache in a stream fashion so that the scan query can read from the cache directly.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                liyin Liyin Tang
                Reporter:
                liyin Liyin Tang
              • Votes:
                0 Vote for this issue
                Watchers:
                12 Start watching this issue

                Dates

                • Created:
                  Updated: