  1. HBase
  2. HBASE-6922 HBase scanner performance improvements
  3. HBASE-6770

Allow scanner setCaching to specify size instead of number of rows

Details

    • Sub-task
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 0.89-fb
    • Client, regionserver
    • None

    Description

      Currently, we have the following APIs to customize the behavior of scans:
      setCaching() - how many rows to cache on the client to speed up scans
      setBatch() - maximum number of columns to return per row, to prevent a very large response.

      Ideally, we should be able to specify a memory buffer size instead, because:
      1. That would take care of both of these use cases.
      2. It requires no knowledge of the size of the rows or cells, since what we are ultimately worried about is the available memory.
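
      The idea can be illustrated with a minimal sketch that is independent of HBase: rather than counting rows (as setCaching() does) or columns (as setBatch() does), the buffer is flushed once a byte budget is reached. The class name SizeBoundedBatcher, the method batchBySize, and the parameter maxBytes are hypothetical names chosen for this illustration and are not part of the HBase API; each row is modelled as a plain byte array.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only: not HBase code and not the patch for this issue.
class SizeBoundedBatcher {

    /**
     * Groups rows into batches so that the total byte size of each batch
     * stays within maxBytes. A single row larger than maxBytes is still
     * emitted in a batch of its own, so the scan always makes progress.
     */
    static List<List<byte[]>> batchBySize(List<byte[]> rows, long maxBytes) {
        List<List<byte[]>> batches = new ArrayList<>();
        List<byte[]> current = new ArrayList<>();
        long currentBytes = 0;

        for (byte[] row : rows) {
            // Flush the current batch before adding a row that would exceed the budget.
            if (!current.isEmpty() && currentBytes + row.length > maxBytes) {
                batches.add(current);
                current = new ArrayList<>();
                currentBytes = 0;
            }
            current.add(row);
            currentBytes += row.length;
        }

        // Emit whatever is left over after the last row.
        if (!current.isEmpty()) {
            batches.add(current);
        }
        return batches;
    }
}
```

      With this approach the caller picks a single number (the memory it can afford per response), and the number of rows per batch adapts to however large the rows turn out to be.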

            People

              xiaocai_ss Chen Jin
              karthik.ranga Karthik Ranganathan
              Votes: 0
              Watchers: 15
