Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-16091

Canary takes lot more time when there are delete markers in the table

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.0.0
    • Fix Version/s: 1.4.0, 0.98.21, 1.3.3, 2.0.0
    • Component/s: canary
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      We have a table which has lot of delete markers and we running Canary test on a regular interval sometimes tests are timing out because to reading first row would skip all these delete markers. Since purpose of Canary is to find health of the region, i think keeping raw=true would not defeat the purpose but provide good perf improvement.

      Following are the example of one such scan where
      without changing code it took 62.3 sec for onre region scan
      2016-06-23 08:49:11,670 INFO [pool-2-thread-1] tool.Canary - read from region <tablename>.<region> column family 0 in 62338ms

      whereas after setting raw=true, it reduced to 58ms
      2016-06-23 08:45:20,259 INFO [pool-2-thread-1] tests.Canary - read from region <tablename>.<region> column family 0 in 58ms

      Taking this over multiple tables , with multiple region would be a good performance gain.

        Attachments

        1. HBASE-16091.02.patch
          10 kB
          Vishal Khandelwal
        2. HBASE-16091.01.patch
          10 kB
          Vishal Khandelwal
        3. HBASE-16091.00.patch
          0.6 kB
          Vishal Khandelwal

          Activity

            People

            • Assignee:
              vishk Vishal Khandelwal
              Reporter:
              vishk Vishal Khandelwal
            • Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: