Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-8888

Tweak retry settings some more, *some more*

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 0.95.2
    • None
    • None
    • Reviewed
    • Hide
      Changes retries so we ram up quickly from a pause of 100ms between retries to 10 seonds between retries. Once at ten seconds, we keep retrying unto the maxium of 35 times by default which works out to about five minutes. Change this configuration to suit your work load.
      Show
      Changes retries so we ram up quickly from a pause of 100ms between retries to 10 seonds between retries. Once at ten seconds, we keep retrying unto the maxium of 35 times by default which works out to about five minutes. Change this configuration to suit your work load.

    Description

      Follow on from hbase-8776.

      Need to fix retries and timeouts. We cut them down so much hbase-it tests fail.

      From https://issues.apache.org/jira/browse/HBASE-8776?focusedCommentId=13698762&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13698762 @nkeywal says:

      I would like to change
      hbase.client.retries.number -> 30 (instead of 14 or 20 today)
      hbase.client.pause -> 500 (instead of 100 or 1000 today).
      Context: see HBASE-6295.
      As well, would it make sense to remove all the hbase-site.xml and hbase-defaults.xml to rely only on the defaults in the code. This would trigger another set of issues, as sometimes the defaults are duplicated and different. But these are bugs as well. Imho, this duplication is confusing and it leads to unreliable behavior as we don't really know what are the setting actually used.
      

      Regards removing hbase-site.xml from everywhere to rely on defaults in code, over in hbase-8776 I tried removing them and way too many tests failed. Looks like it'd be tough removing them.

      Attachments

        1. 8888.txt
          115 kB
          Michael Stack
        2. 8888v2.txt
          7 kB
          Michael Stack
        3. 8888v3.txt
          8 kB
          Michael Stack
        4. 8888v4.txt
          8 kB
          Michael Stack
        5. 8888v5.txt
          8 kB
          Michael Stack
        6. 8888v6.txt
          2 kB
          Michael Stack
        7. 8888v7.txt
          18 kB
          Michael Stack
        8. 8888v7.txt
          10 kB
          Michael Stack
        9. 8888v8.txt
          19 kB
          Michael Stack
        10. 8888v8.txt
          10 kB
          Michael Stack
        11. 8888v9.txt
          15 kB
          Michael Stack

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            stack Michael Stack
            stack Michael Stack
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment