Uploaded image for project: 'Accumulo'
  1. Accumulo
  2. ACCUMULO-449

Failed log copy is not restarted

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 1.3.5-incubating, 1.4.0
    • 1.4.1, 1.5.0
    • logger, master

    Description

      I shut a single node instance down uncleanly. When I restarted it the logger did not have enough memory to preform the log sort, it got an OOME and died. I edited accumulo-env.sh and gave the logger process more memory. I restarted the logger process. However, the log recovery never restarted.

      The master was continually printing message like the following.

      06 17:07:16,609 [master.CoordinateRecoveryTask] DEBUG: Copying 65c48045-88c1-48e4-93d3-4865a9a86050 from xxx.xxx.xxx.xxx:11224 (for 1210.306000 seconds) 0.0
      

      After 20m I restarted the master and then log recovery proceeded.

      Attachments

        Activity

          People

            kturner Keith Turner
            kturner Keith Turner
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: