[ACCUMULO-449] Failed log copy is not restarted - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 1.3.5-incubating, 1.4.0
Fix Version/s: 1.4.1, 1.5.0
Component/s: logger, master
Labels:
- 14_qa_bug

Description

I shut a single node instance down uncleanly. When I restarted it the logger did not have enough memory to preform the log sort, it got an OOME and died. I edited accumulo-env.sh and gave the logger process more memory. I restarted the logger process. However, the log recovery never restarted.

The master was continually printing message like the following.

06 17:07:16,609 [master.CoordinateRecoveryTask] DEBUG: Copying 65c48045-88c1-48e4-93d3-4865a9a86050 from xxx.xxx.xxx.xxx:11224 (for 1210.306000 seconds) 0.0

After 20m I restarted the master and then log recovery proceeded.

Attachments

Activity

People

Assignee:: Keith Turner

Reporter:: Keith Turner

Votes:: 0 Vote for this issue

Watchers:: 0 Start watching this issue

Dates

Created:: 06/Mar/12 22:14

Updated:: 20/Apr/12 20:54

Resolved:: 20/Apr/12 20:53