HBASE-14317

Stuck FSHLog: bad disk (HDFS-8960) and can't roll WAL


Details

    • Type: Bug
    • Status: Closed
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: 1.2.0, 1.1.1
    • Fix Version/s: 1.2.0, 1.3.0, 2.0.0
    • Component/s: wal
    • Labels: None
    • Hadoop Flags: Reviewed
      Release Note:
      Tighten up WAL-use semantics.

      1. If an append or a sync throws an exception, all subsequent attempts at using the log will also throw this same exception. The WAL is now a lame-duck until you roll it.
      2. If an append succeeds but we then fail to sync it, this is a fatal error; the container must abort so the WAL logs can be replayed, even though we have told the client that the append failed.

      The above rules were applied laxly before this change; it used to be possible for a successful sync to go in over the top of a failed append. This patch fixes that (a minimal sketch of the intended semantics follows).
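
      For illustration, here is a minimal sketch of the lame-duck behaviour described in rule 1; the class and method names are invented for this example and are not the actual FSHLog code. The idea is simply that the first append or sync failure is remembered and rethrown on every later call until the WAL is rolled.

// Hypothetical sketch of the "lame-duck" WAL semantics described above.
// Names are illustrative only, not the real FSHLog implementation.
import java.io.IOException;

class LameDuckWal {
  // The first append/sync failure is remembered here until the WAL is rolled.
  private volatile Throwable damage;

  void append(byte[] edit) throws IOException {
    checkDamaged();
    try {
      doAppend(edit);                       // write the edit to the underlying writer
    } catch (Throwable t) {
      damage = t;                           // rule 1: WAL is lame-duck from now on
      throw new IOException("Append failed; WAL must be rolled", t);
    }
  }

  void sync() throws IOException {
    checkDamaged();
    try {
      doSync();                             // hflush/hsync the underlying writer
    } catch (Throwable t) {
      damage = t;                           // rule 1 again: poison all later calls
      throw new IOException("Sync failed; WAL must be rolled", t);
    }
  }

  void roll() throws IOException {
    replaceWriter();                        // open a new WAL file
    damage = null;                          // only a roll clears the lame-duck state
  }

  private void checkDamaged() throws IOException {
    if (damage != null) {
      // Rule 1: every subsequent append/sync rethrows the original failure.
      throw new IOException("WAL is lame-duck since a previous failure", damage);
    }
  }

  // Stand-ins for the real writer operations.
  private void doAppend(byte[] edit) throws IOException {}
  private void doSync() throws IOException {}
  private void replaceWriter() throws IOException {}
}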

      Also fixed a hang in the WAL subsystem that occurred when a request to pause the write pipeline landed on a failed sync, before the roll request's sync got scheduled.


      TODO: Revisit our WAL system. HBASE-12751 helps rationalize our write pipeline. In particular, it manages the sequenceid inside mvcc, which should let us purge the mechanism that writes empty, unflushed appends just to obtain the next sequenceid; that mechanism is problematic when the WAL goes lame-duck. Let's get it in.
      TODO: A successful append followed by a failed sync probably only requires that we replace the WAL (provided we have signalled the client that the append failed). The bummer is that, with replication, these last appends might make it to the sink cluster or get replayed during recovery. Should HBase keep track of its own WAL length? Or should the sequenceid of the last successful sync be passed along when doing recovery and replication?

    Description

      hbase-1.1.1 and hadoop-2.7.1

      We try to roll logs because we can't append (see HDFS-8960), but we get stuck. See the attached thread dump and associated log. What is interesting is that the syncers are waiting to take syncs to run while, at the same time, we want to flush, so we are waiting on a safe point, yet there seems to be nothing in our ring buffer; did we go to roll the log and fail to add a safe-point sync to clear out the ring buffer?

      Needs a bit of study. Try to reproduce.
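
      To make the suspected hang concrete, here is a toy model of the safe-point handshake between the log roller and the ring-buffer consumer; it is not HBase code and the names are invented for the example. If the safe-point entry never gets published to the buffer, for instance because the pipeline already failed and no further sync flows through it, the roller waits on the safe point forever, which matches the stuck roll in the attached thread dump.

// Toy model (not HBase code) of the safe-point handshake described above.
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.CountDownLatch;

public class SafePointHang {
  private static final String SAFE_POINT = "SAFE_POINT";
  private static final BlockingQueue<String> ringBuffer = new ArrayBlockingQueue<>(16);
  private static final CountDownLatch safePointReached = new CountDownLatch(1);

  public static void main(String[] args) throws Exception {
    // Consumer thread: drains the "ring buffer" and acks the safe point when it sees it.
    Thread consumer = new Thread(() -> {
      try {
        while (true) {
          String entry = ringBuffer.take();
          if (SAFE_POINT.equals(entry)) {
            safePointReached.countDown();   // tell the roller it is safe to swap writers
            return;
          }
        }
      } catch (InterruptedException ignored) {
      }
    });
    consumer.start();

    boolean publishSafePointSync = false;   // flip to true to see the roll complete
    if (publishSafePointSync) {
      ringBuffer.put(SAFE_POINT);
    }

    // The roller's point of view: wait for the consumer to reach the safe point.
    System.out.println("Roller waiting for safe point...");
    safePointReached.await();               // hangs if the safe-point entry was never enqueued
    System.out.println("Safe point reached; writer can be replaced.");
  }
}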

      Attachments

        1. timeouts.branch-1.txt
          7 kB
          Michael Stack
        2. 14317.branch-1.v2.txt
          83 kB
          Michael Stack
        3. 14317.branch-1.v2.txt
          83 kB
          Michael Stack
        4. 14317.branch-1.v2.txt
          83 kB
          Michael Stack
        5. 14317.branch-1.v2.txt
          83 kB
          Michael Stack
        6. 14317.branch-1.v2.txt
          83 kB
          Michael Stack
        7. 14317.branch-1.v2.txt
          83 kB
          Michael Stack
        8. 14317.branch-1.v2.txt
          83 kB
          Michael Stack
        9. 14317.branch-1.v2.txt
          83 kB
          Michael Stack
        10. 14317.branch-1.txt
          83 kB
          Michael Stack
        11. 14317.branch-1.txt
          83 kB
          Michael Stack
        12. 14317v15.txt
          82 kB
          Michael Stack
        13. 14317v14.txt
          79 kB
          Michael Stack
        14. 14317v13.txt
          73 kB
          Michael Stack
        15. 14317v12.txt
          73 kB
          Michael Stack
        16. 14317v11.txt
          73 kB
          Michael Stack
        17. 14317v10.txt
          57 kB
          Michael Stack
        18. 14317v9.txt
          50 kB
          Michael Stack
        19. 14317v5.branch-1.2.txt
          43 kB
          Michael Stack
        20. 14317v5.txt
          43 kB
          Michael Stack
        21. repro.txt
          32 kB
          Michael Stack
        22. HBASE-14317-v4.patch
          17 kB
          Elliott Neil Clark
        23. HBASE-14317-v3.patch
          17 kB
          Elliott Neil Clark
        24. HBASE-14317-v2.patch
          17 kB
          Elliott Neil Clark
        25. HBASE-14317-v1.patch
          16 kB
          Elliott Neil Clark
        26. append-only-test.patch
          12 kB
          Elliott Neil Clark
        27. san_dump.txt
          225 kB
          Elliott Neil Clark
        28. 14317.test.txt
          20 kB
          Michael Stack
        29. HBASE-14317.patch
          5 kB
          Elliott Neil Clark
        30. raw.php
          174 kB
          Michael Stack
        31. subset.of.rs.log
          877 kB
          Michael Stack
        32. [Java] RS stuck on WAL sync to a dead DN - Pastebin.com.html
          1.19 MB
          Michael Stack


            People

              Assignee: Michael Stack (stack)
              Reporter: Michael Stack (stack)
              Votes: 0
              Watchers: 26

              Dates

                Created:
                Updated:
                Resolved: