Hadoop Common / HADOOP-1707

Remove the DFS Client disk-based cache

Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.16.0
    • Component/s: None
    • Labels: None

    Description

      The DFS client currently uses a staging file on local disk to cache all user writes to a file. When the staging file accumulates one block's worth of data, its contents are flushed to an HDFS datanode. These operations occur sequentially.

      A simple optimization, allowing the user to write to a second staging file while the contents of the first are simultaneously uploaded to HDFS, would improve file-upload performance.
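
      A minimal sketch of this double-buffering idea, assuming a hypothetical uploadBlock() call in place of the real client/datanode transfer (illustrative only, not the actual DFSClient code): one staging buffer is filled by user writes while a background thread ships the previously filled one.

        // Hypothetical sketch: double-buffered staging so user writes overlap the upload.
        import java.util.concurrent.ArrayBlockingQueue;
        import java.util.concurrent.BlockingQueue;

        class DoubleBufferedUploader {
          private static final int BLOCK_SIZE = 64 * 1024 * 1024;   // one block's worth of data
          private final BlockingQueue<byte[]> fullBuffers = new ArrayBlockingQueue<byte[]>(1);
          private byte[] current = new byte[BLOCK_SIZE];
          private int used = 0;

          DoubleBufferedUploader() {
            Thread uploader = new Thread(() -> {
              try {
                while (true) {
                  uploadBlock(fullBuffers.take());   // hypothetical block upload
                }
              } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
              }
            }, "uploader");
            uploader.setDaemon(true);
            uploader.start();
          }

          // User writes fill the current buffer; a full buffer is handed to the uploader
          // while the caller keeps writing into a fresh one.
          synchronized void write(byte[] data, int off, int len) throws InterruptedException {
            while (len > 0) {
              int n = Math.min(len, BLOCK_SIZE - used);
              System.arraycopy(data, off, current, used, n);
              used += n; off += n; len -= n;
              if (used == BLOCK_SIZE) {
                fullBuffers.put(current);            // blocks only if the previous block is still uploading
                current = new byte[BLOCK_SIZE];
                used = 0;
              }
            }
          }

          private void uploadBlock(byte[] block) { /* placeholder for the actual transfer to a datanode */ }
        }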

      Attachments

        1. DataTransferProtocol.html
          18 kB
          Dhruba Borthakur
        2. DataTransferProtocol.doc
          44 kB
          Dhruba Borthakur
        3. clientDiskBuffer9.patch
          97 kB
          Dhruba Borthakur
        4. clientDiskBuffer8.patch
          96 kB
          Dhruba Borthakur
        5. clientDiskBuffer7.patch
          94 kB
          Dhruba Borthakur
        6. clientDiskBuffer6.patch
          89 kB
          Dhruba Borthakur
        7. clientDiskBuffer27.patch
          136 kB
          Dhruba Borthakur
        8. clientDiskBuffer27.patch
          136 kB
          Dhruba Borthakur
        9. clientDiskBuffer26.patch
          136 kB
          Dhruba Borthakur
        10. clientDiskBuffer25.patch
          135 kB
          Dhruba Borthakur
        11. clientDiskBuffer24.patch
          116 kB
          Dhruba Borthakur
        12. clientDiskBuffer24.patch
          132 kB
          Dhruba Borthakur
        13. clientDiskBuffer23.patch
          113 kB
          Dhruba Borthakur
        14. clientDiskBuffer23.patch
          113 kB
          Dhruba Borthakur
        15. clientDiskBuffer22.patch
          129 kB
          Dhruba Borthakur
        16. clientDiskBuffer21.patch
          129 kB
          Dhruba Borthakur
        17. clientDiskBuffer20.patch
          129 kB
          Dhruba Borthakur
        18. clientDiskBuffer2.patch
          49 kB
          Dhruba Borthakur
        19. clientDiskBuffer19.patch
          126 kB
          Dhruba Borthakur
        20. clientDiskBuffer18.patch
          125 kB
          Dhruba Borthakur
        21. clientDiskBuffer17.patch
          126 kB
          Dhruba Borthakur
        22. clientDiskBuffer16.patch
          117 kB
          Dhruba Borthakur
        23. clientDiskBuffer15.patch
          111 kB
          Dhruba Borthakur
        24. clientDiskBuffer14.patch
          99 kB
          Dhruba Borthakur
        25. clientDiskBuffer12.patch
          97 kB
          Dhruba Borthakur
        26. clientDiskBuffer11.patch
          97 kB
          Dhruba Borthakur
        27. clientDiskBuffer10.patch
          97 kB
          Dhruba Borthakur
        28. clientDiskBuffer.patch
          45 kB
          Dhruba Borthakur


          Activity

            dhruba Dhruba Borthakur added a comment -

            I plan on removing the staging file altogether. The client will stream data to the datanodes directly, possibly in chunks of 64K memory buffers. A detailed design will follow.

            cutting Doug Cutting added a comment -

            > The client will stream data to the datanodes directly [ ... ]

            Some history to be aware of. Long ago writes were tee'd to datanodes directly, and the local file was only used to replay things. Switching it so that writes were always buffered to a local file had two advantages: it radically simplified the code (the tee multiplied the number of failure modes) and it improved performance & reliability. Each datanode had far fewer active connections, since blocks were written in a burst rather than as a trickle.

            How will you handle datanode failures? Since you have no local file to replay, won't those always cause an exception in the client? That will cause tasks to fail, which might be acceptable, now that things are overall more reliable, but, at the time I looked at this (again, long ago) datanode timeouts were frequent enough that this would cause job failure.


            dhruba Dhruba Borthakur added a comment -

            Thanks, Doug, for your comments.

            1. My thinking is as follows: the client has a bunch of small buffers. Say 2 buffers each of size 16K. When the first buffer is full, it writes that buffer to the first datanode in the pipeline. The client meanwhile can continue to fill up the remaining buffer(s). The first datanode, on receipt of this buffer, sends it to the next datanode in the pipeline and also writes it to its local disk.

            2. If a datanode fails to write a buffer to its disk, it is reported back to the client. The client removes this datanode from the pipeline and continues to write to the remaining two datanodes. The file in the bad datanode remains in the "tmp" directory.

            3. When the file is closed, the under-replicated blocks will be replicated by the namenode.

            cutting Doug Cutting added a comment -

            If a datanode fails to write a buffer to its disk, it is reported back to the client. The client removes this datanode from the pipeline and continues to write to the remaining two datanodes. [ ... ] When the file is closed, the under-replicated blocks will be replicated by the namenode.

            I think the more typical failure mode will be a timeout. I'm also still not sure of the answer to my question: if the first datanode in the pipeline times out, does the write fail, throwing an exception to the client? Or does the client route around the first datanode in the pipeline and continue until all datanodes in the pipeline time out? If so, how can it be sure that the other datanodes have received their copies of prior chunks from the first datanode in the pipeline?

            Also, HADOOP-1927 states that we should fail as soon as any element in the pipeline fails. Do you agree? Currently this would be invisible to clients, since the entire block can be replayed to a new pipeline. But, without a local file, this would force us to fail the write when any element of the pipeline fails. Thoughts?


            dhruba Dhruba Borthakur added a comment -

            If the primary datanode fails, the client can still replay the last-flushed data buffer to the remaining datanodes. The client has to specify the offset in the block where this buffer's contents have to be written. The datanode, given this offset-in-block, can determine whether to do the write or whether the write was already done. The prerequisite is that a client holds on to a buffer until the write is complete on all known good datanodes.

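            A minimal sketch of this offset-based replay check, with illustrative names (applyBuffer, bytesOnDisk); not the actual datanode code. The datanode applies a replayed buffer only where it extends past what is already on disk, so resending a buffer is harmless:

                // Hypothetical sketch: idempotent writes keyed by offset-in-block.
                import java.io.IOException;

                class BlockWriterSketch {
                  private long bytesOnDisk = 0;   // how much of this block is already persisted

                  synchronized void applyBuffer(long offsetInBlock, byte[] data) throws IOException {
                    long end = offsetInBlock + data.length;
                    if (end <= bytesOnDisk) {
                      return;                     // already written: a replayed buffer is ignored
                    }
                    // A replay may overlap what is on disk; write only the new suffix.
                    int skip = (int) Math.max(0, bytesOnDisk - offsetInBlock);
                    writeToLocalDisk(data, skip, data.length - skip);
                    bytesOnDisk = end;
                  }

                  private void writeToLocalDisk(byte[] b, int off, int len) throws IOException {
                    // placeholder for the actual block-file write
                  }
                }
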
            Another option would be to say that the application gets an error if the Primary datanode fails. Do you think that this is acceptable?

            I think HADOOP-1927 says that if a non-primary datanode dies, the client should detect it and possibly take appropriate action. Currently the client has no way of knowing whether any secondary datanodes have died.

            rangadi Raghu Angadi added a comment - - edited

            The JIRA description only talks about parallel writes to datanodes. It does not require removing the temp file on the client.

            How about just storing the block at the client like we do now and replaying the data if there is an error? It still allows the parallel write from the client. This also does not need any changes or improvements to the datanode protocol. Yes, removing the temp file would be better, but this is not worse than the current implementation.

            dhruba Dhruba Borthakur added a comment -

            The local disk cache implementation is similar to creating four replicas of the block and then deleting the excess replica when the block is done. This reduces overall cluster throughput, and I would like to analyze ways of getting rid of it.

            I agree that I should change the name-description of this JIRA. Will do.

            cutting Doug Cutting added a comment -

            > Another option would be to say that the application gets an error if the Primary datanode fails. Do you think that this is acceptable?

            Perhaps, if it only happens rarely. If, e.g., sorts generally complete on 900 nodes with no such failures, then this is probably acceptable. If the primary datanode is localhost, and if secondary failures are survivable, then this may work well enough.

            Otherwise, how do we recover when a datanode in the pipeline becomes unreachable? Will we use per-buffer acks? The primary datanode won't ack a buffer until all datanodes in the pipeline have it? Then if one datanode fails, we could route around it, initialize its copy of the block from one of the survivors, and continue. The acking will effectively add flow-control, which could be a feature, or could slow things. Datanodes may receive the same buffer twice, so buffers will need revision numbers or somesuch.


            dhruba Dhruba Borthakur added a comment -

            I have the following proposal in mind:

            1. The Client uses a small pool of memory buffers per dfs-output stream. Say, 10 buffers of size 64K each.
            2. A write to the output stream actually copies the user data into one of the buffers, if available. Otherwise the user-write blocks.
            3. A separate thread (one per output stream) sends buffers that are full. Each buffer has metadata that contains a sequence number (locally generated on the client), the length of the buffer, and its offset in this block.
            4. Another thread (one per output stream) processes incoming responses. The incoming response has the sequence number of the buffer that the datanode has processed. The client removes that buffer from its queue.
            5. The client gets an exception if the primary datanode fails. If a secondary datanode fails, the primary informs the client about this event.
            6. If any datanode fails, the client removes it from the pipeline and resends all pending buffers to all known good datanodes.
            7. A target datanode remembers the last sequence number that it has processed. It forwards the buffer to the next datanode in the pipeline. If the datanode receives a buffer that it has not processed earlier, it writes it to local disk. When the response arrives, it forwards the response back to the client. (A rough sketch of this client-side bookkeeping follows.)
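
            A rough sketch of the client-side bookkeeping described in this proposal, using illustrative names (Packet, dataQueue, ackQueue); not the actual DFSClient code:

                // Hypothetical sketch: user writes block when no buffers are free, a streamer
                // thread sends packets, and a response thread retires them on ack.
                import java.util.LinkedList;

                class OutputStreamSketch {
                  static class Packet {
                    final long seqno;             // locally generated sequence number
                    final long offsetInBlock;     // where this packet's data belongs in the block
                    final byte[] data;
                    Packet(long seqno, long offsetInBlock, byte[] data) {
                      this.seqno = seqno; this.offsetInBlock = offsetInBlock; this.data = data;
                    }
                  }

                  private final LinkedList<Packet> dataQueue = new LinkedList<Packet>(); // waiting to be sent
                  private final LinkedList<Packet> ackQueue = new LinkedList<Packet>();  // sent, awaiting acks
                  private static final int MAX_OUTSTANDING = 10;                         // "say, 10 buffers"
                  private long nextSeqno = 0;
                  private long bytesInBlock = 0;

                  // Called from the user-facing write path; blocks when no buffer slots are free.
                  synchronized void enqueue(byte[] buf) throws InterruptedException {
                    while (dataQueue.size() + ackQueue.size() >= MAX_OUTSTANDING) {
                      wait();                                   // the user write blocks
                    }
                    dataQueue.add(new Packet(nextSeqno++, bytesInBlock, buf));
                    bytesInBlock += buf.length;
                    notifyAll();                                // wake the streamer thread
                  }

                  // Streamer thread: take the next packet and keep it until the datanodes ack it.
                  synchronized Packet nextToSend() throws InterruptedException {
                    while (dataQueue.isEmpty()) {
                      wait();
                    }
                    Packet p = dataQueue.removeFirst();
                    ackQueue.add(p);
                    return p;
                  }

                  // Response thread: an ack for seqno means all good datanodes have that packet.
                  synchronized void ackReceived(long seqno) {
                    while (!ackQueue.isEmpty() && ackQueue.getFirst().seqno <= seqno) {
                      ackQueue.removeFirst();
                    }
                    notifyAll();                                // a buffer slot became free
                  }

                  // On a datanode failure: everything not yet acked is resent to the survivors.
                  synchronized void requeueOnError() {
                    dataQueue.addAll(0, ackQueue);
                    ackQueue.clear();
                  }
                }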

            cutting Doug Cutting added a comment -

            > The client gets an exception if the primary datanode fails.

            Why can't it simply replace the primary with one of the secondary datanodes and proceed?

            > If a secondary datanode fails, the primary informs the client about this event.

            Since a secondary will typically fail by timing out, the timeout used between the client and the primary must be longer than that used between the primary and secondary, so that the client waits long enough to hear about a failed secondary. And the timeout used between the application and the client must be longer yet. Right? Perhaps we should make all these timeouts proportional to a single configuration parameter, the application timeout?

            If we wish to ensure that blocks are sufficiently replicated, then we'll block on file close, right?

            Overall, this sounds like an approach worth trying.


            dhruba Dhruba Borthakur added a comment -

            This design depends on the fact that the client can detect which datanode in the pipeline encountered an error. This patch will fix the issue described in HADOOP-1927.

            dhruba Dhruba Borthakur added a comment -

            I agree that the timeout issue does not have a very elegant solution. Here is a new proposal.

            The Client
            --------------
            1. The Client uses a small pool of memory buffers per dfs-output stream. Say, 10 buffers of size 64K each.
            2. A write to the output stream actually copies the user data into one of the buffers, if available. Otherwise the user-write blocks.
            3. A separate thread (one per output stream) sends buffers that are full. Each buffer has metadata that contains a sequence number (locally generated on the client), the length of the buffer, and its offset in this block.
            4. Another thread (one per output stream) processes incoming responses. The incoming response has the sequence number of the buffer that the datanode has processed. The client removes that buffer from its queue.

            The Primary Datanode
            ------------------------------
            The primary datanode has two threads per stream. The first thread processes incoming packets from the client, writes them to the downstream datanode and writes them to local disk. The second thread processes responses from downstream datanodes and forwards them back to the client.

            This means that the client gets back an ack only when the packet is persisted on all datanodes. In the future this can be changed so that the client gets an ack when the data is persisted in dfs.replication.min number of datanodes.

            In case the primary datanode encounters an exception while writing to the downstream datanode, it declares the block as bad. It removes the immediate downstream datanode from the pipeline. It makes an RPC to the namenode to abandon the current blockId and *replace* it with a new one. It then establishes a new pipeline to the remaining datanodes using the new blockId and copies all the data from its local temporary block file to the downstream datanodes under the new blockId.

            The Secondary Datanodes
            ------------------------------------
            The Secondary datanode has two threads per stream. The first thread processes incoming packets from the upstream datanode, writes them to the downstream datanode and writes them to local disk. The second thread processes responses from downstream datanodes and forwards them back to the upstream datanode.

            Each secondary datanode sends its own response as well as forwarding the responses of all downstream datanodes.
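
            A rough sketch of the two-threads-per-stream structure described above, with illustrative stream names; not the actual DataNode code. One thread relays packets downstream and writes them locally; the other relays downstream responses back upstream:

                // Hypothetical sketch of one pipeline stage (primary or secondary datanode).
                import java.io.DataInputStream;
                import java.io.DataOutputStream;
                import java.io.IOException;
                import java.io.OutputStream;

                class PipelineStageSketch {
                  private final DataInputStream fromUpstream;    // client or previous datanode
                  private final DataOutputStream toUpstream;     // response channel back upstream
                  private final DataOutputStream toDownstream;   // next datanode, or null for the last one
                  private final DataInputStream fromDownstream;  // responses from the next datanode, or null
                  private final OutputStream blockFile;          // local block replica

                  PipelineStageSketch(DataInputStream fromUpstream, DataOutputStream toUpstream,
                                      DataOutputStream toDownstream, DataInputStream fromDownstream,
                                      OutputStream blockFile) {
                    this.fromUpstream = fromUpstream;
                    this.toUpstream = toUpstream;
                    this.toDownstream = toDownstream;
                    this.fromDownstream = fromDownstream;
                    this.blockFile = blockFile;
                  }

                  void start() {
                    new Thread(this::receivePackets, "receiver").start();
                    new Thread(this::relayResponses, "responder").start();
                  }

                  // Thread 1: read a packet, forward it downstream, then write it to local disk.
                  private void receivePackets() {
                    try {
                      while (true) {
                        long seqno = fromUpstream.readLong();
                        int len = fromUpstream.readInt();
                        byte[] data = new byte[len];
                        fromUpstream.readFully(data);
                        if (toDownstream != null) {
                          toDownstream.writeLong(seqno);
                          toDownstream.writeInt(len);
                          toDownstream.write(data);
                          toDownstream.flush();
                        }
                        blockFile.write(data);                   // persist the local replica
                        if (toDownstream == null) {              // the last datanode acks directly
                          synchronized (toUpstream) { toUpstream.writeLong(seqno); toUpstream.flush(); }
                        }
                      }
                    } catch (IOException e) { /* report this stage's failure upstream */ }
                  }

                  // Thread 2: forward each downstream response to the upstream node (or the client),
                  // so the client's ack means the packet is persisted on every datanode.
                  private void relayResponses() {
                    if (fromDownstream == null) return;
                    try {
                      while (true) {
                        long ackedSeqno = fromDownstream.readLong();
                        synchronized (toUpstream) { toUpstream.writeLong(ackedSeqno); toUpstream.flush(); }
                      }
                    } catch (IOException e) { /* downstream died: surface the error upstream */ }
                  }
                }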

            cutting Doug Cutting added a comment -

            This still appears to have the cascading timeout issue, no? Each stage in the pipeline must have a smaller timeout than the prior stage or else the whole pipeline will fail when any node fails. In particular, the client must use a much larger timeout, since it must permit the primary to potentially replay the entire block downstream. Perhaps there can be multiple kinds of acks, some of which just indicate that the primary is still alive and others that indicate that replication is complete? (Acks might include the current level of replication.) That might help distinguish the cases where the primary has actually gone down from those where it is still doing productive work. Then one timeout could be used for communications, and a substantially longer one for awaiting replication.

            I also wonder whether, instead of having so many threads, we might implement this with async i/o. Much of the processing seems simple enough that maintaining a state object for each file being written and using a single thread that selects on sockets and then updates the state might be more efficient. Perhaps it will be simpler to write these with threads, then convert them to async?

            We discussed offline last week a different approach from what you've described here. In that, acks would only signal that the immediately downstream node had written the data, not all downstream nodes. Only at block end or flush would it check that sufficient replicas exist, with a different command. Why have you abandoned this plan?

            An intermediate approach might be to use buffer pools on each datanode in the pipeline. Each would write the buffer locally and queue it to be written downstream. The buffer would only be returned to the pool when both writes complete. A datanode could block when no buffers are available. That might improve throughput.


            dhruba Dhruba Borthakur added a comment -

            This is a very, very preliminary patch that packetizes writes from clients. It does not do any error recovery at all.

            We discussed a proposal where datanodes do local recovery. If a datanode fails, the datanode immediately preceding it will recreate the pipeline by ignoring the one that failed and connecting directly to the datanode that followed the one that failed. This approach has the disadvantage that in the case of multiple failures, two upstream datanodes might be in recovery and both of them might try to resend the block to a downstream datanode simultaneously. This might be a difficult case to handle.

            Also, the earlier proposal generated an exception to the client if the primary datanode fails. This might be a commonly occurring case. If we want to avoid this problem, then the client has to do recovery (over and above any datanodes doing local recovery). In this case, maybe it is better to drive the entire recovery from a single point: the client.

            The cascading timeouts issue has to be handled somehow. Your proposal of setting different timeouts for datanodes in the pipeline will work, but it might be a little tricky to implement and debug. Another approach would be for each datanode to expose a new "ping" RPC. The client, when it has to recover, "pings" each datanode and determines which of them are not responding. This seems like it would work, doesn't it?


            dhruba Dhruba Borthakur added a comment -

            In the current trunk, the first datanode in the pipeline sets a timeout of 2 minutes. The second datanode sets a timeout of 1 minute, and so on. If a datanode does not receive a response from a downstream datanode within this timeout period, it declares the downstream datanode as dead.

            In this patch for removing the client-side disk buffer, the connection between datanodes in the pipeline could remain open for extended periods of time, especially for clients that are producing output slowly. I propose that we change the timeouts to behave as follows:

            1. Each datanode in the pipeline has the same timeout of 1 minute. If a datanode does not receive a response from a downstream datanode in 1 minute, it declares the downstream datanode as dead.
            2. Each datanode sends a heartbeat message to the upstream datanode once every half timeout period. (A small sketch of this scheme follows.)
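
            A minimal sketch of this fixed-timeout-plus-keepalive scheme, with illustrative names and constants; not the actual datanode code:

                // Hypothetical sketch: same read timeout at every stage, plus periodic keepalives
                // so a slowly writing client does not make an idle pipeline look dead.
                import java.io.DataOutputStream;
                import java.io.IOException;
                import java.net.Socket;

                class KeepAliveSketch {
                  static final int STAGE_TIMEOUT_MS = 60 * 1000;   // 1 minute at every stage
                  static final long KEEPALIVE_MARKER = -1L;        // illustrative "still alive" message

                  // Reader side: a stage that hears nothing for a minute declares its downstream dead.
                  static void configureReadTimeout(Socket fromDownstream) throws IOException {
                    fromDownstream.setSoTimeout(STAGE_TIMEOUT_MS);
                  }

                  // Writer side: ping the upstream neighbor once every half timeout period.
                  static Thread startKeepAlive(final DataOutputStream toUpstream) {
                    Thread t = new Thread(() -> {
                      try {
                        while (!Thread.currentThread().isInterrupted()) {
                          Thread.sleep(STAGE_TIMEOUT_MS / 2);
                          synchronized (toUpstream) {
                            toUpstream.writeLong(KEEPALIVE_MARKER);
                            toUpstream.flush();
                          }
                        }
                      } catch (InterruptedException | IOException e) {
                        // stop quietly; the normal error path notices the broken connection
                      }
                    }, "keepalive");
                    t.setDaemon(true);
                    t.start();
                    return t;
                  }
                }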

            rangadi Raghu Angadi added a comment -

            > 2. Each datanode sends a heartbeat message to the upstream datanode once every half-timeout-period.
            It might be better to call this 'KeepAlive' since it is per connection and avoids confusion with DataNode heartbeat.


            dhruba Dhruba Borthakur added a comment -

            This patch removes the client-side disk buffer.

            1. FSConstants.java : Bumped up DATA_TRANSFER_VERSION.
            2. Daemon.java: Added a ThreadGroup to the Daemon class. All worker threads that process data transfers belong to this group. The shutdown of a datanode waits for the entire thread group to exit. Prior to this change, a datanode shutdown did not wait for the data transfer threads to exit. (A small sketch follows this list.)
            3. FSNamesystem.java: Allows a zero size file to have no blocks associated with it.
            4. DataChecksum.java: A utility method to return the size of a checksum header.
            5. FSDataset.java: The ongoingCreates data structure remembers the thread that is currently writing to a block. The writeToBlock() method (when the recovery flag is set) terminates any existing threads that might have been writing to a block before allowing a new thread to write to the same block.
            6. FSDataOutputStream.java: The unit test needed to extract the pipeline associated with a block. This is facilitated by exposing a new public API called getWrappedStream() that returns the underlying DFSOutputStream object.
            7. MiniDFSCluster.java: Allows stopping a particular datanode.
            8. DFSClient.java/DataNode.java: User data is transferred in the form of packets. Each Packet requires an ack from all datanodes. The DFSClient drives the entire recovery strategy. A keepalive is sent every READ_TIMEOUT/2 period on the response socket channel. Each packet is 64K in size and the client has a sliding window of 5MB per stream.
            9. TestDatanodeDeath.java: A unit test to trigger datanode deaths and DFSClient recovery.
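
            A small sketch of the ThreadGroup-based shutdown wait mentioned in item 2, with illustrative names; the actual Daemon/DataNode code may differ:

                // Hypothetical sketch: run data-transfer workers in one ThreadGroup so that
                // shutdown can wait until every transfer thread has exited.
                class DataTransferShutdownSketch {
                  private final ThreadGroup transferGroup = new ThreadGroup("dataTransfer");

                  void startWorker(Runnable transfer) {
                    Thread t = new Thread(transferGroup, transfer, "dataXceiver");
                    t.setDaemon(true);
                    t.start();
                  }

                  // Called from shutdown(): interrupt the workers and wait for the group to drain.
                  void shutdown() throws InterruptedException {
                    transferGroup.interrupt();
                    while (transferGroup.activeCount() > 0) {
                      Thread.sleep(100);           // poll until all transfer threads have exited
                    }
                  }
                }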


            dhruba Dhruba Borthakur added a comment -

            A document that describes the streaming protocol used to transfer data among clients and datanodes.


            dhruba Dhruba Borthakur added a comment -

            The Data Transfer Protocol document in HTML format.

            cutting Doug Cutting added a comment -

            This protocol document is great to have! Can we get it converted into forrest-compatible XML and included in a reference section of the documentation when this patch is committed?


            dhruba Dhruba Borthakur added a comment -

            Merged patch with latest trunk.


            dhruba Dhruba Borthakur added a comment -

            Fixed two bugs that were exposed while running random writer on a 100 node cluster.

            1. The code was waiting for the ResponseThread to exit while holding the lock on dataQueue. This caused a deadlock.

            2. The DFSClient was sending the packet to the first datanode before it inserted the packet into the ackQueue. If the response from the datanode arrived before the DFSClient could enqueue the packet into the ackQueue, it triggered an error. This situation is now avoided because the DFSClient first inserts the packet into the ackQueue and then sends the packet to the datanode. (A sketch of the corrected ordering follows.)
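
            A minimal sketch of the corrected ordering, reusing the illustrative dataQueue/ackQueue names from the earlier sketch; not the actual DFSClient code:

                // Hypothetical sketch of the two fixes: (1) never join the response thread while
                // holding the dataQueue lock, and (2) enqueue to ackQueue before sending the packet.
                import java.io.DataOutputStream;
                import java.io.IOException;
                import java.util.LinkedList;

                class StreamerOrderingSketch {
                  private final LinkedList<Object> dataQueue = new LinkedList<Object>();
                  private final LinkedList<Object> ackQueue = new LinkedList<Object>();
                  private final DataOutputStream toFirstDatanode;
                  private final Thread responseThread;

                  StreamerOrderingSketch(DataOutputStream toFirstDatanode, Thread responseThread) {
                    this.toFirstDatanode = toFirstDatanode;
                    this.responseThread = responseThread;
                  }

                  void sendPacket(Object packet, byte[] bytes) throws IOException {
                    synchronized (dataQueue) {
                      dataQueue.remove(packet);
                      ackQueue.add(packet);        // fix 2: record the packet before it leaves the client,
                    }                              // so an early ack always finds it in the ackQueue
                    toFirstDatanode.write(bytes);  // the actual send happens after the bookkeeping
                    toFirstDatanode.flush();
                  }

                  void closeResponder() throws InterruptedException {
                    // fix 1: join outside the dataQueue lock; the response thread needs that lock
                    // to drain acks, so joining while holding it deadlocks.
                    responseThread.interrupt();
                    responseThread.join();
                  }
                }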


            shv Konstantin Shvachko added a comment -

            Since you have just encountered that: the same problem will potentially be in the following 3 methods

            • nextBlockOutputStream()
            • locateFollowingBlock()
            • DFSOutputStream.close()

            where the client sleeps under a lock. In general a thread should wait() instead of sleep() under a lock.
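
            A generic illustration of the point above (not the DFSClient code): sleep() keeps the monitor held, while wait() releases it so other threads can make progress and can wake the waiter early.

                // Generic illustration: prefer wait() over sleep() while holding a lock.
                class WaitVsSleep {
                  private boolean done = false;

                  // Anti-pattern: the lock is held across the sleep, so markDone() can never
                  // acquire it and this loop never makes progress.
                  synchronized void pollHoldingLock() throws InterruptedException {
                    while (!done) {
                      Thread.sleep(1000);          // monitor stays held for the whole second
                    }
                  }

                  // Better: wait() releases the monitor and returns as soon as it is notified.
                  synchronized void awaitCompletion() throws InterruptedException {
                    while (!done) {
                      wait(1000);                  // monitor released while waiting
                    }
                  }

                  synchronized void markDone() {
                    done = true;
                    notifyAll();                   // wakes awaitCompletion() immediately
                  }
                }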


            dhruba Dhruba Borthakur added a comment -

            Make patch compile with JDK 1.5.


            dhruba Dhruba Borthakur added a comment -

            Fixed a bug that was causing the client to hang if all datanodes in the pipeline reported an error. This situation was triggered while testing this patch on a 500 node cluster.


            dhruba Dhruba Borthakur added a comment -

            Merged with latest trunk.


            shv Konstantin Shvachko added a comment -

            I think this patch has been tested quite thoroughly, and I don't see any algorithmic flaws in it.
            The logic is fairly complicated though, so in my opinion:

            1. we need better documentation either in JavaDoc or at least in Jira.
            2. it would be good if you could extract common actions for the client and the data-node into
              separate classes, not inner ones.

            =========== DFSClient.java

            • DFSClient: 4 unused variables, members.
            • DFSOutputStream.lb should be local variable.
            • processDatanodeError() and DFSOutputStream.close() have common code.
            • BlockReader.readChunk()
              07/12/04 18:36:22 INFO fs.FSInputChecker: DFSClient readChunk got seqno 14 offsetInBlock 7168
              

              Should be DEBUG.

            • More comments: What is e.g. dataQueue, ackQueue, bytesCurBlock?
            • Some new members in DFSOutputStream can be calculated from the other.
              No need to store them all. See e.g.
                  private int packetSize = 0;
                  private int chunksPerPacket = 0;
                  private int chunksPerBlock = 0;
                  private int chunkSize = 0;
              
            • In the line below "8" should be defined as a constant. Otherwise, the meaning of that is not clear.
                    chunkSize = bytesPerChecksum + 8; // user data + checksum
              
            • currentPacket should be a local variable of writeChunk()
            • The 4 in the code snippet below looks mysterious (see the sketch after this list for both constants):
                    if (len + cklen + 4 > chunkSize) {
              
            • Why is ResponseProcessor started in processDatanodeError()?
            • some methods should be moved into new inner classes, like
              nextBlockOutputStream() should be a part of DataStreamer
            • Packet should be factored out to a separate class (named probably DataPacket).
              It should have serialization/deserialization methods for packet header, which should
              be reused in DFSClient and DataNodes for consistency in data transfer.
              It should also have readPacket() and writePacket() methods.

            =========== DataNode.java

            • import org.apache.hadoop.io.Text; is redundant.
            • My Eclipse shows 5 variables that are "never read".
            • Rather than using "4" on several occasions a constant should be defined
              SIZE_OF_INTEGER = Integer.SIZE / Byte.SIZE;
              

              and used whenever required.

            • lastDataNodeRun() should not be public

            =========== FSDataset.java

            • writeToBlock(): These are two searches in a map instead of one.
                    if (ongoingCreates.containsKey(b)) {
                      ActiveFile activeFile = ongoingCreates.get(b);
              
            • unfinalizeBlock() I kinda find the name funny.

            =========== General

            • Convert comments like // .......... to JavaDoc /** ... */ style comments
              when used as method or class headers even if they are private.
            • Formatting. Tabs should be replaced by 2 spaces. Eg: ResponseProcessor.run(), DataStreamer.run()
            • Formatting. Long lines.
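
            A small sketch of the named constants and single map lookup suggested above; names are illustrative, not the actual patch:

                // Hypothetical sketch of two review suggestions: name the magic numbers 4 and 8,
                // and avoid the containsKey()-then-get() double lookup.
                import java.util.Map;

                class ReviewSuggestionsSketch {
                  // "4" scattered through the code is just the size of a serialized int.
                  static final int SIZE_OF_INTEGER = Integer.SIZE / Byte.SIZE;    // 4 bytes

                  // "8" from chunkSize = bytesPerChecksum + 8; whatever its real composition,
                  // giving it a name where it is defined means readers do not have to guess.
                  static final int CHUNK_OVERHEAD_BYTES = 8;

                  static int chunkSize(int bytesPerChecksum) {
                    return bytesPerChecksum + CHUNK_OVERHEAD_BYTES;   // no bare "8"
                  }

                  // One map lookup instead of containsKey() followed by get().
                  static Object activeFileFor(Map<Object, Object> ongoingCreates, Object block) {
                    Object activeFile = ongoingCreates.get(block);
                    if (activeFile != null) {
                      // ... handle the in-progress block ...
                    }
                    return activeFile;
                  }
                }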

            dhruba Dhruba Borthakur added a comment -

            Setting the TCP buffer size to 64K (instead of the default of 8K) and setting tcpNoDelay() on the response socket improves performance by about 5%.
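
            A minimal sketch of that socket tuning, illustrative only:

                // Hypothetical sketch: larger TCP buffers on the data path, Nagle disabled on the
                // response path so small acks go out immediately.
                import java.io.IOException;
                import java.net.Socket;

                class SocketTuningSketch {
                  static final int TCP_BUFFER_SIZE = 64 * 1024;   // 64K instead of the 8K default

                  static void tune(Socket dataSocket, Socket responseSocket) throws IOException {
                    dataSocket.setSendBufferSize(TCP_BUFFER_SIZE);
                    dataSocket.setReceiveBufferSize(TCP_BUFFER_SIZE);
                    responseSocket.setTcpNoDelay(true);
                  }
                }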


            dhruba Dhruba Borthakur added a comment -

            Merged with latest trunk. Also fixed a bug where an InterruptedException was being consumed silently, leading to long delays in timeouts.
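
            A generic illustration of the kind of bug described (not the actual code): silently consuming an InterruptedException hides the request to stop, so the waiter keeps sitting out its full timeout.

                // Generic illustration: do not swallow InterruptedException.
                class InterruptHandlingSketch {
                  private boolean done = false;

                  // Anti-pattern: the interrupt is consumed, so this loop keeps waiting until
                  // 'done' is set, no matter who asked it to stop.
                  synchronized void swallowed() {
                    while (!done) {
                      try {
                        wait(60000);
                      } catch (InterruptedException e) {
                        // silently consumed: the request to stop is lost
                      }
                    }
                  }

                  // Better: let the exception propagate (or restore the interrupt status)
                  // so the waiter reacts promptly.
                  synchronized void responsive() throws InterruptedException {
                    while (!done) {
                      wait(60000);
                    }
                  }
                }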


            mukundm Mukund Madhugiri added a comment -

            I ran the sort benchmark on 500 nodes, and here is the data:

            trunk:

            • random writer: 0.405 hrs
            • sort: 1.508 hrs
            • sort validation: 0.333 hrs

            trunk + patch:

            • random writer: 0.534 hrs
            • sort: 1.808 hrs
            • sort validation: 0.408 hrs

            During the sort reduce phase, I observed some errors, but the sort eventually succeeded:

            • java.io.IOException: Could not get block locations. Aborting...
            • java.io.IOException: All datanodes are bad. Aborting...

            dhruba Dhruba Borthakur added a comment -

            Thanks, Mukund. The errors are causing the numbers to go up. I will dig into the logs and code to find the cause of the errors.


            dhruba Dhruba Borthakur added a comment -

            Found a race condition that was causing the client to close the connection before the datanodes had a chance to process the end-of-packet. This caused the datanode to treat it as an error condition, causing the client to do error recovery and re-send the outstanding packets to the remaining good datanodes. This was causing the performance regression.


            dhruba Dhruba Borthakur added a comment -

            This patch fixes another performance degradation shown by the earlier patch. There was a race condition whereby an intermediate datanode in the pipeline was ignoring the response sent from the downstream datanode, always forwarding an "error" to the client.


            dhruba Dhruba Borthakur added a comment -

            Merged patch with latest trunk. This patch has additional debugging that might help in getting to the cause of the performance degradation seen in one earlier run.


            dhruba Dhruba Borthakur added a comment -

            The earlier patch had LOG levels set to debug by default.


            dhruba Dhruba Borthakur added a comment -

            A sort on a 500 node cluster detected a data corruption. The datanode code had a race whereby the confirmation for a block did not carry the correct block size. This caused the namenode to think that the block was shorter than its actual length.


            mukundm Mukund Madhugiri added a comment -

            Running on a 100 node cluster, with the patch clientDiskBuffer19.patch, the sort benchmark showed these results:

             100 nodes               trunk    trunk + patch
             randomWriter (hrs)      0.44     0.45
             sort (hrs)              1.03     1.00
             sortValidation (hrs)    0.39     0.30
            hadoopqa Hadoop QA added a comment -

            -1 overall. Here are the results of testing the latest attachment
            http://issues.apache.org/jira/secure/attachment/12372949/clientDiskBuffer19.patch
            against trunk revision r611385.

            @author +1. The patch does not contain any @author tags.

            javadoc +1. The javadoc tool did not generate any warning messages.

            javac +1. The applied patch does not generate any new compiler warnings.

            findbugs -1. The patch appears to introduce 10 new Findbugs warnings.

            core tests -1. The patch failed core unit tests.

            contrib tests -1. The patch failed contrib unit tests.

            Test results: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1555/testReport/
            Findbugs warnings: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1555/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
            Checkstyle results: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1555/artifact/trunk/build/test/checkstyle-errors.html
            Console output: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1555/console

            This message is automatically generated.


            dhruba Dhruba Borthakur added a comment -

            Cancel patch to fix findbugs warnings.


            dhruba Dhruba Borthakur added a comment -

            Fix findbugs warnings. There are two findbugs warnings (about holding two locks while invoking wait()) that are valid scenarios.

            hadoopqa Hadoop QA added a comment -

            -1 overall. Here are the results of testing the latest attachment
            http://issues.apache.org/jira/secure/attachment/12373048/clientDiskBuffer20.patch
            against trunk revision r611537.

            @author +1. The patch does not contain any @author tags.

            javadoc +1. The javadoc tool did not generate any warning messages.

            javac +1. The applied patch does not generate any new compiler warnings.

            findbugs -1. The patch appears to introduce 2 new Findbugs warnings.

            core tests -1. The patch failed core unit tests.

            contrib tests -1. The patch failed contrib unit tests.

            Test results: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1569/testReport/
            Findbugs warnings: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1569/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
            Checkstyle results: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1569/artifact/trunk/build/test/checkstyle-errors.html
            Console output: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1569/console

            This message is automatically generated.


            dhruba Dhruba Borthakur added a comment -

            More findbugs issues and test failures.
            hadoopqa Hadoop QA added a comment -

            -1 overall. Here are the results of testing the latest attachment
            http://issues.apache.org/jira/secure/attachment/12373081/clientDiskBuffer21.patch
            against trunk revision r611760.

            @author +1. The patch does not contain any @author tags.

            javadoc +1. The javadoc tool did not generate any warning messages.

            javac +1. The applied patch does not generate any new compiler warnings.

            findbugs -1. The patch appears to introduce 2 new Findbugs warnings.

            core tests -1. The patch failed core unit tests.

            contrib tests -1. The patch failed contrib unit tests.

            Test results: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1583/testReport/
            Findbugs warnings: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1583/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
            Checkstyle results: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1583/artifact/trunk/build/test/checkstyle-errors.html
            Console output: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1583/console

            This message is automatically generated.

            dhruba Dhruba Borthakur added a comment -

            Debugging switched on to catch a problem that occurs repeatedly in the hadoopQA patch-testing environment.

            dhruba Dhruba Borthakur added a comment -

            HadoopQA patch testing sees unit test failures that cannot be reproduced on Linux machines. Re-submitting the patch with additional debugging so that the failing unit test can be diagnosed. This is in lieu of having a direct account on the Solaris machine on which Hadoop QA patch testing runs.

            dhruba Dhruba Borthakur added a comment -

            The Solaris platform exposed a race condition in which an InterruptedException interrupted the PacketHandler, causing it to skip sending the ack message for the last packet in a block.
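
            A minimal sketch of the failure mode and one way to guard against it, assuming a hypothetical responder loop; this is not the actual PacketHandler code. If an interrupt arrives while the responder is waiting for the next packet and the loop exits on the exception, the ack for the last packet of the block is never written; remembering the interrupt and draining until the final ack has been sent avoids that.

                import java.util.concurrent.BlockingQueue;
                import java.util.concurrent.LinkedBlockingQueue;

                // Hypothetical responder sketch: every packet, including the last one in
                // the block, must be acked even if the thread is interrupted while waiting.
                class ResponderSketch implements Runnable {
                  static class Packet {
                    final boolean lastPacketInBlock;
                    Packet(boolean last) { this.lastPacketInBlock = last; }
                  }

                  private final BlockingQueue<Packet> ackQueue = new LinkedBlockingQueue<Packet>();

                  void enqueue(Packet p) { ackQueue.add(p); }

                  public void run() {
                    boolean interrupted = false;
                    boolean lastAcked = false;
                    while (!lastAcked) {
                      try {
                        Packet p = ackQueue.take();     // may throw InterruptedException
                        sendAck(p);                     // write the ack back to the upstream node
                        lastAcked = p.lastPacketInBlock;
                      } catch (InterruptedException e) {
                        // Do not abort here: the final ack has not been sent yet.
                        // Remember the interrupt and keep draining the queue.
                        interrupted = true;
                      }
                    }
                    if (interrupted) {
                      Thread.currentThread().interrupt(); // restore the interrupt status afterwards
                    }
                  }

                  private void sendAck(Packet p) {
                    System.out.println("ack sent, last=" + p.lastPacketInBlock);
                  }
                }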

            dhruba Dhruba Borthakur added a comment -

            Reduced the number of datanode threads in the unit test from 40 to 15; otherwise the unit tests take a long time to complete.

            dhruba Dhruba Borthakur added a comment -

            To make the unit tests run faster, reduced the socket timeout period from the default of 1 minute to 5 seconds.
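
            A minimal sketch of how the two test-speed adjustments above (fewer datanodes, shorter socket timeout) might be wired up. The property name "dfs.socket.timeout" and the MiniDFSCluster constructor arguments are assumptions, not taken from the committed patch.

                import org.apache.hadoop.conf.Configuration;
                import org.apache.hadoop.dfs.MiniDFSCluster;

                // Hypothetical test setup: 15 datanodes instead of 40 and a 5-second socket
                // timeout instead of the 1-minute default, so the tests finish quickly.
                public class FastClusterSetup {
                  public static void main(String[] args) throws Exception {
                    Configuration conf = new Configuration();
                    conf.setInt("dfs.socket.timeout", 5000);              // milliseconds; assumed key
                    MiniDFSCluster cluster = new MiniDFSCluster(conf, 15, true, null);
                    try {
                      // ... run the write/read assertions against cluster.getFileSystem() ...
                    } finally {
                      cluster.shutdown();
                    }
                  }
                }
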
            hadoopqa Hadoop QA added a comment -

            -1 overall. Here are the results of testing the latest attachment
            http://issues.apache.org/jira/secure/attachment/12373138/clientDiskBuffer23.patch
            against trunk revision r612025.

            @author +1. The patch does not contain any @author tags.

            javadoc +1. The javadoc tool did not generate any warning messages.

            javac +1. The applied patch does not generate any new compiler warnings.

            findbugs -1. The patch appears to introduce 2 new Findbugs warnings.

            core tests -1. The patch failed core unit tests.

            contrib tests -1. The patch failed contrib unit tests.

            Test results: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1590/testReport/
            Findbugs warnings: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1590/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
            Checkstyle results: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1590/artifact/trunk/build/test/checkstyle-errors.html
            Console output: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1590/console

            This message is automatically generated.

            hadoopqa Hadoop QA added a comment -

            -1 overall. Here are the results of testing the latest attachment
            http://issues.apache.org/jira/secure/attachment/12373189/clientDiskBuffer24.patch
            against trunk revision r612200.

            @author +1. The patch does not contain any @author tags.

            javadoc +1. The javadoc tool did not generate any warning messages.

            javac +1. The applied patch does not generate any new compiler warnings.

            findbugs -1. The patch appears to introduce 2 new Findbugs warnings.

            core tests -1. The patch failed core unit tests.

            contrib tests +1. The patch passed contrib unit tests.

            Test results: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1601/testReport/
            Findbugs warnings: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1601/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
            Checkstyle results: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1601/artifact/trunk/build/test/checkstyle-errors.html
            Console output: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1601/console

            This message is automatically generated.

            dhruba Dhruba Borthakur added a comment -

            Fixed failure in TestCrcCorruption.
            hadoopqa Hadoop QA added a comment -

            -1 overall. Here are the results of testing the latest attachment
            http://issues.apache.org/jira/secure/attachment/12373225/clientDiskBuffer25.patch
            against trunk revision r612314.

            @author +1. The patch does not contain any @author tags.

            javadoc +1. The javadoc tool did not generate any warning messages.

            javac +1. The applied patch does not generate any new compiler warnings.

            findbugs -1. The patch appears to introduce 3 new Findbugs warnings.

            core tests -1. The patch failed core unit tests.

            contrib tests +1. The patch passed contrib unit tests.

            Test results: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1609/testReport/
            Findbugs warnings: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1609/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
            Checkstyle results: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1609/artifact/trunk/build/test/checkstyle-errors.html
            Console output: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1609/console

            This message is automatically generated.

            dhruba Dhruba Borthakur added a comment -

            The TestSetReplicationIncreasing test case takes a long time to run, usually on the order of 10+ minutes, because the default timeout for a replication request is 10 minutes. Changed the configuration so that the default for this test is 2 seconds, which makes the test case run a lot faster!
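
            A sketch of the kind of override involved; the property name "dfs.replication.pending.timeout.sec" is an assumption about which setting controls the replication-request timeout, not taken from the committed patch.

                import org.apache.hadoop.conf.Configuration;

                // Hypothetical test override: retry a pending replication request after
                // 2 seconds instead of waiting out the much larger default timeout.
                public class ShortReplicationTimeoutConf {
                  public static Configuration create() {
                    Configuration conf = new Configuration();
                    conf.setInt("dfs.replication.pending.timeout.sec", 2);  // assumed key
                    return conf;
                  }
                }
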
            hadoopqa Hadoop QA added a comment -

            -1 overall. Here are the results of testing the latest attachment
            http://issues.apache.org/jira/secure/attachment/12373299/clientDiskBuffer26.patch
            against trunk revision r612614.

            @author +1. The patch does not contain any @author tags.

            patch -1. The patch command could not apply the patch.

            Console output: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1616/console

            This message is automatically generated.

            dhruba Dhruba Borthakur added a comment -

            Merged the patch with the latest trunk.
            hadoopqa Hadoop QA added a comment -

            -1 overall. Here are the results of testing the latest attachment
            http://issues.apache.org/jira/secure/attachment/12373344/clientDiskBuffer27.patch
            against trunk revision r612674.

            @author +1. The patch does not contain any @author tags.

            javadoc +1. The javadoc tool did not generate any warning messages.

            javac +1. The applied patch does not generate any new compiler warnings.

            findbugs -1. The patch appears to introduce 3 new Findbugs warnings.

            core tests +1. The patch passed core unit tests.

            contrib tests -1. The patch failed contrib unit tests.

            Test results: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1620/testReport/
            Findbugs warnings: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1620/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
            Checkstyle results: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1620/artifact/trunk/build/test/checkstyle-errors.html
            Console output: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1620/console

            This message is automatically generated.

            dhruba Dhruba Borthakur added a comment -

            I am ignoring the two findbugs warnings because the code maintains a strict locking hierarchy.

            dhruba Dhruba Borthakur added a comment -

            I just committed this.
            hadoopqa Hadoop QA added a comment -

            -1 overall. Here are the results of testing the latest attachment
            http://issues.apache.org/jira/secure/attachment/12373344/clientDiskBuffer27.patch
            against trunk revision r612933.

            @author +1. The patch does not contain any @author tags.

            patch -1. The patch command could not apply the patch.

            Console output: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/1624/console

            This message is automatically generated.

            People

              Assignee: Dhruba Borthakur
              Reporter: Dhruba Borthakur
              Votes: 0
              Watchers: 1
