|
Jim Kellerman made changes - 26/Jan/09 07:03 PM
Jim Kellerman made changes - 05/Feb/09 08:02 AM
Simple testing using the test programs that I attached to HADOOP-4379, would seem to indicate that the patch for 4379 works. However we need more testing in the HBase environment to verify that the patch is sufficient.
@Stack
Yes, there are not many changes. Will work on this as soon as I finish up what I am currently working on. each record is approximately 1024 bytes.
one block is either 1,048,576 (1MB) or 67,108,864 (64 MB) A 1MB block holds 1,002 records expected overhead for 64MB is 1,441,792 A 64MB block holds 64,157 records So overhead is ~ 22-23 bytes/record. ======================================== Without the patch the best we can do is read up to the end of the last If we write write 70,000 records into 64MB blocks we can read 64157 If less than a block is written, we get back nothing. We only get up ======================================== With the patch, 1MB block size and no syncs:
There seems to be a problem with writing about 1024 records to a 1MB ======================================== With the patch, it appears that the block size is irrelevant and it is With a 64MB block size:
With a 1MB block size:
More testing to do: try writing close to 64MB with a 64MB block size and see if it experiences the non-recoverability that writing ~1MB with 1MB block size does. To clarify Stack's question, he said:
> ---- The answer to this question is TBD. I have yet to test how it works in a loaded cluster. To this point, I have just verified that in Patch that uses new API's to recover file lease and read from last log file being written by region server.
It does work, but slowly. As noted in HADOOP-4379, it takes almost an hour to recover the file lease when the clusters are loaded. 2009-02-25 21:39:16,843 DEBUG org.apache.hadoop.hbase.regionserver.HLog: Splitting 3 of 3: hdfs:/x.y.com:8100/hbase/log_10.76.44.139_1235597506284_8020/hlog.dat.1235597820662 2009-02-25 21:39:16,847 DEBUG org.apache.hadoop.hbase.regionserver.HLog: Triggering lease recovery. ... 2009-02-25 22:37:12,755 INFO org.apache.hadoop.hbase.regionserver.HLog: log file splitting completed for hdfs://x.y.com:8100/hbase/log_10.76.44.139_1235597506284_8020
Jim Kellerman made changes - 25/Feb/09 11:47 PM
Yes an hour is way too long. I asked in HADOOP-4379 if there is a way to speed it up.
Moving out of 0.19.1 because it is unlikely we will a patch for HADOOP-4379 soon enough.
Jim Kellerman made changes - 06/Mar/09 06:51 PM
stack made changes - 28/Apr/09 04:47 PM
stack made changes - 20/May/09 06:27 PM
stack made changes - 09/Jun/09 04:57 AM
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
We need more testing to confirm.