While testing 0.95.1 RC1, I found that simply inserting data and splitting seems to trigger a bug in the archivedHFileCleaner that makes it complain about missing files. It seems to affect mostly reference files after a split.
I'm attaching the output of a grep for the file name (you'll see the same file name appears in 2 regions; that's because it's a reference file).
|Assignee||Matteo Bertozzi [ mbertozzi ]|
|Summary||archivedHFileCleaner seems to be racing and complains about missing files||HFileLinkCleaner (FSUtils.listStatus) logs too much if links do not exists|
|Status||Open [ 1 ]||Resolved [ 5 ]|
|Hadoop Flags||Reviewed [ 10343 ]|
|Fix Version/s||0.98.0 [ 12323143 ]|
|Resolution||Fixed [ 1 ]|
|Status||Resolved [ 5 ]||Closed [ 6 ]|
|Transition||Time In Source Status||Execution Times||Last Executer||Last Execution Date|
|1h 21m||1||Jean-Daniel Cryans||08/Jun/13 00:51|
|107d 19h 31m||1||stack||23/Sep/13 20:22|