Description
Using HDFS for walogs would fix:
ACCUMULO-84: any node can read the replicated filesACCUMULO-558: wouldn't need to monitor loggersACCUMULO-544: log references wouldn't include hostnamesACCUMULO-423: wouldn't need to monitor loggersACCUMULO-258: hdfs has load balancing already
To implement it, we would need the ability to distribute log sorts.
Continuing to use loggers helps us avoid:
- hdfs pipeline strategy
- we don't have fine-grained insight when a single node makes dfs slow
- additional namenode pressure
- flexibility: for example, we can add fadvise() calls to the logger before HDFS supports it