My goal is to modify MapReduce Random Forest example by combining BuildForest.java and TestForest.java into a new class called RandomForest.java
The main point is to input one data file which is going to be used in training and testing; with k-fold cross-validation.
I have a big data with hight diminutional features and small amount of instances.
Seems to be a frustrating dead-end. is this process achievable? Or is it against MapReduce nature?