Using the features and train stream sources generates a model with TP, TN, FP, and FN fields. For some reason, the sum of these four values is sometimes less than the training set size.
Steps to reproduce:
1. Create two collections: cellphones and cellphones-model
2. Index the attached dataset into cellphones
3. Run the following expression:
features(cellphones, q="*:*", featureSet="featureSet",
4. Run the following query to retrieve the confusion matrix:
In this instance, the sum of TP, TN, FP, and FN is exactly one less than the training set size in every iteration.
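For anyone verifying this, the check can be scripted. The following is a minimal Python sketch, not part of Solr: the function name `confusion_matrix_gap` is hypothetical, the dictionary keys `TP`, `TN`, `FP`, `FN` are placeholders for whatever the fields are actually called in the model collection, and the numbers are made up for illustration rather than taken from the attached dataset.

```python
def confusion_matrix_gap(model_doc, training_set_size):
    """Return how many training documents the confusion matrix fails to count.

    model_doc: a dict holding the four confusion matrix counts for one
               iteration. The keys used here are placeholders (assumption);
               substitute the real field names from the model collection.
    training_set_size: number of documents in the training collection.
    """
    counted = sum(model_doc[key] for key in ("TP", "TN", "FP", "FN"))
    return training_set_size - counted


# Illustrative values only (assumption), not results from the attached dataset.
example_doc = {"TP": 40, "TN": 50, "FP": 6, "FN": 3}
print(confusion_matrix_gap(example_doc, 100))  # 1
```

A result of 0 means every training document was counted; a positive result is the number of documents missing from the confusion matrix for that iteration.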