Details
Improvement
Status: Resolved
Major
Resolution: Fixed
None
None
New
Description
Currently, DatasetSplitter throws an exception if it cannot find SortedDocValues or SortedSetDocValues on the class field, because without them it cannot split the index in a balanced way.
As a fallback, we could instead use the number of terms per leaf reader as an approximate count (an upper bound) of the number of classes.
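
The proposed fallback could be sketched roughly as follows. This is a plain-Java model of the idea, not Lucene code: the class `ClassCountFallback`, its methods, and the representation of each leaf reader as a `Set<String>` of class terms are all hypothetical, and summing the per-leaf counts is an assumption about how the per-leaf numbers would be combined.

```java
import java.util.List;
import java.util.OptionalInt;
import java.util.Set;

/** Plain-Java model of the fallback; none of these names are Lucene APIs. */
public class ClassCountFallback {

    /**
     * Sums the number of unique terms seen in each leaf (segment).
     * A term present in several leaves is counted once per leaf, so the
     * result can exceed, but never fall below, the true number of classes.
     */
    public static int upperBoundFromLeaves(List<Set<String>> termsPerLeaf) {
        int total = 0;
        for (Set<String> leafTerms : termsPerLeaf) {
            total += leafTerms.size();
        }
        return total;
    }

    /**
     * Uses the exact count when doc values supplied one (hypothetical input);
     * otherwise falls back to the per-leaf estimate instead of throwing.
     */
    public static int estimateClassCount(OptionalInt exactFromDocValues,
                                         List<Set<String>> termsPerLeaf) {
        return exactFromDocValues.isPresent()
                ? exactFromDocValues.getAsInt()
                : upperBoundFromLeaves(termsPerLeaf);
    }

    public static void main(String[] args) {
        // Two leaves sharing the class "politics": 3 distinct classes overall.
        List<Set<String>> leaves = List.of(
                Set.of("sports", "politics"),
                Set.of("politics", "tech"));
        System.out.println(estimateClassCount(OptionalInt.empty(), leaves));
        System.out.println(estimateClassCount(OptionalInt.of(3), leaves));
    }
}
```

In this example the fallback reports 4 classes although only 3 are distinct, which illustrates why the estimate is an upper bound: the splitter could still proceed, but the resulting split may be less evenly balanced than one based on doc values.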