GitHub user andrewpalumbo opened a pull request:
MAHOUT-1815: dsqDist(X,Y) and dsqDist(X) failing in flink tests.
After taking the very long way around (trying to repartition, etc.), it turns out that the rows just needed to be properly re-keyed.
Tests pass now.
That said, we may want to re-examine the implementation of FlinkOpAtB, as it seems pretty inefficient.
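
For anyone reading along, here is a rough sketch of why row keys matter for an A^T B computation. This is not the FlinkOpAtB code; it is a plain Python illustration that assumes A^T B is formed by joining the rows of A and B on their keys and summing the outer products. The names `at_b` and `rekey` are made up for this example, and `rekey` assumes the rows are already in corresponding order.

```python
def at_b(rows_a, rows_b):
    """Compute A^T B from two row-keyed matrices given as (key, row) pairs.

    Rows are matched by key; each matched pair contributes the outer
    product of the A row and the B row to the result.
    """
    b_by_key = dict(rows_b)
    n_cols_a = len(rows_a[0][1])
    n_cols_b = len(rows_b[0][1])
    result = [[0.0] * n_cols_b for _ in range(n_cols_a)]
    for key, a_row in rows_a:
        b_row = b_by_key.get(key)
        if b_row is None:
            continue  # no matching key: this row's contribution is lost
        for i, a_val in enumerate(a_row):
            for j, b_val in enumerate(b_row):
                result[i][j] += a_val * b_val
    return result


def rekey(rows):
    """Hypothetical helper: assign keys 0..n-1 in the rows' current order."""
    return [(i, row) for i, (_, row) in enumerate(rows)]


# A is keyed 0, 1; B holds the corresponding rows but under different keys.
A = [(0, [1.0, 2.0]), (1, [3.0, 4.0])]
B_shifted = [(10, [5.0, 6.0]), (11, [7.0, 8.0])]

mismatched = at_b(A, B_shifted)     # no keys match, so nothing is summed
fixed = at_b(A, rekey(B_shifted))   # keys line up again after re-keying

print(mismatched)  # [[0.0, 0.0], [0.0, 0.0]]
print(fixed)       # [[26.0, 30.0], [38.0, 44.0]]
```

With mismatched keys the join finds no pairs and the product comes out as all zeros. Once both sides share the same keys, the result matches the ordinary A^T B.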
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/andrewpalumbo/mahout
Alternatively you can review and apply these changes as the patch at:
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #197
Author: Andrew Palumbo <email@example.com>
properly re-key rows in FlinkOpAtB