Details
Type: Improvement
Status: Open
Priority: P3
Resolution: Unresolved
Description
stats.ApproximateUnique has an optional dependency on mmh3 [1], which is roughly 9x faster than md5. If that repository is problematic for users, we should look into alternatives.
Other options: sklearn.utils.murmurhash3_32
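A rough sketch of how a fallback chain across these options could look. This is not the current ApproximateUnique code; `pick_hash_fn` is a hypothetical helper name, and the choice to normalize every backend to an unsigned 32-bit value is an assumption made only so the results are comparable in shape.

```python
import hashlib


def pick_hash_fn():
    """Hypothetical helper: return (backend_name, fn), where fn maps bytes
    to an unsigned 32-bit int, preferring faster backends when installed."""
    try:
        import mmh3

        # mmh3.hash returns a signed 32-bit int by default; mask to unsigned.
        return "mmh3", lambda data: int(mmh3.hash(data)) & 0xFFFFFFFF
    except ImportError:
        pass

    try:
        from sklearn.utils import murmurhash3_32

        # positive=True asks scikit-learn for the unsigned 32-bit value.
        return "sklearn", lambda data: int(
            murmurhash3_32(data, seed=0, positive=True))
    except ImportError:
        pass

    # Last resort: standard-library md5, truncated to its first 4 bytes.
    return "md5", lambda data: int.from_bytes(
        hashlib.md5(data).digest()[:4], "big")


backend, hash_fn = pick_hash_fn()
print(backend, hash_fn(b"beam"))
```

Both murmurhash backends implement 32-bit MurmurHash3, so they are expected to agree for the same seed; the md5 fallback yields different values, so estimates built with one backend should not be mixed with hashes from another.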
[1] https://github.com/hajimes/mmh3, https://pypi.org/project/mmh3/2.0/
cc: tvalentyn