Details
-
Improvement
-
Status: Resolved
-
Minor
-
Resolution: Invalid
-
1.3.0
-
None
-
None
Description
Spark MLlib fpm#FPGrowth minSupport should use long instead
==============
val minCount = math.ceil(minSupport * count).toLong
because:
1. [count]numbers of datasets is not kown before read.
2. [minSupport ]double precision.
from mahout#FPGrowthDriver.java
addOption("minSupport", "s", "(Optional) The minimum number of times a co-occurrence must be present."
+ " Default Value: 3", "3");
I just want to set minCount=2 for test.
Thanks.
Attachments
Issue Links
- is duplicated by
-
SPARK-10920 another constructor for FPGrowth algorithm to support the absolute value for support
- Resolved
- links to