Details
- Improvement
- Status: Resolved
- Major
- Resolution: Incomplete
- 1.6.0
- None
Description
In PrefixSpan we assume the input itemsets can contain multiple items, e.g., (ab)(cd). In some use cases every itemset contains a single item, e.g., abcd. In that case our implementation still pays the overhead of remembering the boundaries between itemsets. We could detect this case and use a specialized implementation for it.
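A minimal sketch of the idea, not the actual Spark code. It assumes item ids are positive integers and that the general path marks itemset boundaries with a delimiter value of 0; the names `DELIMITER`, `is_single_item`, `encode_with_boundaries`, `encode_flat`, and `encode_database` are hypothetical and used only for illustration.

```python
from typing import List, Tuple

# Assumption: 0 is reserved as the boundary marker and real item ids are > 0.
DELIMITER = 0

Sequence = List[List[int]]  # a sequence is an ordered list of itemsets


def is_single_item(sequences: List[Sequence]) -> bool:
    """Return True if every itemset in every sequence holds exactly one item."""
    return all(len(itemset) == 1 for seq in sequences for itemset in seq)


def encode_with_boundaries(seq: Sequence) -> List[int]:
    """General encoding: (ab)(cd) -> [0, a, b, 0, c, d, 0]."""
    encoded = [DELIMITER]
    for itemset in seq:
        encoded.extend(sorted(itemset))
        encoded.append(DELIMITER)
    return encoded


def encode_flat(seq: Sequence) -> List[int]:
    """Specialized encoding for single-item itemsets: abcd -> [a, b, c, d]."""
    return [itemset[0] for itemset in seq]


def encode_database(sequences: List[Sequence]) -> Tuple[List[List[int]], bool]:
    """Pick the flat encoding when possible, else keep the boundaries.

    The returned flag tells the caller which encoding was used, so the
    mining step can choose the matching code path.
    """
    if is_single_item(sequences):
        return [encode_flat(s) for s in sequences], True
    return [encode_with_boundaries(s) for s in sequences], False
```

For a four-item sequence such as abcd, the boundary-preserving encoding stores nine integers while the flat encoding stores four, which is the overhead described above. Whether a specialized mining routine on the flat encoding pays off in practice would need to be measured.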
Issue Links
- is duplicated by: SPARK-20179 Major improvements to Spark's Prefix span (Resolved)