Description
In SPARK-6487, we assume that all items are ordered. However, we should support non-temporal sequences in PrefixSpan. This should be done before 1.5 because it changes PrefixSpan APIs.
We can use `Array[Array[Int]]` or follow SPMF to use `Array[Int]` and use -1 to mark itemset boundaries. The latter is more efficient for storage. If we support generic item type, we can use null.
Attachments
Issue Links
- blocks
-
SPARK-9000 Support generic item type in PrefixSpan
- Resolved
- depends upon
-
SPARK-6487 Add sequential pattern mining algorithm PrefixSpan to Spark MLlib
- Resolved
- is blocked by
-
SPARK-8997 Improve LocalPrefixSpan performance
- Resolved
-
SPARK-8998 Collect enough frequent prefixes before projection in PrefixSpan
- Resolved
- relates to
-
SPARK-10028 Add Python API for PrefixSpan
- Resolved
- links to