[SPARK-8998] Collect enough frequent prefixes before projection in PrefixSpan - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 1.5.0
Fix Version/s: 1.5.0
Component/s: MLlib
Labels:
None

Target Version/s:

1.5.0

Description

The implementation in ~~SPARK-6487~~ might have scalability issues when the number of frequent items is very small. In this case, we can generate candidate sets of higher orders using Apriori-like algorithms and count them, until we collect enough prefixes.

Attachments

Issue Links

blocks

SPARK-8999 Support non-temporal sequence in PrefixSpan

Resolved

depends upon

SPARK-6487 Add sequential pattern mining algorithm PrefixSpan to Spark MLlib

Resolved

links to

[Github] Pull Request #7383 (zhangjiajin)

[Github] Pull Request #7412 (zhangjiajin)

[Github] Pull Request #7783 (feynmanliang)

Activity

People

Assignee:: Zhang JiaJin

Reporter:: Xiangrui Meng

Shepherd:: Xiangrui Meng

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Dates

Created:: 11/Jul/15 04:17

Updated:: 30/Jul/15 15:14

Resolved:: 30/Jul/15 15:14

Time Tracking

Estimated:

48h

Remaining:

48h

Logged:

Not Specified