  1. Mahout
  2. MAHOUT-709

FP-Growth Redundant patterns


    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Won't Fix
    • Affects Version/s: 0.4, 0.5
    • Fix Version/s: None
    • Component/s: None

      Description

      The algorithm outputs more patterns than are needed.

      I compared Mahout's PFP-Growth algorithm against the FP-Growth implementation at http://www.borgelt.net/fpgrowth.html. That implementation also has an option to generate closed patterns.

      When I filtered the sub-patterns out of the Parallel FP-Growth output, I arrived at the same result as the implementation at http://www.borgelt.net/fpgrowth.html.

      In short, you are not restricting the output to closed itemsets.
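
The filtering step mentioned above can be sketched roughly as follows. This is only an illustration, not the filter actually used for this report: it assumes the usual definition of a closed itemset (no proper superset with the same support), and the function name `closed_patterns` and the sample data are hypothetical, not taken from Mahout or from the attached files.

```python
from typing import Dict, FrozenSet


def closed_patterns(patterns: Dict[FrozenSet[str], int]) -> Dict[FrozenSet[str], int]:
    """Keep only patterns that have no proper superset with equal support."""
    closed = {}
    for items, support in patterns.items():
        # A pattern is redundant if some strictly larger pattern
        # occurs in exactly the same number of transactions.
        redundant = any(
            items < other and support == other_support
            for other, other_support in patterns.items()
        )
        if not redundant:
            closed[items] = support
    return closed


# Hypothetical frequent patterns with their supports (not the attached data).
mined = {
    frozenset({"a"}): 4,
    frozenset({"b"}): 3,
    frozenset({"a", "b"}): 3,
    frozenset({"c"}): 2,
    frozenset({"a", "b", "c"}): 2,
}

result = closed_patterns(mined)
```

With this sample input, `{b}` and `{c}` are dropped because `{a, b}` and `{a, b, c}` have the same supports, while `{a}`, `{a, b}` and `{a, b, c}` are kept. The pairwise comparison is quadratic in the number of patterns, which is fine for a small test database but would need an index for large outputs.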

      I am attaching the dummy database along with the output of both algorithms.

        Attachments

        1. SixTransactions.dat
          0.6 kB
          Yarco Hayduk
        2. patterns-converted.txt
          2 kB
          Yarco Hayduk
        3. dumpedPatterns
          9 kB
          Yarco Hayduk
        4. bresult-new.txt
          0.3 kB
          Yarco Hayduk


            People

            • Assignee: Robin Anil (robinanil)
            • Reporter: Yarco Hayduk (yarco)
