Uploaded image for project: 'Mahout'
  1. Mahout
  2. MAHOUT-108

Implementation of Assoication Rules learning by Apriori algorithm

    Details

    • Type: Task
    • Status: Closed
    • Priority: Major
    • Resolution: Won't Fix
    • Affects Version/s: None
    • Fix Version/s: 0.2
    • Component/s: None
    • Labels:
      None
    • Environment:

      Linux, Hadoop-0.17.1

      Description

      Target: Association Rules learning is a popular method for discovering interesting relations between variables in large databases. Here, we would implement the Apriori algorithm using Hadoop&Mapreduce parallel techniques.

      Applications: Typically, association rules learning is used to discover regularities between products in large scale transaction data in supermarkets. For example, the rule "

      {onions, patatoes}

      ->beef" found in the sales data would indicate that if a customer buys onions and potatoes together, he or she is likely to also buy beef. Such information can be used as the basis for decisions about marketing activities. In addition to the market basket analysis, association rules are employed today in many application areas including Web usage mining, intrusion detection and bioinformatics.

      Apriori algorithm: Apriori is the best-known algorithm to mine association rules. It uses a breadth-first search strategy to counting the support of itemsets and uses a candidate generation function which exploits the downward closure property of support

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              cmri_bcpdm chao deng
            • Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Time Tracking

                Estimated:
                Original Estimate - 504h
                504h
                Remaining:
                Remaining Estimate - 504h
                504h
                Logged:
                Time Spent - Not Specified
                Not Specified