Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-22867

Add Isolation Forest algorithm to MLlib

    XMLWordPrintableJSON

    Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Won't Fix
    • Affects Version/s: 2.2.1
    • Fix Version/s: None
    • Component/s: MLlib
    • Labels:
      None

      Description

      Isolation Forest (iForest) is an effective model that focuses on anomaly isolation.
      iForest uses tree structure for modeling data, iTree isolates anomalies closer to the root of the tree as compared to normal points.
      A anomaly score is calculated by iForest model to measure the abnormality of the data instances. The lower, the more abnormal.

      More details about iForest can be found in the following papers:
      <a href="https://dl.acm.org/citation.cfm?id=1511387">Isolation Forest</a> [1]
      and <a href="https://dl.acm.org/citation.cfm?id=2133363">Isolation-Based Anomaly Detection</a> [2].

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              Titicaca Fangzhou Yang
            • Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: