Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-22867

Add Isolation Forest algorithm to MLlib

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Resolved
    • Major
    • Resolution: Won't Fix
    • 2.2.1
    • None
    • MLlib
    • None

    Description

      Isolation Forest (iForest) is an effective model that focuses on anomaly isolation.
      iForest uses tree structure for modeling data, iTree isolates anomalies closer to the root of the tree as compared to normal points.
      A anomaly score is calculated by iForest model to measure the abnormality of the data instances. The lower, the more abnormal.

      More details about iForest can be found in the following papers:
      <a href="https://dl.acm.org/citation.cfm?id=1511387">Isolation Forest</a> [1]
      and <a href="https://dl.acm.org/citation.cfm?id=2133363">Isolation-Based Anomaly Detection</a> [2].

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned
            Titicaca Fangzhou Yang
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment