SystemDS / SYSTEMDS-3149

Decision Tree cleanups and Builtin

Details

    • Type: Task
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: SystemDS 3.2
    • Component/s: None
    • Labels: None

    Description

      Clean up the existing decision tree algorithm in SystemDS ("scripts/algorithms/decision-tree.dml") and convert it into a DML builtin function.

      As a preparation for a holistic cleanup of decision tree and random forest scripts (in scripts/builtin and scripts/algorithms), we should first introduce primitives for information gain, entropy, and gini (SYSTEMDS-3184), and devise vectorized prediction scripts.
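
      The impurity primitives mentioned above can be sketched as follows. This is a minimal illustration in plain Python, not the SystemDS implementation; the function names (class_probabilities, entropy, gini, information_gain) and the boolean split mask are assumptions made for this example.

```python
import math
from collections import Counter


def class_probabilities(labels):
    """Relative frequency of each distinct label (hypothetical helper)."""
    n = len(labels)
    return [count / n for count in Counter(labels).values()]


def entropy(labels):
    """Shannon entropy in bits: -sum(p * log2(p)) over the class frequencies."""
    return -sum(p * math.log2(p) for p in class_probabilities(labels))


def gini(labels):
    """Gini impurity: 1 - sum(p^2) over the class frequencies."""
    return 1.0 - sum(p * p for p in class_probabilities(labels))


def information_gain(labels, mask, impurity=entropy):
    """Impurity reduction of a binary split.

    mask[i] is True if sample i goes to the left child, False otherwise.
    Returns 0.0 for a degenerate split that leaves one child empty.
    """
    left = [y for y, m in zip(labels, mask) if m]
    right = [y for y, m in zip(labels, mask) if not m]
    if not left or not right:
        return 0.0
    n = len(labels)
    weighted = (len(left) / n) * impurity(left) + (len(right) / n) * impurity(right)
    return impurity(labels) - weighted


# Example data (made up for illustration): a split that separates both classes.
labels = [0, 0, 1, 1]
mask = [True, True, False, False]
gain_entropy = information_gain(labels, mask)
gain_gini = information_gain(labels, mask, impurity=gini)
```

      Passing the impurity measure as an argument keeps the gain computation independent of whether entropy or gini is used, which is one possible way to share code between the primitives.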

      In a first step, this task should introduce a new builtin/decisionTreePredict that implements the different strategies of the Hummingbird paper [1]. Initial tests can hard-code the vectorized decision tree representation and focus on testing the prediction procedure. The builtin function signature might expose a 'method' parameter to select among the strategies.

      [1] Supun Nakandala, Karla Saur, Gyeong-In Yu, Konstantinos Karanasos, Carlo Curino, Markus Weimer, Matteo Interlandi:
      A Tensor Compiler for Unified Machine Learning Prediction Serving. OSDI 2020: 899-917, https://www.usenix.org/system/files/osdi20-nakandala.pdf
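
      As an illustration of what a hard-coded vectorized tree and its prediction procedure could look like, the sketch below follows the GEMM strategy described in [1] (the paper also describes TreeTraversal and PerfectTreeTraversal variants). It is plain Python, not DML and not the decisionTreePredict builtin; the names predict_gemm and matmul, the example tree, and the convention that a sample goes left when feature < threshold are assumptions made for this example.

```python
def matmul(P, Q):
    """Naive matrix product of two list-of-lists matrices (hypothetical helper)."""
    return [[sum(p * q for p, q in zip(row, col)) for col in zip(*Q)] for row in P]


def predict_gemm(X, A, B, C, D, E):
    """Predict class indices for the rows of X using only matrix operations.

    A: features x internal nodes, A[i][j] = 1 if node j tests feature i
    B: threshold per internal node
    C: internal nodes x leaves, +1 if the leaf lies in the node's left subtree,
       -1 if it lies in the right subtree, 0 if the node is not an ancestor
    D: per leaf, the number of ancestors reached through a left edge
    E: leaves x classes, one-hot class label of each leaf
    """
    T = matmul(X, A)                                              # feature value tested at each node
    T = [[1 if v < b else 0 for v, b in zip(row, B)] for row in T]  # 1 = go left
    T = matmul(T, C)                                              # signed count of satisfied conditions
    T = [[1 if v == d else 0 for v, d in zip(row, D)] for row in T]  # one-hot selected leaf
    S = matmul(T, E)                                              # one-hot predicted class
    return [row.index(max(row)) for row in S]


# Hard-coded example tree (made up for illustration):
#   node 0: x[0] < 0.5 ? leaf 0 (class 0) : node 1
#   node 1: x[1] < 2.0 ? leaf 1 (class 1) : leaf 2 (class 0)
A = [[1, 0],
     [0, 1]]
B = [0.5, 2.0]
C = [[1, -1, -1],
     [0, 1, -1]]
D = [1, 1, 0]
E = [[1, 0],
     [0, 1],
     [1, 0]]

X = [[0.2, 5.0],   # left at node 0 -> leaf 0
     [0.9, 1.0],   # right at node 0, left at node 1 -> leaf 1
     [0.9, 3.0]]   # right at node 0, right at node 1 -> leaf 2
predictions = predict_gemm(X, A, B, C, D, E)
```

      Because the tree is encoded entirely in the matrices A through E, a test can fix these matrices by hand and check only the prediction step, independent of any training code.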

            People

              mboehm7 Matthias Boehm
              ssiddiqi Shafaq Siddiqi
              Votes: 1
              Watchers: 3