Uploaded image for project: 'Apache MADlib'
  1. Apache MADlib
  2. MADLIB-1029

Decision Tree's output summary table does not contain the right list independent variables

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • Module: Decision Tree
    • None

    Description

      Decision Tree's output summary table does not contain the right list independent variables.

      Steps to reproduce:

      select madlib.tree_train('abalone_2', – source table
      'adaboost_output_test', – output model table
      'rowid', – id column
      'sex', – response
      'length,diam,height,whole,shucked,viscera,shell,rings,color', – features
      NULL::text, – exclude columns
      'gini', – split criterion
      NULL::text, – no grouping
      NULL::text, – no weights
      5, – max depth
      3, – min split
      1, – min bucket
      10
      );

      gpadmin=# select * from adaboost_output_test_summary;
      [ RECORD 1 ]-------+--------------------
      method | tree_train
      is_classification | t
      source_table | abalone_2
      model_table | adaboost_output_test
      id_col_name | rowid
      dependent_varname | sex
      independent_varnames | color
      cat_features | color
      con_features |
      grouping_cols |
      num_all_groups | 1
      num_failed_groups | 0
      total_rows_processed | 2835
      total_rows_skipped | 0
      dependent_var_levels | "0","1"
      dependent_var_type | integer
      input_cp | 0.01
      independent_var_types | text

      Abalone data can be found here: https://archive.ics.uci.edu/ml/datasets/Abalone

      Attachments

        Activity

          People

            Unassigned Unassigned
            asong April Song
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: