Details
-
Bug
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
Description
Decision Tree's output summary table does not contain the right list independent variables.
Steps to reproduce:
select madlib.tree_train('abalone_2', – source table
'adaboost_output_test', – output model table
'rowid', – id column
'sex', – response
'length,diam,height,whole,shucked,viscera,shell,rings,color', – features
NULL::text, – exclude columns
'gini', – split criterion
NULL::text, – no grouping
NULL::text, – no weights
5, – max depth
3, – min split
1, – min bucket
10
);
gpadmin=# select * from adaboost_output_test_summary;
[ RECORD 1 ]-------+--------------------
method | tree_train
is_classification | t
source_table | abalone_2
model_table | adaboost_output_test
id_col_name | rowid
dependent_varname | sex
independent_varnames | color
cat_features | color
con_features |
grouping_cols |
num_all_groups | 1
num_failed_groups | 0
total_rows_processed | 2835
total_rows_skipped | 0
dependent_var_levels | "0","1"
dependent_var_type | integer
input_cp | 0.01
independent_var_types | text
Abalone data can be found here: https://archive.ics.uci.edu/ml/datasets/Abalone