Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
Description
As a user,
I want to have an easier way of accessing the variable importance output from random forest so that I can understand which are the most important variables.
Current method of getting variable importance for each variable (in a tabular format - assuming output table name is `rf_output`):
```
SELECT unnest(regexp_split_to_array(cat_features, ',')) as variable,
unnest(cat_var_importance) as importance
FROM rf_output_group, rf_output_summary;
```
This is a cumbersome query to write and has to be written twice - for categorical and for continuous features.
Attachments
Issue Links
- links to
- mentioned in
-
Page Loading...