Description
In addition to the current $HOME/bin/prepare_iris.sh script, we can create more data preparation scripts. More concretely, at least datasets used by tutorials in our user guide need to be supported:
- a9a
- news20
- kdd2010 a/b
- webspam
- E2006-tfidf
Unfortunately, we cannot automate to use datasets hosted by Kaggle because they require us to log-in to Kaggle.
Attachments
Issue Links
- relates to
-
HIVEMALL-106 Distribute Docker image in each ASF official release
- Open
-
HIVEMALL-84 Add docker support
- Closed