Uploaded image for project: 'Hivemall'
  1. Hivemall
  2. HIVEMALL-111

Add more ready-to-use data to the Docker image

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • 0.7.0

    Description

      In addition to the current $HOME/bin/prepare_iris.sh script, we can create more data preparation scripts. More concretely, at least datasets used by tutorials in our user guide need to be supported:

      • a9a
      • news20
      • kdd2010 a/b
      • webspam
      • E2006-tfidf

      Unfortunately, we cannot automate to use datasets hosted by Kaggle because they require us to log-in to Kaggle.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              takuti Takuya Kitazawa
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated: