Uploaded image for project: 'Mahout'
  1. Mahout
  2. MAHOUT-285

Wrap up collocation and dictionary vectorizer integration

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.3
    • 0.3
    • None
    • None

    Description

      Final bit of work to integrate collocations into 0.3

      • Modify collocation finder to use dictionary vectorizer output as input (saves analysis step)
      • Generate input dictionary for dictionary vectorizer that includes unigrams and collocations.

      Chatted with Robin this morning, I know what needs to be done it is just a matter of grinding out the code.

      Attachments

        1. MAHOUT-285.patch
          35 kB
          Drew Farris
        2. MAHOUT-285.patch
          39 kB
          Drew Farris
        3. MAHOUT-285.patch
          14 kB
          Drew Farris

        Activity

          People

            robinanil Robin Anil
            drew.farris Drew Farris
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - 48h
                48h
                Remaining:
                Remaining Estimate - 48h
                48h
                Logged:
                Time Spent - Not Specified
                Not Specified