Uploaded image for project: 'Mahout'
  1. Mahout
  2. MAHOUT-285

Wrap up collocation and dictionary vectorizer integration

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.3
    • Fix Version/s: 0.3
    • Component/s: None
    • Labels:
      None

      Description

      Final bit of work to integrate collocations into 0.3

      • Modify collocation finder to use dictionary vectorizer output as input (saves analysis step)
      • Generate input dictionary for dictionary vectorizer that includes unigrams and collocations.

      Chatted with Robin this morning, I know what needs to be done it is just a matter of grinding out the code.

        Attachments

        1. MAHOUT-285.patch
          14 kB
          Drew Farris
        2. MAHOUT-285.patch
          39 kB
          Drew Farris
        3. MAHOUT-285.patch
          35 kB
          Drew Farris

          Activity

            People

            • Assignee:
              robinanil Robin Anil
              Reporter:
              drew.farris Drew Farris
            • Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Time Tracking

                Estimated:
                Original Estimate - 48h
                48h
                Remaining:
                Remaining Estimate - 48h
                48h
                Logged:
                Time Spent - Not Specified
                Not Specified