Uploaded image for project: 'Mahout'
  1. Mahout
  2. MAHOUT-695

Have LDADriver determine numWords from input vectors

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • 0.5
    • 0.6
    • classic
    • None

    Description

      It bugged me that you needed to specify the number of words directly to the LDADriver
      eg ./bin/mahout lda \
      -i ./examples/bin/work/reuters-out-seqdir-sparse/tf-vectors \
      -o ./examples/bin/work/reuters-lda -k 20 -v 50000 -ow -x 20

      with this patch the ldadriver just checks a vector from the input to determine the size
      eg ./bin/mahout lda \
      -i ./examples/bin/work/reuters-out-seqdir-sparse/tf-vectors \
      -o ./examples/bin/work/reuters-lda -k 20 -ow -x 20

      Attachments

        1. mahout-695.patch
          4 kB
          Jake Mannix
        2. mahout-695.patch
          3 kB
          Mat Kelcey

        Activity

          People

            jake.mannix Jake Mannix
            mat_kelcey Mat Kelcey
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: