[MAHOUT-560] Support for more flexible file handling in text to sequence file conversion


Details

    • Type: Improvement
    • Status: Closed
    • Priority: Trivial
    • Resolution: Fixed
    • Affects Version/s: 0.5
    • Fix Version/s: 0.5
    • Component/s: classic
    • Labels: None

    Description

      Currently, SequenceFilesFromDirectory converts text files into a sequence file, but which files are selected (and potentially which text is taken from each file) is not configurable. I'd like to reuse most of the conversion logic while changing exactly which text is selected. (More information on what I want to do: http://tinyurl.com/35pv8jg )

      I slightly changed SequenceFilesFromDirectory to make that possible. I added one optional parameter; when it is omitted, the current behaviour is used.
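
      To illustrate the idea, here is a minimal sketch of how an optional selection hook with a backwards-compatible default could look. It is not the code from MAHOUT-560.patch: the class name PluggableTextSelection, the collect methods and the Predicate-based selector are all hypothetical, and the sketch returns a plain map of key/text pairs instead of writing a Hadoop SequenceFile so that it has no external dependencies.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Predicate;
import java.util.stream.Collectors;
import java.util.stream.Stream;

/** Hypothetical sketch; names and structure are assumptions, not Mahout's API. */
final class PluggableTextSelection {

    /** Default selector: every regular file is converted (the "current behaviour"). */
    static final Predicate<Path> ACCEPT_ALL = p -> true;

    /**
     * Walks inputDir and returns key -> text pairs for each selected file.
     * A real converter would append these pairs to a SequenceFile instead.
     */
    static Map<String, String> collect(Path inputDir, Predicate<Path> selector) throws IOException {
        List<Path> files;
        try (Stream<Path> walk = Files.walk(inputDir)) {
            files = walk.filter(Files::isRegularFile)
                        .filter(selector)
                        .sorted()
                        .collect(Collectors.toList());
        }
        Map<String, String> pairs = new LinkedHashMap<>();
        for (Path file : files) {
            // Key is the path relative to the input directory, with '/' separators.
            String key = "/" + inputDir.relativize(file).toString().replace('\\', '/');
            pairs.put(key, new String(Files.readAllBytes(file), StandardCharsets.UTF_8));
        }
        return pairs;
    }

    /** Overload without a selector, so existing callers keep the default behaviour. */
    static Map<String, String> collect(Path inputDir) throws IOException {
        return collect(inputDir, ACCEPT_ALL);
    }

    public static void main(String[] args) throws IOException {
        Path dir = Files.createTempDirectory("texts");
        Files.write(dir.resolve("a.txt"), "hello".getBytes(StandardCharsets.UTF_8));
        Files.write(dir.resolve("b.log"), "skip me".getBytes(StandardCharsets.UTF_8));

        // Default: both files are picked up.
        System.out.println(collect(dir).keySet());
        // Custom selector: only .txt files are converted.
        System.out.println(collect(dir, p -> p.toString().endsWith(".txt")).keySet());
    }
}
```

      Keeping the selector optional means callers that never pass one see no change, which matches the intent of leaving the current behaviour as the default.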

      Attachments

        1. MAHOUT-560.patch
          4 kB
          Isabel Drost-Fromm


          People

            Assignee: Isabel Drost-Fromm
            Reporter: Isabel Drost-Fromm
