Uploaded image for project: 'Mahout'
  1. Mahout
  2. MAHOUT-560

Support for more flexible file handling in text to sequence file conversion

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Trivial
    • Resolution: Fixed
    • Affects Version/s: 0.5
    • Fix Version/s: 0.5
    • Component/s: Integration
    • Labels:
      None

      Description

      Currently SequenceFilesFromDirectory supports for conversion of texts to sequence file. The exact file (and potentially text from file) selection is not configurable. I'd like to re-use most of the conversion logic but change the exact text selection. (More information on what exactly I want to do: http://tinyurl.com/35pv8jg )

      I slightly changed SequenceFilesFromDirectory to make that possible. (Added one additional optional parameter, but by default the current behaviour is used).

        Attachments

        1. MAHOUT-560.patch
          4 kB
          Isabel Drost-Fromm

          Activity

            People

            • Assignee:
              isabel Isabel Drost-Fromm
              Reporter:
              isabel Isabel Drost-Fromm
            • Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: