  1. Mahout
  2. MAHOUT-560

Support for more flexible file handling in text to sequence file conversion

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Trivial
    • Resolution: Fixed
    • Affects Version/s: 0.5
    • Fix Version/s: 0.5
    • Component/s: Integration
    • Labels:
      None

      Description

      Currently SequenceFilesFromDirectory supports converting text files to sequence files. Which files are selected (and potentially which text is taken from each file) is not configurable. I'd like to reuse most of the conversion logic but change the exact text selection. (More information on what exactly I want to do: http://tinyurl.com/35pv8jg )

      I slightly changed SequenceFilesFromDirectory to make that possible: I added one optional parameter, and when it is not given the current behaviour is used.
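      To illustrate the general idea of an optional parameter that swaps the file selection while keeping the default behaviour, here is a minimal sketch. It is not the attached patch: the class names (FilterSketch, AcceptAllFilter, TxtOnlyFilter) and the idea of passing a filter class name are assumptions made for illustration, and the sketch uses only java.io, with no Hadoop or Mahout classes.

```java
import java.io.File;
import java.io.FileFilter;
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch: pluggable file selection with a default fallback.
public class FilterSketch {

    /** Default behaviour: accept every file that is offered. */
    public static class AcceptAllFilter implements FileFilter {
        @Override
        public boolean accept(File f) {
            return true;
        }
    }

    /** Hypothetical custom selection: accept only files whose name ends in .txt. */
    public static class TxtOnlyFilter implements FileFilter {
        @Override
        public boolean accept(File f) {
            return f.getName().endsWith(".txt");
        }
    }

    /**
     * Builds the filter named by the optional parameter.
     * A null class name means the parameter was not given, so the default is used.
     */
    public static FileFilter loadFilter(String className) throws ReflectiveOperationException {
        if (className == null) {
            return new AcceptAllFilter();
        }
        return Class.forName(className)
                .asSubclass(FileFilter.class)
                .getDeclaredConstructor()
                .newInstance();
    }

    /** Walks the directory tree and collects the non-directory entries the filter accepts. */
    public static List<File> select(File dir, FileFilter filter) {
        List<File> selected = new ArrayList<>();
        File[] children = dir.listFiles();
        if (children == null) {
            return selected; // not a directory, or not readable
        }
        for (File child : children) {
            if (child.isDirectory()) {
                selected.addAll(select(child, filter));
            } else if (filter.accept(child)) {
                selected.add(child);
            }
        }
        return selected;
    }
}
```

      With this shape, the conversion logic that consumes the selected files stays the same whichever filter is loaded; only the selection step varies.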

      1. MAHOUT-560.patch
        4 kB
        Isabel Drost-Fromm

        Activity

        Isabel Drost-Fromm added a comment -

        Changes I made - any comments welcome (especially if there is an easier, more obvious way I have overlooked)

        Hudson added a comment -

        Integrated in Mahout-Quality #508 (See https://hudson.apache.org/hudson/job/Mahout-Quality/508/)
        MAHOUT-560 - allow for more flexible file handling when converting text
        files to sequence files.


          People

          • Assignee: Isabel Drost-Fromm
          • Reporter: Isabel Drost-Fromm
          • Votes: 0
          • Watchers: 0
