Uploaded image for project: 'Mahout'
  1. Mahout
  2. MAHOUT-212

Need random sampler for use in reducers

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.2
    • 0.3
    • classic
    • None

    Description

      For a variety of mining algorithms, it helps to have a uniform way to only process a sub-set of the records in a reducer.

      As such, I have written a simple generic sampler that filters an Iterator returning a fair sample of at most a specified size.

      Attachments

        1. MAHOUT-212.patch
          9 kB
          Ted Dunning
        2. MAHOUT-212-b.patch
          18 kB
          Ted Dunning
        3. MAHOUT-212-C.patch
          28 kB
          Sean R. Owen

        Activity

          People

            srowen Sean R. Owen
            tdunning Ted Dunning
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: