Mahout
  1. Mahout
  2. MAHOUT-444

Need on-line distribution summary statistics ... mean, median, min, max q25, q75

    Details

    • Type: Bug Bug
    • Status: Closed
    • Priority: Major Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.4
    • Component/s: None
    • Labels:
      None

      Description

      For the on-line learning algorithms it is very helpful to have robust on-line estimate of various summary statistics.

        Activity

        Transition Time In Source Status Execution Times Last Executer Last Execution Date
        Open Open Patch Available Patch Available
        1m 24s 1 Ted Dunning 22/Jul/10 01:57
        Patch Available Patch Available Resolved Resolved
        64d 11h 2m 1 Sean Owen 24/Sep/10 13:00
        Resolved Resolved Closed Closed
        37d 3h 49m 1 Sean Owen 31/Oct/10 15:49
        Sean Owen made changes -
        Status Resolved [ 5 ] Closed [ 6 ]
        Sean Owen made changes -
        Status Patch Available [ 10002 ] Resolved [ 5 ]
        Assignee Ted Dunning [ tdunning ]
        Fix Version/s 0.4 [ 12314396 ]
        Resolution Fixed [ 1 ]
        Hide
        Hudson added a comment -

        Integrated in Mahout-Quality #151 (See http://hudson.zones.apache.org/hudson/job/Mahout-Quality/151/)
        MAHOUT-444 - fixed one test and disabled the other

        Show
        Hudson added a comment - Integrated in Mahout-Quality #151 (See http://hudson.zones.apache.org/hudson/job/Mahout-Quality/151/ ) MAHOUT-444 - fixed one test and disabled the other
        Ted Dunning made changes -
        Attachment MAHOUT-444.patch [ 12450117 ]
        Ted Dunning made changes -
        Field Original Value New Value
        Status Open [ 1 ] Patch Available [ 10002 ]
        Hide
        Ted Dunning added a comment -

        Here is a patch with an object that estimates the five quartiles, mean and standard deviation. The associated tests indicate that it is pretty much as accurate as if all of the samples were kept and the empirical rank statistics were computed directly.

        Show
        Ted Dunning added a comment - Here is a patch with an object that estimates the five quartiles, mean and standard deviation. The associated tests indicate that it is pretty much as accurate as if all of the samples were kept and the empirical rank statistics were computed directly.
        Ted Dunning created issue -

          People

          • Assignee:
            Ted Dunning
            Reporter:
            Ted Dunning
          • Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development