Uploaded image for project: 'Mahout'
  1. Mahout
  2. MAHOUT-1361

Online algorithm for computing accurate Quantiles using 1-D clustering

    Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.9
    • Fix Version/s: 0.9
    • Component/s: Math
    • Labels:
      None

      Description

      Implementation of Ted Dunning's paper and initial work on this subject. See https://github.com/tdunning/t-digest/blob/master/docs/theory/t-digest-paper/histo.pdf for the paper.

      An on-line algorithm for computing approximations of rank-based statistics that allows controllable accuracy. This algorithm can also be used to compute hybrid statistics such as trimmed means in addition to computing arbitrary quantiles.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                smarthi Suneel Marthi
                Reporter:
                smarthi Suneel Marthi
              • Votes:
                1 Vote for this issue
                Watchers:
                6 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: