Uploaded image for project: 'Mahout'
  1. Mahout
  2. MAHOUT-1361

Online algorithm for computing accurate Quantiles using 1-D clustering

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.9
    • 0.9
    • classic
    • None

    Description

      Implementation of Ted Dunning's paper and initial work on this subject. See https://github.com/tdunning/t-digest/blob/master/docs/theory/t-digest-paper/histo.pdf for the paper.

      An on-line algorithm for computing approximations of rank-based statistics that allows controllable accuracy. This algorithm can also be used to compute hybrid statistics such as trimmed means in addition to computing arbitrary quantiles.

      Attachments

        1. MAHOUT-1361.patch
          51 kB
          Suneel Marthi

        Issue Links

          Activity

            People

              smarthi Suneel Marthi
              smarthi Suneel Marthi
              Votes:
              1 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: