  1. DataFu
  2. DATAFU-16

weighted reservoir sampling with exponential jumps UDF


Details

    • Type: New Feature
    • Status: Closed
    • Priority: Minor
    • Resolution: Won't Do
    • Environment: Mac, Linux; pig-0.11

    Description

      Create a weightedReservoirSampleWithExpJump UDF that implements the weighted reservoir sampling algorithm with exponential jumps. The investigation is tracked in https://github.com/linkedin/datafu/issues/80. This task is part of an experiment with different weighted sampling algorithms.
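
      For readers unfamiliar with the technique, the following is a minimal sketch of weighted reservoir sampling with exponential jumps, following the commonly described A-ExpJ scheme of Efraimidis and Spirakis. It is not the attached ScoredExpJmpReservoir.java and is not wired up as a Pig UDF; the class name ExpJumpReservoirSketch and the methods offer and sample are hypothetical names chosen for illustration. Each kept item carries a key u^(1/w); instead of drawing a random number for every incoming item, the sampler draws a single "jump" and skips items until their accumulated weight exceeds it.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.PriorityQueue;
import java.util.Random;

/** Illustrative sketch only: weighted reservoir sampling with exponential jumps. */
final class ExpJumpReservoirSketch<T> {
    /** A kept item together with its random key. */
    private static final class Entry<T> {
        final T item;
        final double key;
        Entry(T item, double key) { this.item = item; this.key = key; }
    }

    private final int capacity;
    private final Random rng;
    // Min-heap on the key, so the weakest kept item is always at the head.
    private final PriorityQueue<Entry<T>> heap;
    // Total weight that still has to stream past before the next replacement.
    private double jump;

    ExpJumpReservoirSketch(int capacity, long seed) {
        if (capacity <= 0) throw new IllegalArgumentException("capacity must be positive");
        this.capacity = capacity;
        this.rng = new Random(seed);
        this.heap = new PriorityQueue<>(capacity, Comparator.comparingDouble((Entry<T> e) -> e.key));
    }

    /** Offers one item of the stream with a strictly positive weight. */
    void offer(T item, double weight) {
        if (!(weight > 0.0)) throw new IllegalArgumentException("weight must be positive");
        if (heap.size() < capacity) {
            // Fill phase: every item is kept with key u^(1/w).
            heap.add(new Entry<>(item, Math.pow(uniformOpen(), 1.0 / weight)));
            if (heap.size() == capacity) jump = nextJump();
            return;
        }
        jump -= weight;
        if (jump <= 0.0) {
            // This item ends the jump: it replaces the current minimum.
            double t = Math.pow(heap.peek().key, weight);
            double r = t + (1.0 - t) * rng.nextDouble(); // uniform in [t, 1)
            heap.poll();
            heap.add(new Entry<>(item, Math.pow(r, 1.0 / weight)));
            jump = nextJump();
        }
    }

    /** Returns the items currently held, in no particular order. */
    List<T> sample() {
        List<T> out = new ArrayList<>(heap.size());
        for (Entry<T> e : heap) out.add(e.item);
        return out;
    }

    // Jump length log(u) / log(minKey), drawn after each change to the reservoir.
    private double nextJump() {
        return Math.log(uniformOpen()) / Math.log(heap.peek().key);
    }

    // Uniform draw in the open interval (0, 1); rejects an exact zero.
    private double uniformOpen() {
        double u;
        do { u = rng.nextDouble(); } while (u == 0.0);
        return u;
    }
}
```

      A caller would construct the sampler with the desired sample size, call offer(item, weight) once per tuple, and read sample() at the end. The sketch does not guard against numerical edge cases such as keys that round to exactly 1.0 for very large weights.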

      Attachments

        1. ScoredExpJmpReservoir.java
          5 kB
          jian wang
        2. ScoredReservoir.java
          3 kB
          jian wang
        3. WeightedSamplingCorrectnessTests.java
          19 kB
          jian wang

        Activity

          People

            Assignee: jian wang (king821221)
            Reporter: jian wang (king821221)
            Votes: 0
            Watchers: 3

            Dates

              Created:
              Updated:
              Resolved: