Uploaded image for project: 'Apache Apex Malhar'
  1. Apache Apex Malhar
  2. APEXMALHAR-1211

Evaluate usage of bloom filter for de-duplication

    XMLWordPrintableJSON

Details

    • Task
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • algorithms
    • None

    Description

      We should look at providing a bloom filter for de-dup. That way the size of state would be manageable. This approach would not be 100% guarantee, but would be very high (six sigma?) and may be viable for a lot of use cases

      Attachments

        Activity

          People

            chaithu Chaitanya Chebolu
            akekre Amol Kekre
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated: