Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-3581

[Rumen] Rumen anonymizer should handle composite string data

    XMLWordPrintableJSON

Details

    • rumen anonymization chunking

    Description

      Rumen's Anonymizer currently considers string as a single entity. At times, strings can be composed of smaller sub-strings which can be anonymized individually. Anonymizing sub-strings separately will result in retaining certain statistics like frequency ('daily', 'weekly' etc). This was brought up by Chris while developing the Anonymizer.

      Attachments

        Issue Links

          Activity

            People

              amar_kamat Amar Kamat
              amar_kamat Amar Kamat
              Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

                Created:
                Updated: