Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-4217

Fix documentation in BuildBloom

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 0.14.0
    • None
    • None
    • Reviewed

    Description

      /**

      • Build a bloom filter for use later in Bloom. This UDF is intended to run
      • in a group all job. For example:
      • define bb BuildBloom('jenkins', '100', '0.1');
      • A = load 'foo' as (x, y);
      • B = group A all;
      • C = foreach B generate BuildBloom(A.x);
      • store C into 'mybloom';
      • The bloom filter can be on multiple keys by passing more than one field
      • (or the entire bag) to BuildBloom.
      • The resulting file can then be used in a Bloom filter as:
      • define bloom Bloom(mybloom);
      • A = load 'foo' as (x, y);
      • B = load 'bar' as (z);
      • C = filter B by Bloom(z);
      • D = join C by z, A by x;
      • It uses {@link org.apache.hadoop.util.bloom.BloomFilter}

        .
        */

      Pig script inside above doc strings doesn't work

      Attachments

        1. PIG-4217-1.patch
          0.9 kB
          Praveen Rachabattuni

        Activity

          People

            praveenr019 Praveen Rachabattuni
            praveenr019 Praveen Rachabattuni
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: