Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-277

UDF for computing correlation and covariance between data sets

    Details

    • Type: New Feature
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.1.0
    • Component/s: None
    • Labels:
      None
    • Patch Info:
      Patch Available

      Description

      UDFs for computing correlation and covariance between data sets. Use following commands to compute covariance
      A = load 'input.xml' using PigStorage(':');
      B = group A all;
      define c COV('a','b','c');
      D = foreach B generate group,c(A.$0,A.$1,A.$2);

        Attachments

        1. newStats.patch
          30 kB
          Ajay Garg
        2. stat.patch
          27 kB
          Ajay Garg

          Activity

            People

            • Assignee:
              ajaygarg Ajay Garg
              Reporter:
              ajaygarg Ajay Garg
            • Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: