Affects Version/s: None
Fix Version/s: None
I have a script which takes in a command line parameter.
The script contains the following parameters:
Realistic use cases of SAMPLE require statisticians to calculate SAMPLE data on demand.
Ideally I would like to calculate SAMPLE from within Pig script without having to run one Pig script first get it's results and another to pass the results.
Ideal use case:
Change this Jira to only track sampling algorithm.
PIG-1926 is opened to track limit/sample taking scalar.
This is a candidate project for Google summer of code 2012. More information about the program can be found at https://cwiki.apache.org/confluence/display/PIG/GSoc2012