[PIG-4190] Implement replicated join in Spark engine - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Sub-task
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: spark-branch
Component/s: spark
Labels:
None

Description

Related e2e tests: Union_7, Union_8, Union_13

Sample script:
a = load '/user/pig/tests/data/singlefile/studenttab10k' as (name, age, gpa);
b = load '/user/pig/tests/data/singlefile/studentcolon10k' using PigStorage(':') as (name, age, gpa);
c = union a, b;
d = load '/user/pig/tests/data/singlefile/votertab10k' as (name, age, registration, contributions);
e = join c by name, d by name using 'replicated';
store e into '/user/pig/out/praveenr-1411380943-nightly.conf/Union_7.out';

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

PIG-4190.1.patch
13/May/15 19:16
19 kB
Mohit Sabharwal
PIG-4190.2.patch
14/May/15 05:01
19 kB
Mohit Sabharwal
PIG-4190.patch
04/May/15 04:09
18 kB
Mohit Sabharwal

Issue Links

is duplicated by

PIG-4278 Enable unit test "TestFRJoin" for spark

Resolved

links to

review board

Activity

People

Assignee:: Mohit Sabharwal

Reporter:: Praveen Rachabattuni

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Dates

Created:: 22/Sep/14 10:30

Updated:: 21/Jun/17 09:18

Resolved:: 15/May/15 04:25