|
|
|
PIG-5206
|
PIG-4856
Support outer join for SkewedJoin in spark mode
|
Xianda Ke
|
Xianda Ke
|
|
Resolved |
Duplicate
|
|
|
|
|
|
|
|
PIG-5196
|
PIG-4856
Enable persist/cache mechanism in Pig
|
Xianda Ke
|
Xianda Ke
|
|
Open |
Unresolved
|
|
|
|
|
|
|
|
PIG-5128
|
PIG-4856
Fix TestPigRunner.simpleMultiQueryTest3 unit test failure
|
Nándor Kollár
|
liyunzhang
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
PIG-5068
|
PIG-4856
Set SPARK_REDUCERS by pig.properties not by system configuration
|
liyunzhang
|
liyunzhang
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
PIG-5054
|
PIG-4856
Initialize SchemaTupleBackend correctly in backend in spark mode if spark job has more than 1 stage
|
Ádám Szita
|
liyunzhang
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
PIG-5052
|
PIG-4856
Initialize MRConfiguration.JOB_ID in spark mode correctly
|
Ádám Szita
|
liyunzhang
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
PIG-5051
|
PIG-4856
Initialize PigContants.TASK_INDEX in spark mode correctly
|
liyunzhang
|
liyunzhang
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
PIG-5047
|
PIG-4856
support outer join for skewedjoin in spark mode
|
Xianda Ke
|
Xianda Ke
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
PIG-5044
|
PIG-4856
Create SparkCompiler#getSamplingJob in spark mode
|
liyunzhang
|
liyunzhang
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
PIG-5029
|
PIG-4856
Optimize sort case when data is skewed
|
liyunzhang
|
liyunzhang
|
|
Patch Available |
Unresolved
|
|
|
|
|
|
|
|
PIG-5024
|
PIG-4856
add a physical operator to broadcast small RDDs
|
Xianda Ke
|
Xianda Ke
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
PIG-4970
|
PIG-4856
Remove the deserialize and serialization of JobConf in code for spark mode
|
liyunzhang
|
liyunzhang
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
PIG-4969
|
PIG-4856
Optimize combine case for spark mode
|
liyunzhang
|
liyunzhang
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
PIG-4952
|
PIG-4856
Calculate the value of parallism for spark mode
|
liyunzhang
|
liyunzhang
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
PIG-4890
|
PIG-4856
Run pigmix on spark on yarn with multiple nodes
|
Unassigned
|
liyunzhang
|
|
Resolved |
Duplicate
|
|
|
|
|
|
|
|
PIG-4871
|
PIG-4856
Not use OperatorPlan#forceConnect in MultiQueryOptimizationSpark
|
liyunzhang
|
liyunzhang
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
PIG-4863
|
PIG-4856
Re-design Spark plan to optimize the RDD pipeline
|
Unassigned
|
liyunzhang
|
|
Open |
Unresolved
|
|
|
|
|
|
|
|
PIG-4858
|
PIG-4856
Implement Skewed join for spark engine
|
Xianda Ke
|
liyunzhang
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
PIG-4846
|
PIG-4856
Use pigmix to test the performance of pig on spark
|
liyunzhang
|
liyunzhang
|
|
Open |
Unresolved
|
|
|
|
|
|
|
|
PIG-4839
|
PIG-4856
MultiQueryOptimizerSpark doesn't remove all redudant nodes in spark plan
|
liyunzhang
|
liyunzhang
|
|
Open |
Unresolved
|
|
|
|
|
|
|
|
PIG-4810
|
PIG-4856
Implement Merge join for spark engine
|
Xianda Ke
|
liyunzhang
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
PIG-4797
|
PIG-4856
Optimization for join/group case for spark mode
|
liyunzhang
|
Pallavi Rao
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
PIG-4771
|
PIG-4856
Implement FR Join for spark engine
|
liyunzhang
|
liyunzhang
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
PIG-4553
|
PIG-4856
Implement secondary sort using one shuffle
|
liyunzhang
|
liyunzhang
|
|
Closed |
Fixed
|
|
|
|
|