[HIVE-7372] Select query gives unpredictable incorrect result when parallelism is greater than 1 [Spark Branch] - ASF JIRA

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: spark-branch
Fix Version/s: None
Component/s: Spark
Labels: None

Description

In SparkClient.java, unpredictable, incorrect results may be observed when the following property is set to a value greater than 1 (the line is shown here with the value 1):

    sparkConf.set("spark.default.parallelism", "1");

A concurrency issue is suspected: when parallelism is greater than 1, Spark may process multiple datasets in a single JVM in order to use multiple cores.
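
The suspected class of problem can be illustrated in plain Java. This is a minimal sketch under stated assumptions, not code from Hive or Spark: the class `SharedStateDemo`, its methods, and the counter are all hypothetical, and this report does not identify which state, if any, is actually shared. It only shows how tasks running as threads in one JVM can lose updates when they write to shared, unsynchronized state, and how the same work stays exact when each task keeps its own state.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only: these names do not come from Hive or Spark.
final class SharedStateDemo {
    // One counter shared by every task in the JVM, deliberately unsynchronized.
    private static int sharedRowCount = 0;

    // Runs `tasks` threads that all increment the single shared counter.
    static int runShared(int tasks, int rowsPerTask) {
        sharedRowCount = 0;
        List<Thread> threads = new ArrayList<>();
        for (int t = 0; t < tasks; t++) {
            threads.add(new Thread(() -> {
                for (int i = 0; i < rowsPerTask; i++) {
                    sharedRowCount++; // non-atomic read-modify-write: updates can be lost
                }
            }));
        }
        startAndJoin(threads);
        return sharedRowCount;
    }

    // Runs the same work, but each task counts into its own slot.
    static int runIsolated(int tasks, int rowsPerTask) {
        int[] perTask = new int[tasks];
        List<Thread> threads = new ArrayList<>();
        for (int t = 0; t < tasks; t++) {
            final int slot = t;
            threads.add(new Thread(() -> {
                int local = 0;
                for (int i = 0; i < rowsPerTask; i++) {
                    local++;
                }
                perTask[slot] = local; // written once, read only after join()
            }));
        }
        startAndJoin(threads);
        int total = 0;
        for (int count : perTask) {
            total += count;
        }
        return total;
    }

    // Starts every thread, then waits for all of them to finish.
    private static void startAndJoin(List<Thread> threads) {
        for (Thread th : threads) {
            th.start();
        }
        for (Thread th : threads) {
            try {
                th.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                throw new IllegalStateException(e);
            }
        }
    }

    public static void main(String[] args) {
        int tasks = 4;
        int rowsPerTask = 100_000;
        System.out.println("expected: " + (tasks * rowsPerTask));
        System.out.println("shared:   " + runShared(tasks, rowsPerTask));
        System.out.println("isolated: " + runIsolated(tasks, rowsPerTask));
    }
}
```

With a single task, both variants return the same total. With several tasks, the shared variant can return a total below the expected one, and the shortfall can differ from run to run, which matches the "unpredictable" character of the symptom described here.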

NO PRECOMMIT TESTS. This is for spark branch only.

Attachments

HIVE-7372.patch, 10/Jul/14 07:46, 1 kB, Chengxiang Li
HIVE-7372-Spark.1.patch, 11/Jul/14 03:00, 1 kB, Chengxiang Li

Issue Links

is part of

HIVE-7292 Hive on Spark (Resolved)

People

Assignee: Chengxiang Li

Reporter: Xuefu Zhang

Votes: 0

Watchers: 2

Dates

Created: 09/Jul/14 15:11

Updated: 11/Jul/14 05:39

Resolved: 11/Jul/14 05:38