SPARK-16017

YarnClientSchedulerBackend now registers backends as IPs instead of hostnames, which causes all tasks to run with RACK_LOCAL locality.

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 1.6.2, 2.0.0
    • Fix Version/s: 1.6.2, 2.0.0
    • Component/s: Spark Core
    • Labels: None

    Description

      Since this change: SPARK-15395

      When registering new executor backends, Spark now registers them by IP address instead of hostname. This has a flow-on effect: when the TaskSetManager tries to determine at which locality level tasks should run, no tasks can be scheduled at the NODE_LOCAL level.

      This specific call:
      https://github.com/apache/spark/blob/branch-2.0/core/src/main/scala/org/apache/spark/scheduler/TaskSetManager.scala#L886

      pendingTasksForHost is keyed by hostnames pulled from the DFS block locations, while hasExecutorsAliveOnHost consults executorsByHost, which is keyed by IPs because it is populated from the RpcAddress.
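
      To make the mismatch concrete, here is a minimal, self-contained Scala sketch; the map contents and the one-line hasExecutorsAliveOnHost are invented for illustration, while the real structures live in TaskSetManager and TaskSchedulerImpl:

      object LocalityMismatch extends App {
        // Keyed by hostnames, because HDFS reports block locations as hostnames.
        val pendingTasksForHost = Map("worker-1.example.com" -> Seq(0, 1, 2))

        // Keyed by IPs after SPARK-15395, because entries are populated from RpcAddress.
        val executorsByHost = Map("10.0.0.5" -> Set("executor-1"))

        def hasExecutorsAliveOnHost(host: String): Boolean =
          executorsByHost.contains(host)

        // NODE_LOCAL is only offered when some host with pending tasks has a live
        // executor. The two key spaces never intersect, so this is always false
        // and every task falls back to RACK_LOCAL or ANY.
        val nodeLocalPossible = pendingTasksForHost.keys.exists(hasExecutorsAliveOnHost)
        println(s"NODE_LOCAL possible: $nodeLocalPossible") // prints: false
      }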

      As expected, this causes significant performance problems: a simple count query takes 22 seconds, but if I revert the change from SPARK-15395, tasks run with NODE_LOCAL locality and the same count takes 3 seconds.

          Activity

            vanzin Marcelo Masiero Vanzin added a comment -

            This is a pretty bad performance regression, zsxwing. Very tempted to up it to blocker.

            vanzin Marcelo Masiero Vanzin added a comment -

            BTW SPARK-15395 doesn't really explain a real use case where the old code caused a problem; given that we've had a few releases with that code and I've never seen any issues related to it, at this point I'd suggest just reverting that patch and, if there's a real issue, fixing it properly in the next point release.

            vanzin Marcelo Masiero Vanzin added a comment -

            In fact this shouldn't affect just YARN, but anything that uses HDFS or any storage system that reports hostnames when providing info about splits.
            zsxwing Shixiong Zhu added a comment -

            vanzin what do you think about just sending the hostname from CoarseGrainedSchedulerBackend and setting it in https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/scheduler/cluster/CoarseGrainedSchedulerBackend.scala#L167?

            SPARK-15395 was meant to improve robustness. If there is no way to fix this one, I agree we should just revert SPARK-15395.

            vanzin Marcelo Masiero Vanzin added a comment -

            I assume you mean from CoarseGrainedExecutorBackend? That could work. You could also resolve the executor's IP in CoarseGrainedSchedulerBackend, although that might be slower.
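
            As an illustration of that first option, a rough sketch in which the executor backend resolves and sends its own hostname at registration time; the message and field names below are invented for the example, not Spark's actual RPC protocol:

            import java.net.InetAddress

            // Illustrative registration message; the real messages live in
            // CoarseGrainedClusterMessages and carry more fields.
            case class RegisterExecutor(executorId: String, hostname: String, cores: Int)

            object ExecutorRegistrationSketch extends App {
              // Executor side: resolve our own hostname and send it explicitly,
              // instead of letting the driver derive the host from the RpcAddress
              // (which yields an IP).
              val hostname = InetAddress.getLocalHost.getHostName
              val msg = RegisterExecutor("executor-1", hostname, cores = 4)

              // The driver would then key its executor bookkeeping off
              // msg.hostname rather than the sender's RPC address.
              println(msg)
            }
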
            zsxwing Shixiong Zhu added a comment -

            Yes, thanks for correcting. I will submit a PR for 2.0. In the meantime, let's revert my patch for branch-1.6, as it would be too large a change for 1.6.
            zsxwing Shixiong Zhu added a comment -

            vanzin Just realized one problem: if the user starts the executors using IP addresses, we will still report IPs to the TaskScheduler. vanzin do you know how HDFS reports hostnames? Will it report hostnames even if the user starts the datanodes using IP addresses? If so, my proposal won't fix it and I think "resolving the executor's IP in CoarseGrainedSchedulerBackend" is better.
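
            For reference, that driver-side alternative amounts to a reverse DNS lookup on the address seen at registration; a minimal sketch, assuming reverse DNS is configured in the cluster:

            import java.net.InetAddress

            object ReverseResolveSketch extends App {
              // Driver side: take the IP seen on the RPC connection and reverse-resolve
              // it to a hostname. getCanonicalHostName performs a blocking DNS lookup
              // and falls back to the IP string when no reverse mapping exists, which
              // is why this option might be slower.
              def resolveHostname(ip: String): String =
                InetAddress.getByName(ip).getCanonicalHostName

              println(resolveHostname("127.0.0.1")) // e.g. "localhost"
            }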

            vanzin Marcelo Masiero Vanzin added a comment -

            The default configuration of HDFS seems to disallow the case where the DN connects with just an IP address - the hostname has to be resolved. So I think it's pretty safe to assume that you'll always have hostnames.

            (That code is in HDFS's DatanodeManager.java, btw.)
            apachespark Apache Spark added a comment -

            User 'zsxwing' has created a pull request for this issue:
            https://github.com/apache/spark/pull/13741

            srowen Sean R. Owen added a comment -

            BTW, while we're here... is this related to what https://github.com/apache/spark/pull/8533 was talking about? I know this issue is narrowly about SPARK-15395, but I'm wondering whether that is something that is or was already resolved.
            zsxwing Shixiong Zhu added a comment -

            Just to confirm: you agree with just sending the hostname from the executors to the driver, right?

            vanzin Marcelo Masiero Vanzin added a comment -

            That seems like it might create a similar situation, but it was filed against a version of Spark that didn't even have the code that caused this particular regression. So my hunch is: same symptom, different root cause, which may or may not have been fixed in newer Spark versions.

            vanzin Marcelo Masiero Vanzin added a comment -

            Yes, given the HDFS restrictions, that should be enough.
            zsxwing Shixiong Zhu added a comment -

            tleftwich Could you help test https://github.com/apache/spark/pull/13741 in your environment, please?
            zsxwing Shixiong Zhu added a comment -

            FYI, I reverted SPARK-15395 for branch-1.6.


            tleftwich Trystan Leftwich added a comment -

            zsxwing I'll run the tests here shortly and get back to you.

            tleftwich Trystan Leftwich added a comment -

            zsxwing I've tested your fix locally and it's all working as expected. Thanks.
            zsxwing Shixiong Zhu added a comment -

            tleftwich Thanks! I'm merging it into 2.0 now!


            People

              Assignee: zsxwing Shixiong Zhu
              Reporter: tleftwich Trystan Leftwich
              Votes: 0
              Watchers: 8
