Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Incomplete
-
1.4.1
-
None
Description
Locality Level is ANY on "Details for Stage" WebUI page
When a sc.textFile(XX) program is running, Locality Level is ANY which should be NODE_LOCAL on stage 0.
org.apache.spark.scheduler.TaskSetManager
// Check for node-local tasks
if (TaskLocality.isAllowed(locality, TaskLocality.NODE_LOCAL)) {
for (index <- speculatableTasks if canRunOnHost(index)) {
val locations = tasks(index).preferredLocations.map(_.host)
if (locations.contains(host))Unknown macro: { speculatableTasks -= index return Some((index, TaskLocality.NODE_LOCAL)) }}
}
The variable "locations" is hostname of HDFS split, which is from InetAddress.getHostName.
The variable "host" is ip of Executor, which is from InetAddress.getLocalHost.getHostAddress.
org.apache.spark.deploy.worker.WorkerArguments
var host = Utils.localHostName()
Attachments
Issue Links
- is duplicated by
-
SPARK-10006 Locality broken in spark 1.4.x for NewHadoopRDD
- Resolved
- is related to
-
SPARK-5113 Audit and document use of hostnames and IP addresses in Spark
- Resolved
- links to