[HADOOP-4296] Spasm of JobClient failures on successful jobs every once in a while - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Blocker
Resolution: Fixed
Affects Version/s: 0.17.1
Fix Version/s: 0.19.0
Component/s: None
Labels:
None

Hadoop Flags:

Reviewed

Description

At very busy times - we get a wave of job client failures all at the same time. the failures come when the job is about to complete. when we look at the job history files - the jobs are actually complete. Here's the stack:

08/09/27 02:18:00 INFO mapred.JobClient: map 100% reduce 98%
08/09/27 02:18:41 INFO mapred.JobClient: map 100% reduce 99%
java.lang.NullPointerException
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:993)
at com.facebook.hive.common.columnSetLoader.main(columnSetLoader.java:535)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at org.apache.hadoop.util.RunJar.main(RunJar.java:155)

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

4296_jt_delayretire4.patch
20/Oct/08 05:44
3 kB
Dhruba Borthakur
4296_jt_delayretire3.patch
16/Oct/08 19:07
3 kB
Dhruba Borthakur
4296_jt_delayretire2.patch
16/Oct/08 07:14
2 kB
Dhruba Borthakur
4296_jt_delayretire.patch
01/Oct/08 01:34
3 kB
Dhruba Borthakur

Activity

People

Assignee:: Dhruba Borthakur

Reporter:: Joydeep Sen Sarma

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 27/Sep/08 17:13

Updated:: 08/Jul/09 16:53

Resolved:: 21/Oct/08 06:19