Hadoop Common · HADOOP-1324

FSError encountered by one running task should not be fatal to other tasks on that node


Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.12.3
    • Fix Version/s: 0.13.0
    • Component/s: None
    • Labels: None

    Description

      Currently, if one task encounters an FSError, it reports the error to the TaskTracker, and the TaskTracker reinitializes itself, effectively losing the state of all the other running tasks as well. This can probably be improved, especially after the fix for HADOOP-1252. Rather than reinitializing itself, the TaskTracker should probably just get blacklisted for that job. Other tasks should be allowed to continue as long as they can (completing successfully, or failing, whether due to disk problems or otherwise).
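      The proposed behavior can be sketched as follows. This is a minimal illustrative model, not Hadoop's actual TaskTracker code: all class, method, and field names here (TaskTrackerSketch, onFsError, blacklistedJobs, etc.) are hypothetical, and the real implementation would involve the JobTracker protocol. The point it demonstrates is that an FSError fails only the offending task and blacklists the node for that job, while the other tasks keep running.

```java
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;

// Hypothetical sketch of per-task FSError handling (names are illustrative,
// not Hadoop's real API). Old behavior: an FSError from any task caused the
// whole TaskTracker to reinitialize, losing every running task's state.
// Sketched behavior: fail only the reporting task and blacklist this
// tracker for that task's job; leave other tasks untouched.
public class TaskTrackerSketch {
    enum State { RUNNING, FAILED, SUCCEEDED }

    static class Task {
        final String id;
        State state = State.RUNNING;
        Task(String id) { this.id = id; }
    }

    final Map<String, Task> runningTasks = new LinkedHashMap<>();
    final Set<String> blacklistedJobs = new HashSet<>();

    void launch(String taskId) {
        runningTasks.put(taskId, new Task(taskId));
    }

    // Called when a task reports an FSError for its job.
    void onFsError(String taskId, String jobId) {
        Task t = runningTasks.get(taskId);
        if (t != null) {
            t.state = State.FAILED;       // fail only this task
        }
        blacklistedJobs.add(jobId);       // node skipped for this job from now on
        // Crucially: no reinitialization here, so other tasks keep their state.
    }

    boolean isRunning(String taskId) {
        Task t = runningTasks.get(taskId);
        return t != null && t.state == State.RUNNING;
    }
}
```

      In this sketch, a disk failure surfacing in one task leaves every other task in RUNNING state, which is the improvement the issue asks for.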

      Attachments

        Activity


          People

            Assignee: acmurthy Arun Murthy
            Reporter: ddas Devaraj Das
            Votes: 0
            Watchers: 0

            Dates

              Created:
              Updated:
              Resolved:
