Hadoop Map/Reduce / MAPREDUCE-1763

JobHistory should enable history collection after a timeout or some other event

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Critical
    • Resolution: Duplicate
    • Affects Version/s: 0.20.1
    • Fix Version/s: None
    • Component/s: jobtracker
    • Labels: None
    • Tags: history jobtracker

    Description

      If you search for disableHistory in JobHistory.java, you will find that job history is enabled only at initialization time. There are two cases in which job history can be disabled:

      • if it fails to initialize the output directories
      • if it fails to create a single job history file

      This causes a few problems. First, there is no way to reset the flag even if the original problem goes away. Second, the two cases should probably be handled separately. As a result, once the creation of a single job history file fails, the whole job history mechanism is disabled and there is no way to switch it back on.

      One simple solution is to add a timeout after which we try to re-enable job history collection. Another is to have more granular job history control, per job.

      Alex K
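
      The two suggestions in the description could be combined roughly as sketched below. This is only an illustration under stated assumptions, not the code in JobHistory.java and not the fix that was eventually adopted: the class HistoryGate, its method names, and the retry interval parameter are all hypothetical. A directory initialization failure disables history globally but only until a timeout elapses, while a failure to create one job's history file disables history for that job alone.

```java
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

/**
 * Hypothetical helper (not part of Hadoop) that replaces a one-way
 * "disableHistory" flag with a timed global gate plus per-job gates.
 */
final class HistoryGate {
    private final long retryIntervalMillis;

    // Time of the last global failure, or -1 while history is globally enabled.
    private long globalFailureAtMillis = -1L;

    // IDs of jobs whose own history file could not be created.
    private final Set<String> disabledJobs = ConcurrentHashMap.newKeySet();

    HistoryGate(long retryIntervalMillis) {
        this.retryIntervalMillis = retryIntervalMillis;
    }

    /** Call when the output directories cannot be initialized. */
    synchronized void recordGlobalFailure(long nowMillis) {
        globalFailureAtMillis = nowMillis;
    }

    /** Call when a retry of the global initialization succeeds. */
    synchronized void recordGlobalSuccess() {
        globalFailureAtMillis = -1L;
    }

    /** Call when the history file for a single job cannot be created. */
    void recordJobFailure(String jobId) {
        disabledJobs.add(jobId);
    }

    /** Returns true if history logging should be attempted for this job now. */
    synchronized boolean shouldLog(String jobId, long nowMillis) {
        if (disabledJobs.contains(jobId)) {
            return false; // only this job is affected
        }
        if (globalFailureAtMillis < 0) {
            return true; // no global failure on record
        }
        // Global failure on record: allow a new attempt once the timeout elapsed.
        return nowMillis - globalFailureAtMillis >= retryIntervalMillis;
    }
}
```

      In this sketch the caller passes the current time explicitly (for example System.currentTimeMillis()). Once the timeout has elapsed, shouldLog keeps returning true until the caller reports the outcome of the retry through recordGlobalFailure or recordGlobalSuccess.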

          People

            Assignee: Unassigned
            Reporter: Alex Kozlov (alexvk)