Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-212

want InputFormat for task logs

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Resolved
    • Major
    • Resolution: Won't Fix
    • None
    • None
    • None
    • None

    Description

      We should provide an InputFormat implementation that includes all the task logs from a job. Folks should be able to do something like:

      job = new JobConf();
      job.setInputFormatClass(TaskLogInputFormat.class);
      TaskLogInputFormat.setJobId(jobId);
      ...

      Tasks should ideally be localized to the node that each log is on.

      Examining logs should be as lightweight as possible, to facilitate debugging. It should not require a copy to HDFS. A faster debug loop is like a faster search engine: it makes people more productive. The sooner one can find that, e.g., most tasks failed with a NullPointerException on line 723, the better.

      Attachments

        1. hadoop1199.patch
          13 kB
          Michael Stack
        2. hadoop1199-v2.patch
          14 kB
          Michael Stack

        Activity

          People

            Unassigned Unassigned
            cutting Doug Cutting
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: