Details

    • Type: Sub-task
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.20.1
    • Fix Version/s: 0.21.0
    • Component/s: None
    • Labels: None
    • Hadoop Flags: Incompatible change, Reviewed
    • Release Note:
      Changes the Job History file format to use JSON.
      Simplifies the Job History parsing logic.
      Removes duplication of code between HistoryViewer and the JSP files.
      History files are now named JobID_user.
      Introduces a new cluster-level configuration, "mapreduce.cluster.jobhistory.maxage", for configuring how long history files are kept before being cleaned up.
      The configuration "hadoop.job.history.user.location" is no longer supported.

      Description

      Currently, parsing the job history logs with external tools is very difficult because of the format. The most critical problem is that newlines aren't escaped in the strings. That makes using tools like grep, sed, and awk very tricky.

      Attachments

      1. mapred-157-10Sep.patch
        455 kB
        Jothi Padmanabhan
      2. mapred-157-15Sep.patch
        468 kB
        Jothi Padmanabhan
      3. mapred-157-15Sep-v1.patch
        469 kB
        Jothi Padmanabhan
      4. mapred-157-16Sep.patch
        469 kB
        Jothi Padmanabhan
      5. mapred-157-16Sep-v1.patch
        470 kB
        Jothi Padmanabhan
      6. mapred-157-4Sep.patch
        451 kB
        Jothi Padmanabhan
      7. mapred-157-7Sep.patch
        454 kB
        Jothi Padmanabhan
      8. mapred-157-7Sep-v1.patch
        454 kB
        Jothi Padmanabhan
      9. mapred-157-prelim.patch
        101 kB
        Jothi Padmanabhan
      10. MAPREDUCE-157-avro.patch
        10 kB
        Doug Cutting
      11. MAPREDUCE-157-avro.patch
        4 kB
        Doug Cutting

        Issue Links

          Activity

          Owen O'Malley added a comment -

          The current format quotes the character set ["\.]. The format is

          key kind1="val1" kind2="val2"\n

          with newlines in the values converted to '\n' and quotes converted to '\"'. We should also consider how to represent counters better. The current format is pretty painful.

          Amar Kamat added a comment -

          I think for now we can do something like this:

          # encoding step
          1. escape all ';' in values
          2. replace '\n' with ';'
          
          # decoding step
          1. replace all unescaped ';' with \n
          2. unescape all ';'
          

          For now this should work. Externally, the error messages would look like ';'-separated values, and one line in the job history will contain exactly one event. Thoughts?
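          As a rough illustration (a sketch of the steps above, not code from any patch; using backslash as the escape character is an assumption), the encoding and decoding could look like this in Java:

          static String encode(String value) {
            return value.replace("\\", "\\\\")   // protect the escape character itself
                        .replace(";", "\\;")     // step 1: escape literal semicolons
                        .replace("\n", ";");     // step 2: newlines become unescaped semicolons
          }

          static String decode(String encoded) {
            StringBuilder out = new StringBuilder();
            for (int i = 0; i < encoded.length(); i++) {
              char c = encoded.charAt(i);
              if (c == '\\' && i + 1 < encoded.length()) {
                out.append(encoded.charAt(++i)); // unescape '\;' and '\\'
              } else if (c == ';') {
                out.append('\n');                // an unescaped ';' was a newline
              } else {
                out.append(c);
              }
            }
            return out.toString();
          }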

          Owen O'Malley added a comment -

          I think we should completely redesign the format. I'd propose using JSON so that it is trivial to parse in Python, Perl, and Java. If we only put newlines between records, all of the needs are met using a standard layout. Furthermore, we can encode counters simply and directly rather than with complicated nested encoding schemes.

          Owen O'Malley added a comment -

          Doug suggests using the Jackson library for JSON parsing and generation. Its URL is http://jackson.codehaus.org/.

          Owen O'Malley added a comment - edited

          It would look like:

          {"KIND":"MapAttempt",
           "TASK_TYPE":"MAP",
           "TASKID":"task_200904210931_0001_m_180280",
           "TASK_ATTEMPT_ID":"attempt_200904210931_0001_m_180280_0",
           "TASK_STATUS":"SUCCESS",
           "FINISH_TIME":"1240321545820",
           "HOSTNAME":"/rack1/node1.purple.ygrid.yahoo.com",
           "STATE_STRING":"",
           "COUNTERS":[{"ID":"org.apache.hadoop.mapred.Task$Counter", "NAME":"Map-Reduce Framework"
                                     "LIST":[{"ID":"COMBINE_OUTPUT_RECORDS", "NAME":"Combine output records", "VALUE":0},
                                              {"ID":"MAP_INPUT_RECORDS","NAME":"Map input records", "VALUE":3363235}]}]
          }
          

          To support grep, we shouldn't add any newlines.
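          As a quick illustration of what the one-record-per-line layout buys, here is a minimal Java sketch (assuming Jackson 1.x; the file argument and field names are only illustrative) that scans a history file line by line and filters records, much as grep would:

          import java.io.BufferedReader;
          import java.io.FileReader;
          import org.codehaus.jackson.JsonNode;
          import org.codehaus.jackson.map.ObjectMapper;

          public class HistoryGrep {
            public static void main(String[] args) throws Exception {
              ObjectMapper mapper = new ObjectMapper();
              BufferedReader in = new BufferedReader(new FileReader(args[0]));
              String line;
              while ((line = in.readLine()) != null) {
                JsonNode rec = mapper.readTree(line); // each line is one complete record
                if ("MapAttempt".equals(rec.path("KIND").getTextValue()))
                  System.out.println(rec.path("TASK_ATTEMPT_ID").getTextValue());
              }
              in.close();
            }
          }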

          Philip Zeyliger added a comment -

          I'm +1 on JSON. It might make sense to introduce a hook to have multiple sinks for this sort of log data. I'd like to siphon off some of the data to a database, and I could also see some folks wishing to compress these logs.

          Todd Lipcon added a comment -

          Along the lines of what Philip said, I'd like to propose using HADOOP-5640 to expose the hooks for this job-history logging system. This would allow people to log in whichever format they prefer (e.g. JDBC to SQL, JSON, Thrift to something custom, etc.) from an external JAR (without modifying mapred source).

          Owen O'Malley added a comment -

          Would adding a log4j logger that is used only for these events work for you? That is certainly something I'd be interested in seeing. Because these event logs are used for JobTracker restart, I don't think making it pluggable is a good plan.

          Philip Zeyliger added a comment -

          What do you mean by "event logs are used for JobTracker restart"?

          I'd like to get at these objects in a structured way; i.e., I don't want to parse them from a String. Will a log4j logger let me do that?

          – Philip

          Philip Zeyliger added a comment -

          Ah, right, JobTracker.RecoveryManager seems to use the filenames for the logs to restart jobs.

          – Philip

          Jothi Padmanabhan added a comment -

          Here is a proposal for the change to write JobHistory events in JSON format.

          1. We will decouple the generation of events from the actual writing/reading of events. The JobHistory module will generate events and pass them on to event writers, which do the actual writing of events to the underlying stream. Similarly, on the reading front, event readers will read data from the underlying stream and generate events, which are then passed on to the callers (history viewers, other external log aggregators).
          2. In addition, there would be a provision to stream events directly to external listeners as and when they are generated (See HistoryListener Interface in the code snippet below).
          3. The Framework's event writer would write the events to a local file in JSON format. We will use http://jackson.codehaus.org/
          4. For modularity, we will have abstract classes for HistoryEvents, HistoryEventWriters and HistoryEventReaders. Events will have a kind and a type. Examples of kinds include Job, Task, and TaskAttempt. Each kind can support multiple types. Example types for Job include Submitted, Inited, and Finished (among others).
          5. While writing JSON data, each record will be on a separate line by itself. There will not be any newlines within a record.
          6. Each event class would support a toJSON() method that would serialize the event into a JsonNode. Event writers can use this method to write this event in the JSON format to the underlying stream. If the event writers want to write to a different format, they could choose either to parse this JsonNode object or query the Event itself after ascertaining its kind and type.
          7. Similarly, each Event class would support a constructor that takes a JsonNode Object to create an event instance by the event readers while reading from the underlying stream.
          8. Currently, the JobConf object is stored as a separate file, independent of the actual JobHistoryFile. We could possibly store the conf contents as a part of the history file itself. We could wrap the conf object as a special event that is logged during the job submission time.

          Here are some illustrative code snippets

          
          public abstract class HistoryEvent {
          
            protected String type;
            protected HistoryEventKind kind;
            
            public static enum HistoryEventKind {JOB, TASK, TASKATTEMPT, ...}
          
            public String getEventType( ) { return type; }
            public HistoryEventKind getEventKind() { return kind; }
            
            public abstract JsonNode toJSON(JsonNodeFactory jnf);
            
            public HistoryEvent(JsonNode node) { }
          
            public HistoryEvent() {}
          }
          
          public abstract class JobHistoryEvent extends HistoryEvent {
            public JobHistoryEvent() { kind = HistoryEventKind.JOB; }
            public JobHistoryEvent(JsonNode node) { kind = HistoryEventKind.JOB;}
          }
          
          // An example implementation of the JobSubmittedEvent
          
          public class JobSubmittedEvent extends JobHistoryEvent {
          
            private JobID jobid;
            private  String jobName;
            private  String userName;
            private  long submitTime;
            private  Path jobConfPath;
            
            public JobSubmittedEvent(JobID id, String jobName, String userName,
                long submitTime, Path jobConfPath) {
              super();
              this.jobid = id;
              this.jobName = jobName;
              this.userName = userName;
              this.submitTime = submitTime;
              this.jobConfPath = jobConfPath;
              type = "SUBMITTED";
            }
            
            public JobID getJobid() { return jobid; }
            public String getJobName() { return jobName; }
            public String getUserName() { return userName; }
            // other getters
            
            public JobSubmittedEvent(JsonNode node) {
              
            // Code to generate event from JsonNode
              
            }
          
            @Override
            public JsonNode toJSON(JsonNodeFactory jnf) {
              ObjectNode node = new ObjectNode(jnf);
              node.put("EVENT_KIND", kind.toString());
              node.put("EVENT_TYPE", type);
              node.put("JOB_ID", jobid.toString());
              node.put("JOB_NAME", jobName);
              node.put("USER_NAME", userName);
              node.put("SUBMIT_TIME", submitTime);
              node.put("JOB_CONF_PATH", jobConfPath.toString());
              return node;
            }
          
          }
          
          public abstract class HistoryEventWriter {
          
            public abstract void open(String name);
          
            public abstract void write(HistoryEvent event) throws IOException;
          
            public abstract void flush() throws IOException;
          
            public abstract void close() throws IOException;
          }
          
          
          public abstract class HistoryEventReader {
          
            public abstract void open(String name) throws IOException;
          
            public abstract Iterator<HistoryEvent> iterator();
          
            public abstract void close() throws IOException;
          
          }
          
          public interface HistoryListener {
            void handleHistoryEvent(HistoryEvent event);
          }
          
          
          Jothi Padmanabhan added a comment -

          Based on an offline discussion with Owen, Sharad and Devaraj, it does not appear that we have really strong use cases to support multiple formats for the JobHistory file. As a result, we will tie the format strongly to JSON and focus on reducing the number of objects created, writing information directly to the underlying stream wherever possible. While we will retain the event framework, we will simplify the interface compared to the previous design.

          One change is to write the event type preceding the actual event object, so that event readers can read the event type and then create the correct event class for the object that follows. We will, however, still have only one record per line. A line in the history file will now look like this:

          {"EVENT_TYPE":"JOB_SUBMITTED"} {"EVENT_KIND":"JOB","JOB_ID":"job_test_0000","JOB_NAME":"TEST-JOB-SUBMITTED","USER_NAME":"Jothi","SUBMIT_TIME":1249887005100,"JOB_CONF_PATH":"/tmp"}
          

          Events will now implement writeFields(JsonGenerator) and readFields(JsonParser) methods.

          The JobHistory module would create one event writer per jobId; event writers would translate this into one history file. The event writer will also internally create a JsonGenerator based on this file and would use this for writing the actual event (by calling event.writeFields).

          Similarly, the job history reading module would create one event reader per jobid/file. This would internally create one JsonParser that would be passed to the individual events' readFields method.

          
          interface HistoryEvent {
            void writeFields (JsonGenerator gen) throws IOException;
            void readFields(JsonParser parser) throws IOException;
          }
          
          class JobHistory {
          ...
             // Generate a history file based on jobId, then create a new EventWriter
             JsonEventWriter eventWriter = new JsonEventWriter(conf, historyFile);
             eventWriter.write(jobSubmittedEvent);
             eventWriter.write(jobFinishedEvent);
             ...
             eventWriter.close();
          
          }
          
          class SomeHistoryEventUser {
              JsonEventReader eventReader = new JsonEventReader(conf, historyFile);
              while ((ev = eventReader.getNextEvent()) != null) {
                //process ev
              }
             eventReader.close();
          }
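
          To make the flow concrete, here is a minimal sketch (assuming Jackson 1.x; newEventForType is a hypothetical factory, and the layout is the EVENT_TYPE header followed by the event object, as described above) of what the writer and reader could do internally:

          import java.io.IOException;
          import org.codehaus.jackson.JsonGenerator;
          import org.codehaus.jackson.JsonParser;

          class JsonEventSketch {
            // Writer side: emit the type header, then let the event write its own
            // object, keeping the whole record on a single line.
            static void writeEvent(JsonGenerator gen, HistoryEvent event, String type)
                throws IOException {
              gen.writeStartObject();
              gen.writeStringField("EVENT_TYPE", type);
              gen.writeEndObject();
              event.writeFields(gen);  // the event emits its own JSON object
              gen.writeRaw('\n');      // one record per line, no embedded newlines
            }

            // Reader side: read the type header, construct the matching event class,
            // then let it consume its own fields.
            static HistoryEvent readEvent(JsonParser parser) throws IOException {
              if (parser.nextToken() == null) return null; // end of stream
              parser.nextToken();                          // FIELD_NAME "EVENT_TYPE"
              parser.nextToken();                          // the type value
              String type = parser.getText();
              parser.nextToken();                          // END_OBJECT of the header
              HistoryEvent event = newEventForType(type);  // hypothetical factory
              event.readFields(parser);                    // consumes the event object
              return event;
            }

            static HistoryEvent newEventForType(String type) {
              // Dispatch on the type string to the right event class (omitted here).
              return null;
            }
          }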
          
          
          Philip Zeyliger added a comment -

          Would this be a good place to try the Avro serialization format in Hadoop proper? If text-formatting is desired, AVRO-50 has a text format for Avro, which is JSON already. So you'd basically be implementing the same thing, but with the extra context of an Avro schema.

          – Philip

          Philip Zeyliger added a comment -

          done


          Philip Zeyliger added a comment -

          Ack, ignore that "done". Was in the wrong browser tab.

          Jothi Padmanabhan added a comment -

          Would this be a good place to try the Avro serialization format in Hadoop proper?

          Possibly. But I think we could get the plain JSON implementation done for the first cut and look at Avro in the future, OK?

          Jeff Hammerbacher added a comment -

          I'm with Philip. I think a commitment to Avro now would be prudent. What's the advantage of cutting a corner now?

          Owen O'Malley added a comment - edited

          I'm confused what the goal of using Avro here would be.

          Let's review the goals:
          1. Get an easily parseable text format.
          2. Not require excessive amounts of time for logging
          2a. Not require excessive object allocations.

          It seems like to use Avro, we'd need to create the Avro objects and then write them out. I'd rather just use a JsonWriter to write the events out to the stream. Of course, reading is the reverse. It would be like writing XML files by generating the necessary DOM objects. You can do it (and in fact Configuration is written that way; sigh), but it costs a lot of time.

          Not having seen the Avro text format, I can't evaluate how much overhead it adds. None of the features of Avro seem compelling in this case, and they could easily lead to unfortunate choices.

          Furthermore, I don't know if there are any guarantees about the Avro text format's stability. We need stability in this format.

          Philip Zeyliger added a comment -

          Avro would force you into a schema, and I think having a schema is the only way to get stability in the format. Yes, there's probably overhead, but if we're using Avro for other things (i.e., all RPCs), we may as well fix those overheads when we get to them. (It may also be a net win to store the data in binary Avro format and write an "avrocat" to deserialize into text before pushing to tools like awk, but I do understand the desire for a text format.)

          All that said, you have specific needs in mind here, and I'm mostly waxing poetical, so I'll certainly defer.

          – Philip

          Jeff Hammerbacher added a comment -

          Hey,

          As Philip notes, you guys clearly are working to get work done, so I don't want to stand in the way. For future consideration, here are some reasons I thought using Avro here might be a win for the project as a whole:

          • There will certainly be more performance-critical uses of Avro to come inside of Hadoop, so if we punt on using it here because of performance considerations (2. and 2a.), I'd be concerned with regards to the feasibility of future uses (e.g. data block transfer). I saw this application as a fairly safe place to start getting comfortable with Avro in production use.
          • Similarly, for concern 1, having an Avro schema would allow users to generate intelligent parsers in many languages automagically, and having a useful Avro schema to parse inside of Hadoop could potentially encourage the development of new Avro client libraries.
          • Parsing JSON may be easy in many languages but enforcing structure using Avro's schemas will be valuable for communicating the format of the logs and encouraging the format's stability.
          • Also, it's likely that many of the tools in the Hadoop ecosystem that manipulate structured data (Pig, Hive, etc.) will have utilities for Avro that they may lack for JSON or other formats not central to the Hadoop ecosystem like Avro.

          Either way, I'm happy to see progress being made on a stable and sane format for these logs!

          Later,
          Jeff

          Doug Cutting added a comment -

          Owen> Of course reading is the reverse. It would be like writing xml files by generating the necessary DOM objects.

          Not sure what you mean. Jackson has an event-based JSON reading API.

          http://jackson.codehaus.org/1.2.0/javadoc/org/codehaus/jackson/JsonParser.html

          So, to efficiently read things back into structs you might use an enum of field names, e.g.:

          class Foo { int a; String b; }
          enum FooFields { A, B }

          void readFoo(JsonParser parser, Foo foo) throws IOException {
            if (parser.nextToken() != JsonToken.START_OBJECT)
              throw new IOException("expected start of object");
            while (parser.nextToken() != JsonToken.END_OBJECT) {
              String field = parser.getCurrentName();
              parser.nextToken(); // advance to the value token
              switch (Enum.valueOf(FooFields.class, field.toUpperCase())) {
              case A: foo.a = parser.getIntValue(); break;
              case B: foo.b = parser.getText(); break;
              }
            }
          }
          

          FWIW, Avro supports SAX-like streaming, without object creation. A significant change if we used Avro would be that we'd need to store the schema with the data. We could, for example, make the first line of log files the schema, or write a side file, but there's not much point to Avro data without storing a schema.

          Is the implicit schema proposed here Map<String,String>? For example, would integer values be written as JSON strings, with quotes, or as JSON integers, without quotes? If the schema is Map<String,String> and will be for all time, then there's less point to using Avro. But if fields are typed it might be nice to record the types in a schema.

          Jothi Padmanabhan added a comment -

          AVRO-50 has been committed to 1.0.1. We presumably cannot use that unless there is another Avro release before the Hadoop 0.21 release, no?

          Sharad Agarwal added a comment -

          Is the implicit schema proposed here Map<String,String>? For example, would integer values be written as JSON strings, with quotes, or as JSON integers, without quotes? If the schema is Map<String,String> and will be for all time, then there's less point to using Avro. But if fields are typed it might be nice to record the types in a schema.

          I think that is a reasonable statement. Also, apart from types, we would like to have nested records, not just key-values (counter info, etc.). So Avro looks like a good fit to me.

          We could, for example, make the first line of log files the schema, or write a side file, but there's not much point to Avro data without storing a schema.

          I think a side file would be better, as it won't bloat each file with the same info. We can have a union schema comprising all history events. Perhaps the first line could just be the Hadoop version number, since the schema file would correspond to it.

          Jothi Padmanabhan added a comment -

          Is the implicit schema proposed here Map<String,String>?

          No, I think it would just be more efficient to preserve the field types.

          think side file would be better as it won't bloat each file with the same info

          +1. Since we would have one history file per job and we might need to store the schema for several events, I think it makes sense to have a separate file instead of duplicating the schemas in each file. The downside is that we might have to store a version number as a header in each history file and read that out of band before deciding which version of the schema to choose.

          eric baldeschwieler added a comment -

          I think using Avro is interesting down the road. It seems too close to release, and too early in Avro's life, to do this now.

          Can we move forward with this as planned and then file another bug for the Avro conversion? I think that will take some more discussion.

          A union schema seems kind of heavy. What we would want here is a schema for each event type, so a parser could throw off a sequence of objects. Is a sequence of Avro records of mixed types something that is easy to express in Avro? One could clearly do it by having a sequence of "<type> : avro record\n" lines.

          Jothi Padmanabhan added a comment -

          Regarding the interface for readers, we could support two kinds of users:

          1. Users who want fine-grained control and would handle the individual events themselves.
          2. Users who want coarser-grained, summary-level information.

          Users of type 1, who want finer-grained information, could use event readers to iterate through events and do the necessary processing.

          For users of type 2, we could provide summary information through a JobHistoryParser class. This class would internally build the Job-Task-Attempt hierarchy by consuming all events using an event reader, and make the summary information available for users to access. Users could do something like:

          
          parser.init(history file or stream)
          
          JobInfo jobInfo = parser.getJobInfo();
          
          // use the getters to get jobinfo (example: start time, finish time, counters, id, user name, conf, total maps, total reds, among others)
          
          List<TaskInfo> taskInfoList = jobInfo.getAllTasks();
          
          // Iterate through the list and do necessary processing. Getters for taskinfo would include taskid, task type, status, splits, counters, etc
          
          List<TaskAttemptInfo> attemptsList = taskInfo.getAllAttempts();
          
          // Attempt info would have getters for attempt id, errors, status, state, start time, finish time, tracker name, port etc.
          
          

          Comments/Suggestions/Thoughts?

          Doug Cutting added a comment -

          Jothi, if you have an early version of this patch, please post it. That way we can better evaluate converting it to use Avro. Thanks!

          eric baldeschwieler added a comment -

          Re: AVRO conversion

          Doug and I chatted. My concern is that we are working through a job history refactor with a bunch of moving parts. I want to get those all into 21 and stable. Until that is done I don't want to consider AVRO since it might put that rework at risk (since our team has already invested in JSON). That said, I'm not against binary AVRO here. It could have advantages. If someone else can put in the time to demonstrate that this will work, I think that might be a better approach. I just hope we can do that as a distinct patch that follows this one.

          (Or collaborate to make one patch, we just don't have the resources before 21 freeze)

          (For the wider context of the refactor, see MAPREDUCE-863)

          Jothi Padmanabhan added a comment -

          Preliminary patch, as requested by Doug. I got this to produce a history file in JSON format (but only after commenting out lots of code), so it is more useful for illustration than anything else.

          My guess is that it should be fairly straightforward to port this to use Avro.

          Doug Cutting added a comment -

          Here's a patch that illustrates how Avro can be used to generate the classes equivalent to those in your patch.

          To write them to an output stream, you'd use something like:

          Encoder encoder = new BinaryEncoder(outputStream);
          DatumWriter writer = new SpecificDatumWriter(Events.Event._SCHEMA);
          ...
          Events.JobSubmitted submitted = new Events.JobSubmitted();
          submitted.kind = Events.Kind.SUBMITTED;
          submitted.jobId = ... ;
          submitted.jobName = ...;
          ...
          writer.write(submitted, encoder);
          ...
          writer.close();
          

          To read them, use something like:

          Decoder decoder = new BinaryDecoder(inputStream);
          DatumReader reader = new SpecificDatumReader(Events.Event._SCHEMA);
          ...
          Events.Event event = reader.read(decoder);
          switch (event.kind) {
          case JOB_SUBMITTED:
            Events.JobSubmitted submission = (Events.JobSubmitted)event.event;
            ...
            break;
          ...
          }
          
          Doug Cutting added a comment -

          A hint, run 'ant schemata' then look at build/src/org/apache/hadoop/mapreduce/jobhistory/Events.java to see the Java API that's generated.

          Jothi Padmanabhan added a comment -

          Patch for changing History to use JSON format. Some notes about the patch:

          All history information is logged using events.
          A version event is prepended to all history files.
          History Viewer and History Parser have been cleaned up, and the duplication of code between the JSP files and HistoryViewer has been removed.
          History files are named JobID_username. Filters on the UI page will now be based only on JobID and user name.
          History Viewer now takes a history file as an argument instead of an output directory.
          All events are made up of new API objects, including counters. As a result, I had to open up a couple of constructors in Counters to public.

          Hadoop-Vaidya has been changed to use the new History Viewer, but has not been tested with it.
          A temporary fix has been put in for Rumen to get it to compile; it still works only with the old history format, not the new one.

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12418622/mapred-157-4Sep.patch
          against trunk revision 811134.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 27 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed core unit tests.

          -1 contrib tests. The patch failed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/41/testReport/
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/41/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/41/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/41/console

          This message is automatically generated.

          Jothi Padmanabhan added a comment -

          Patch that includes ivy.xml changes to the contrib modules so that they have access to the Jackson library.
          Also modified TestCopyFiles so that it checks the map count correctly.

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12418781/mapred-157-7Sep.patch
          against trunk revision 812002.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 30 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed core unit tests.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/9/testReport/
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/9/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/9/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/9/console

          This message is automatically generated.

          Jothi Padmanabhan added a comment -

          Resubmitting the patch so that Hudson picks it up again – unable to explain why Hudson failed the patch

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12418781/mapred-157-7Sep.patch
          against trunk revision 812109.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 30 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed core unit tests.

          -1 contrib tests. The patch failed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/11/testReport/
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/11/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/11/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/11/console

          This message is automatically generated.

          Jothi Padmanabhan added a comment -

          Now, sqoop's ivy.xml needs to be updated too!

          Hadoop QA added a comment -

          +1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12418824/mapred-157-7Sep-v1.patch
          against trunk revision 812209.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 30 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed core unit tests.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/13/testReport/
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/13/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/13/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/13/console

          This message is automatically generated.

          Jothi Padmanabhan added a comment -

          Iyappan identified a few bugs. This patch fixes them.

          Hadoop QA added a comment -

          +1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12419138/mapred-157-10Sep.patch
          against trunk revision 813140.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 30 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed core unit tests.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/23/testReport/
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/23/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/23/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/23/console

          This message is automatically generated.

          Sharad Agarwal added a comment -

          Some comments:

          • Responsibilities of JobHistory and JobHistoryWriter are not clear. I think these should be merged into a single class named, say, JobHistoryManager
          • Base interface HistoryEvent should be public
          • Getters in event classes should be public
          • HISTORY_VERSION should be 1.0 instead of '0.21'
          • JobHistory#initDone is swallowing the exception
          • Code for localizing the job conf should be moved out of history
          • JobHistory has two logEvents methods. That is confusing. Better would be to have just one logEvent method, and no state check in logEvents. JobInProgress will explicitly create/close job history writers
          • The HistoryCleaner thread should check lastRan != 0, not lastRan == 0. Also it should use doneDirFs
          • Using:
            if (jobName == null || jobName.length() == 0) { jobName = "NA"; }

            is not nice. This was perhaps needed for some reason in the old format, but it is not required in the new one; JSON can represent an empty string directly (see the example after this list)

          • File name encoding is not required. The JobHistoryUtil class can go away
          • TaskInProgress and JobInProgress do a jobHistory != null check everywhere. This is not elegant.
          • A test case is missing from TestJobHistory#testDoneFolderOnHDFS
          • EventReader: readOneGroup should use a better message: throw new IOException("Internal error");
          • EventReader: getHistoryEventType need not be static; it can use the instance parser handle
          • The History Viewer command line has changed. It takes a file name instead of an output folder. JobClient#displayUsage and the commands manual documentation should be corrected.
          • All public APIs, including constructors, must be documented
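
          As a concrete illustration of the empty-job-name point above: in the JSON format a blank name can be written directly, so no "NA" sentinel is needed. The field names below are illustrative assumptions, not the actual schema:

              {"eventType":"JOB_SUBMITTED","jobid":"job_200909160000_0001","jobName":""}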
          Amar Kamat added a comment -

          Note: this JIRA should take care of MAPREDUCE-926 and MAPREDUCE-881.

          Philip Zeyliger added a comment -

          Some comments:

          • EventWriter class needs javadoc.
          • You can use Java 1.5 enum fanciness (http://java.sun.com/j2se/1.5.0/docs/guide/language/enums.html) to associate the event types with their enums.
            So, instead of storing this map in a separate file, where it will get out of date if someone not intimately familiar with the code wishes to modify it, you store the mapping inside the Enum object itself. So, instead of:
                classMap = 
                  new HashMap<HistoryEvent.EventType, Class<? extends HistoryEvent>>();
                classMap.put(HistoryEvent.EventType.JOB_SUBMITTED,
                    JobSubmittedEvent.class);
                classMap.put(HistoryEvent.EventType.JOB_INITED,
                    JobInitedEvent.class);
                ...
            

            you might do something like this example here:

            enum Collection {
              MAP(java.util.Map.class),
              LIST(java.util.List.class),
              QUEUE(java.util.Queue.class);

              private final Class<?> klass;

              Collection(Class<?> klass) {
                this.klass = klass;
              }

              Class<?> getKlass() {
                return this.klass;
              }

              // Reverse mapping, built once from the enum's own values.
              private static final Map<Class<?>, Collection> klasses =
                  new HashMap<Class<?>, Collection>();

              static {
                for (Collection f : Collection.values()) {
                  klasses.put(f.getKlass(), f);
                }
              }

              public static Collection getEnumForClass(Class<?> klass) {
                return klasses.get(klass);
              }
            }
            

            This puts the 1-1 mapping in the same place. It's impossible to add an enum value without specifying its associated value.

          • Honestly, the dozen or so *Event.java classes are begging for a framework. Each of those classes merely specifies a schema (a set of fields, each with a type) and implements a simple struct. Code generation is a typical solution for this sort of problem. You could also specify very little in each class and implement readFields and writeFields generically.

          – Philip

          Doug Cutting added a comment -

          > Honestly, the dozen or so *Event.java classes are begging to have a framework.

          I am working to convert this patch to use Avro. I have a schema that generates code for all of the event classes. I am in the process of converting each of the event classes to be a wrapper around the generated class, providing constructors and accessor methods to the generated class. The serialization will switch to using Avro binary. If Avro 1.1 is released this week as expected, then this could be trivially changed to generate JSON instead.
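
          For concreteness, here is a minimal sketch of that wrapper pattern, assuming Avro generates one plain record class per event; every name below (GeneratedJobInited, its fields) is a hypothetical stand-in rather than the actual generated API:

              // Hypothetical stand-in for an Avro-generated record class.
              class GeneratedJobInited {
                long jobid;
                long launchTime;
              }

              // Hand-written wrapper: constructor and typed accessors around
              // the generated class, as described above.
              class JobInitedEvent {
                private final GeneratedJobInited datum = new GeneratedJobInited();

                JobInitedEvent(long jobid, long launchTime) {
                  datum.jobid = jobid;
                  datum.launchTime = launchTime;
                }

                long getJobId() { return datum.jobid; }
                long getLaunchTime() { return datum.launchTime; }
              }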

          Jothi Padmanabhan added a comment -

          Patch that incorporates most of the review comments, including the use of enum fanciness to associate types with their enums.

          Honestly, the dozen or so *Event.java classes are begging to have a framework.

          Yes, I agree; I realized it too. But given that we are moving to Avro, which will do the code generation automatically, I left it simple for now rather than developing another framework.

          I am working to convert this patch to use Avro

          Doug, thanks. If possible, could you use this patch as your base patch?

          Jothi Padmanabhan added a comment -

          Attaching the correct patch

          Jothi Padmanabhan added a comment -

          I created MAPREDUCE-980 to track the Avro port

          Hadoop QA added a comment -

          +1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12419605/mapred-157-15Sep.patch
          against trunk revision 814749.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 42 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed core unit tests.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/31/testReport/
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/31/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/31/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/31/console

          This message is automatically generated.

          Jothi Padmanabhan added a comment -

          Incorporated some offline comments from Sharad

          Hadoop QA added a comment -

          +1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12419622/mapred-157-15Sep-v1.patch
          against trunk revision 815249.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 42 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed core unit tests.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/32/testReport/
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/32/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/32/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/32/console

          This message is automatically generated.

          Sharad Agarwal added a comment -

          Any reason the HistoryEvent interface doesn't have the method public EventCategory getEventCategory()? Otherwise there is no point in defining the category enum in the HistoryEvent interface, no? Also I think getEventType() and the enum EventCategory should be public, as event objects are expected to be consumed by clients in a different package. readFields and writeFields can stay non-public, as these would only ever be used by EventReader/EventWriter; they should be kept non-public in the implementations as well (see the sketch below).
          Otherwise patch looks good to me.
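
          A minimal sketch of the interface shape being suggested here; the enum values are assumptions for illustration, not the actual constants in the patch:

              // Sketch only; the categories and types shown are hypothetical examples.
              public interface HistoryEvent {
                enum EventCategory { JOB, TASK, TASK_ATTEMPT }
                enum EventType { JOB_SUBMITTED, JOB_INITED, TASK_STARTED }
                EventType getEventType();
                EventCategory getEventCategory();
              }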

          Sharad Agarwal added a comment -

          Oops. Realized HistoryEvent is an interface, not an abstract class. Please ignore my last comment about making getEventType public.

          eric baldeschwieler added a comment -

          Doug, so your patch will store binary Avro? If we can convert, we should probably convert all the way to native Avro. That will be more tested and more common than text Avro, and presumably require less storage. Presumably Avro will have a full toolset for dumping and exploring files like this, so the binary format should not be a problem?

          eric baldeschwieler added a comment -

          PS is this going to make code freeze?

          Hemanth Yamijala added a comment -

          Eric, we have done quite a bit of testing on Jothi's patch and are quite close to committing it. Given the code freeze date, I would request we stay the course as decided for this JIRA and use MAPREDUCE-980 to track the Avro port.

          eric baldeschwieler added a comment -

          Yup.


          E14 - via iPhone

          Doug Cutting added a comment -

          I will move the Avro port of this to MAPREDUCE-980. I do not want to impede progress here.

          > Doug, so your patch will store binary AVRO?

          Yes. Avro 1.0 (the current release) only supports binary.

          Avro 1.1 (out today if it can get one more PMC +1, please) adds JSON i/o.
          Printing the log as JSON will be easy if I can upgrade Hadoop to Avro 1.1 in MAPREDUCE-980.

          Jothi Padmanabhan added a comment -

          Patch updated to trunk.
          Removed the version event; version information is now written in the EventWriter constructor and read in the EventReader constructor (see the sketch below)
          Cleaned up the history cleaner time-to-run logic
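
          A hedged sketch of that constructor handshake; the class shapes, stream types, and version string are assumptions for illustration, not the patch's actual code:

              import java.io.DataInputStream;
              import java.io.DataOutputStream;
              import java.io.IOException;

              class EventWriter {
                static final String VERSION = "JobHistory-1.0"; // hypothetical version string

                EventWriter(DataOutputStream out) throws IOException {
                  out.writeUTF(VERSION); // write the version header once, at the start of the file
                }
              }

              class EventReader {
                EventReader(DataInputStream in) throws IOException {
                  String version = in.readUTF(); // read the header before any events
                  if (!EventWriter.VERSION.equals(version)) {
                    throw new IOException("Incompatible history file version: " + version);
                  }
                }
              }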

          Iyappan Srinivasan added a comment -

          +1 for QA

          1) Tried multiple restarts. Jobs are resubmitted properly and the links work correctly.
          2) Looked at the TT logs to verify that the numbers shown on the web page are correct.
          3) Checked all the links, including the "Analyse" logs page, for mistakes.
          4) Got more than 100 jobs into the history page to verify that pagination works.
          5) Searched by user and by job to verify that search works.
          6) Tried deleting some job.xml and job files in the done folder to see how the job history parser reacts.
          7) Tried corrupting some job files and then restarting the JT. They are blocked from viewing.
          8) Tried killing some jobs and some attempt ids using kill -9 to see if it is handled properly.
          9) Tried killing some jobs and some attempt ids using bin/hadoop job -kill to see if it is handled properly.
          10) With mapreduce.cluster.jobhistory.maxage set to 10*60*60 (10 mins), the job files are cleaned up after 10 minutes (see the example below).
          11) Ran some jobs and did a JT restart. After the restart, visited those jobs in job history and verified they exist.
          12) Set hadoop.job.history.location to an unreachable location and verified that the JT does not come up, throwing a proper error.
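
          For reference, the property can also be set programmatically as below; this is a minimal sketch that assumes the value is interpreted in milliseconds (the unit is an assumption, so check the cluster defaults):

              import org.apache.hadoop.conf.Configuration;

              public class MaxAgeExample {
                public static void main(String[] args) {
                  Configuration conf = new Configuration();
                  // 10 minutes, assuming the property is read in milliseconds
                  conf.setLong("mapreduce.cluster.jobhistory.maxage", 10 * 60 * 1000L);
                }
              }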

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12419758/mapred-157-16Sep.patch
          against trunk revision 815628.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 42 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed core unit tests.

          -1 contrib tests. The patch failed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/37/testReport/
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/37/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/37/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/37/console

          This message is automatically generated.

          Jothi Padmanabhan added a comment -

          Retrying Hudson

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12419758/mapred-157-16Sep.patch
          against trunk revision 815628.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 42 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed core unit tests.

          -1 contrib tests. The patch failed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/89/testReport/
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/89/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/89/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/89/console

          This message is automatically generated.

          Jothi Padmanabhan added a comment -

          Missed the ivy.xml changes to include the Jackson jars in the newly added contrib/gridmix

          Sharad Agarwal added a comment -

          +1. Once Hudson passes, this can be committed.

          Hadoop QA added a comment -

          +1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12419790/mapred-157-16Sep-v1.patch
          against trunk revision 815628.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 45 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed core unit tests.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/90/testReport/
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/90/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/90/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h6.grid.sp2.yahoo.net/90/console

          This message is automatically generated.

          Sharad Agarwal added a comment -

          I just committed this. Thanks Jothi.

          Hudson added a comment -

          Integrated in Hadoop-Mapreduce-trunk-Commit #43 (See http://hudson.zones.apache.org/hudson/job/Hadoop-Mapreduce-trunk-Commit/43/)
          MAPREDUCE-157. Refactor job history APIs and change the history format to JSON. Contributed by Jothi Padmanabhan.

          Hudson added a comment -

          Integrated in Hadoop-Mapreduce-trunk #85 (See http://hudson.zones.apache.org/hudson/job/Hadoop-Mapreduce-trunk/85/)
          MAPREDUCE-157. Refactor job history APIs and change the history format to JSON. Contributed by Jothi Padmanabhan.


            People

            • Assignee: Jothi Padmanabhan
            • Reporter: Owen O'Malley
            • Votes: 1
            • Watchers: 22
