Issue Details (XML | Word | Printable)

Key: HADOOP-4179
Type: New Feature New Feature
Status: Closed Closed
Resolution: Fixed
Priority: Major Major
Assignee: Suhas Gogate
Reporter: Suhas Gogate
Votes: 0
Watchers: 11
Operations

If you were logged in you would be able to see more operations.
Hadoop Common

Hadoop-Vaidya : Rule based performance diagnostic tool for Map/Reduce jobs

Created: 15/Sep/08 11:56 PM   Updated: 08/Jul/09 04:40 PM
Component/s: None
Affects Version/s: 0.18.1
Fix Version/s: 0.20.0

Time Tracking:
Not Specified

File Attachments:
  Size
Text File Licensed for inclusion in ASF works hadoop_vaidya_0200-1.patch 2008-11-25 12:29 PM Suhas Gogate 109 kB
Environment: ALL
Issue Links:
Reference
 

Hadoop Flags: Reviewed
Release Note: Introduced Vaidya rule based performance diagnostic tool for Map/Reduce jobs.
Resolution Date: 08/Dec/08 10:46 PM


 Description  « Hide
Hadoop-Vaidya is a rule based performance diagnostic tool for Map/Reduce jobs. It performs a post execution analysis of map/reduce job by parsing and collecting execution statistics through job history log and job configuration files. It runs a set of predefined tests/rules against job execution statistics to diagnose various performance problems. Each test rule detects a specific performance problem with job and provides a targeted advice to the user. This tool generates an XML report based on the evaluation results of individual test rules.

 All   Comments   Work Log   Change History   Subversion Commits      Sort Order: Ascending order - Click to sort in descending order
Suhas Gogate added a comment - 15/Sep/08 11:59 PM
patch added

Suhas Gogate added a comment - 16/Sep/08 12:04 AM
This JIRA takes care of post execution diagnosis of map/reduce job. The linked jira (HADOOP-3956) would extend it further for runtime progress analysis of Map/Reduce jobs.

Lohit Vijayarenu added a comment - 16/Sep/08 12:46 AM
This would be very useful to have. Can jobdetails.jsp or jobtracker.jsp (may be via StatusHttpServlet) serve this analysis after the job is done? That will help users to check how job performed directly from webUI.
PS : Definition of Vaidya

Milind Bhandarkar added a comment - 18/Sep/08 07:35 PM
Lohit,

This is the first version of Hadoop Vaidya, which works only in the postmortem analysis mode. Later, as needed, it can be extended to do analysis of a running job too.

P.S. Can you review the patch ? Thanks.


Mac Yang added a comment - 01/Oct/08 12:52 AM
Looks good over all. But the @author tags should probably be removed.

Devaraj Das added a comment - 10/Oct/08 01:02 PM
Comments on the coding style - Indentation needs to be fixed to conform to hadoop's coding conventions. Also the variable names are better read without the underscore prefix.

Enis Soztutar added a comment - 17/Oct/08 12:33 PM
moving this from 0.19 due to feature freeze.

Suhas Gogate added a comment - 19/Oct/08 08:10 PM
Thanks Lohit, Mac & Devraj for providing the comments. Attaching the new patch incorporating comments to make it comply with hadoop coding conventions.

Thanks.


Sachin added a comment - 23/Oct/08 11:00 PM
Hi Suhas, Lohith

We are students from University of Minnesota looking for a project for our distributed systems class. We found this project to be quite interesting. We would like to work on the extension for run time analysis. How feasible is that considering we have a time frame of 4 weeks to complete our project.

Thanks

Sachin


Milind Bhandarkar added a comment - 23/Oct/08 11:14 PM
Enis, I believe the feature freeze applies to hadoop core, and not contrib, right ?

Suhas Gogate added a comment - 23/Oct/08 11:27 PM
Hi Sachin, I sent you a detailed reply to your email couple of days back. Yes there is a possibility, although need to know few details about your familiarity with hadoop M/R and what is the start date for your project etc. We can discuss it off-line to keep JIRA updates related to technical discussion. will send you the email again today. thx. --Suhas

Milind Bhandarkar added a comment - 29/Oct/08 10:33 PM
Making it patch available.

Hadoop QA added a comment - 30/Oct/08 03:35 AM
-1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12392443/hadoop-vaidya-0.2.patch
against trunk revision 709040.

+1 @author. The patch does not contain any @author tags.

+1 tests included. The patch appears to include 23 new or modified tests.

-1 patch. The patch command could not apply the patch.

Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3509/console

This message is automatically generated.


Suhas Gogate added a comment - 01/Nov/08 01:09 AM
error applying patch. adding new patch.

Suhas Gogate added a comment - 01/Nov/08 05:01 PM
patch submitted along with documentation.

Suhas Gogate added a comment - 01/Nov/08 05:01 PM
cancelled

Suhas Gogate added a comment - 01/Nov/08 05:09 PM
latest patch file (hadoop_vaidya.0.3.patch) attached.

Suhas Gogate added a comment - 01/Nov/08 05:11 PM
making it patch available.

Hadoop QA added a comment - 03/Nov/08 05:26 PM
-1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12393197/hadoop_vaidya.0.3.patch
against trunk revision 709609.

+1 @author. The patch does not contain any @author tags.

+1 tests included. The patch appears to include 20 new or modified tests.

+1 javadoc. The javadoc tool did not generate any warning messages.

+1 javac. The applied patch does not increase the total number of javac compiler warnings.

+1 findbugs. The patch does not introduce any new Findbugs warnings.

+1 Eclipse classpath. The patch retains Eclipse classpath integrity.

-1 core tests. The patch failed core unit tests.

+1 contrib tests. The patch passed contrib unit tests.

Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3517/testReport/
Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3517/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3517/artifact/trunk/build/test/checkstyle-errors.html
Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3517/console

This message is automatically generated.


Suhas Gogate added a comment - 03/Nov/08 10:01 PM
Patch added for branch 0.19.0

Suhas Gogate added a comment - 03/Nov/08 10:01 PM
Added patch for trunk

Suhas Gogate added a comment - 03/Nov/08 10:03 PM
Making patch available

Owen O'Malley added a comment - 03/Nov/08 10:53 PM
This can't go into 0.19, because it is new functionality.

Why is it in chukwa, rather than some where more global? Does it depend on chukwa?


Suhas Gogate added a comment - 04/Nov/08 12:40 AM
1. Although it is an independent tool right now, eventually would be integrated with Chukwa (part of health care center), providing rule based analysis of the data it collects for M/R jobs. At this time it does not use any Chukwa interface/library etc.

2. As Milind mentioned in one of his earlier comments, I was under impression that feature freeze applies to hadoop core and not "contrib". Is it true?


Owen O'Malley added a comment - 04/Nov/08 12:51 AM
1. I think it would add value outside of Chukwa and be usable without Chukwa. Maybe the right place is another contrib project?

2. No. 0.19 is closed for both core and contrib.


Suhas Gogate added a comment - 04/Nov/08 01:51 AM
Owen, Sorry, I did not mean that Hadoop Vaidya would loose its independent identity after integration with Chukwa. Optionally It should also run outside Chukwa framework, and so I agree with you. Although based on prior discussions between Me, Runping, Mac and Milind, it appears to have better fit and more potential with Chukwa framework and so was decided to be part of Chukwa project.

Milind/Mac/Runping, comments?


Hadoop QA added a comment - 04/Nov/08 01:50 PM
-1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12393268/hadoop_vaidya_trunk.patch
against trunk revision 709609.

+1 @author. The patch does not contain any @author tags.

+1 tests included. The patch appears to include 20 new or modified tests.

+1 javadoc. The javadoc tool did not generate any warning messages.

+1 javac. The applied patch does not increase the total number of javac compiler warnings.

+1 findbugs. The patch does not introduce any new Findbugs warnings.

+1 Eclipse classpath. The patch retains Eclipse classpath integrity.

-1 core tests. The patch failed core unit tests.

+1 contrib tests. The patch passed contrib unit tests.

Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3525/testReport/
Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3525/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3525/artifact/trunk/build/test/checkstyle-errors.html
Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3525/console

This message is automatically generated.


Owen O'Malley added a comment - 09/Nov/08 02:19 PM
I believe that the right solution is to make this project a stand-alone contrib project and handle the chukwa dependencies dynamically. Otherwise, users that aren't using chukwa won't see this tool.

Suhas Gogate added a comment - 18/Nov/08 01:28 AM
making a patch available for hadoop-vaidya in the contrib (moved it outside the chukwa as thought initially).

Suhas Gogate added a comment - 18/Nov/08 05:12 PM
changed the fixed version to 0.20.0 (by mistake was selected 0.19 )

Hadoop QA added a comment - 21/Nov/08 06:08 AM
+1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12394120/hadoop-vaidya-trunk.patch
against trunk revision 719431.

+1 @author. The patch does not contain any @author tags.

+1 tests included. The patch appears to include 20 new or modified tests.

+1 javadoc. The javadoc tool did not generate any warning messages.

+1 javac. The applied patch does not increase the total number of javac compiler warnings.

+1 findbugs. The patch does not introduce any new Findbugs warnings.

+1 Eclipse classpath. The patch retains Eclipse classpath integrity.

+1 core tests. The patch passed core unit tests.

+1 contrib tests. The patch passed contrib unit tests.

Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3615/testReport/
Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3615/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3615/artifact/trunk/build/test/checkstyle-errors.html
Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3615/console

This message is automatically generated.


Suhas Gogate added a comment - 21/Nov/08 07:42 PM
Found a bug.. so canceling the patch and will be resubmitting the fixed version soon.

Suhas Gogate added a comment - 21/Nov/08 09:51 PM
submitting the patch

Hadoop QA added a comment - 23/Nov/08 05:54 AM
-1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12394453/hadoop_vaidya_0200.patch
against trunk revision 719787.

+1 @author. The patch does not contain any @author tags.

+1 tests included. The patch appears to include 21 new or modified tests.

+1 javadoc. The javadoc tool did not generate any warning messages.

+1 javac. The applied patch does not increase the total number of javac compiler warnings.

+1 findbugs. The patch does not introduce any new Findbugs warnings.

+1 Eclipse classpath. The patch retains Eclipse classpath integrity.

-1 core tests. The patch failed core unit tests.

+1 contrib tests. The patch passed contrib unit tests.

Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3636/testReport/
Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3636/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3636/artifact/trunk/build/test/checkstyle-errors.html
Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3636/console

This message is automatically generated.


Suhas Gogate added a comment - 24/Nov/08 06:22 PM
failed unit test is NOT related to this patch.

Owen O'Malley added a comment - 25/Nov/08 12:17 AM
Sorry, I've only glanced at the patch, but it would be better to make the contrib module 'vaidya' instead of 'hadoop-vaidya' since the hadoop is implied.

Suhas Gogate added a comment - 25/Nov/08 12:29 PM
New file attached hadoop_vaidya_0200-1.patch with changes making the module named vaidya under contrib (instead of hadoop-vaidya). Corresponding changes also made in the source and documentation.

Suhas Gogate added a comment - 25/Nov/08 12:30 PM
patch submitted

Hadoop QA added a comment - 26/Nov/08 01:41 PM
+1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12394648/hadoop_vaidya_0200-1.patch
against trunk revision 720698.

+1 @author. The patch does not contain any @author tags.

+1 tests included. The patch appears to include 21 new or modified tests.

+1 javadoc. The javadoc tool did not generate any warning messages.

+1 javac. The applied patch does not increase the total number of javac compiler warnings.

+1 findbugs. The patch does not introduce any new Findbugs warnings.

+1 Eclipse classpath. The patch retains Eclipse classpath integrity.

+1 core tests. The patch passed core unit tests.

+1 contrib tests. The patch passed contrib unit tests.

Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3653/testReport/
Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3653/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3653/artifact/trunk/build/test/checkstyle-errors.html
Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3653/console

This message is automatically generated.


Suhas Gogate added a comment - 05/Dec/08 07:14 PM
Can we commit this patch? some folks inquiring for it.

Owen O'Malley added a comment - 08/Dec/08 10:46 PM
I just committed this. Thanks, Suhas!

Hudson added a comment - 09/Dec/08 03:32 PM
Integrated in Hadoop-trunk #683 (See http://hudson.zones.apache.org/hudson/job/Hadoop-trunk/683/)
. Add Vaidya tool to analyze map/reduce job logs for performanc
problems. (Suhas Gogate via omalley)