Settings for RM scalability comparison:
The same GridMix settings were used for both Hadoop-0.20.204 and Hadoop-0.23.
From JobHistory parsing and the GridMix client, found that:
Runtime (seconds): 2473
GridMix Simulation Time Spent: 41mins 8sec
Workflow End: 2046 (from history parsing)
While looking at the GridMix log and JobHistory files:
1. According to the GridMix client, the last job completed was:
12/02/07 08:32:26 INFO gridmix.JobMonitor: GRIDMIX000029 (job_1328600848949_1182) success.
EndTime of the job is: 1328602818684 (Tue, 07 Feb 2012 08:20:18)
This means GridMix somehow received the job completion event about 12 minutes after the job actually completed.
2. Similarly, according to JobHistory, the last job completed was:
job_1328600848949_1162: 1328603121882 Tue, 07 Feb 2012 08:25:21
Whereas according to the GridMix client log:
12/02/07 08:32:08 INFO gridmix.JobMonitor: GRIDMIX000029 (job_1328600848949_1162) success
This again means GridMix received the job completion event nearly 7 minutes after the job actually finished.
This problem does not exist with Hadoop-0.20.204.
It seems that in Hadoop-0.23, GridMix somehow receives job completion events long after the jobs have actually completed.
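The delays quoted above can be re-derived from the two timestamps for each job. The following is a minimal Python sketch, not part of GridMix; the helper name completion_lag_seconds is hypothetical, and it assumes the GridMix client log timestamps are in UTC, which is what the direct comparison above implies.

```python
from datetime import datetime, timezone


def completion_lag_seconds(log_timestamp: str, end_time_ms: int) -> float:
    """Seconds between a job's recorded end time and the GridMix log line reporting it."""
    # GridMix client log timestamps look like "12/02/07 08:32:26" (yy/MM/dd HH:mm:ss).
    # Assumption: the client log is written in UTC.
    reported = datetime.strptime(log_timestamp, "%y/%m/%d %H:%M:%S").replace(
        tzinfo=timezone.utc
    )
    # The recorded end time is epoch milliseconds.
    finished = datetime.fromtimestamp(end_time_ms / 1000, tz=timezone.utc)
    return (reported - finished).total_seconds()


# (GridMix log timestamp, end time in ms) for the two jobs discussed above.
observations = {
    "job_1328600848949_1182": ("12/02/07 08:32:26", 1328602818684),
    "job_1328600848949_1162": ("12/02/07 08:32:08", 1328603121882),
}

for job_id, (logged_at, end_ms) in observations.items():
    lag = completion_lag_seconds(logged_at, end_ms)
    print(f"{job_id}: reported {lag / 60:.1f} min after completion")
```

Running the same calculation over every job in the run, rather than only these two, would show whether the delay is constant or grows as the run progresses.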