Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.5.0
    • Fix Version/s: 0.7.0
    • Component/s: new service
    • Labels:

      Description

      It would be nice if we could add Ganglia as a service and automatically configure gmond for the active roles. This should allow users to see how the cluster is performing.

      1. WHIRR-258.patch
        40 kB
        Karel Vervaeke
      2. WHIRR-258.patch
        43 kB
        Karel Vervaeke
      3. WHIRR-258.patch
        46 kB
        Karel Vervaeke
      4. WHIRR-258.patch
        45 kB
        Andrei Savu

        Activity

        Andrei Savu created issue -
        Lars George added a comment -

        Great idea. I meant to create an issue for this earlier, since the older HBase EC2 scripts also enabled Ganglia, but I wasn't sure where to put it. Makes total sense!

        Tom White added a comment -

        The Python scripts have some Ganglia code you might be able to resurrect.

        Tom White made changes -
        Field Original Value New Value
        Component/s new service [ 12314368 ]
        Karel Vervaeke added a comment -

        Basic ganglia support

        Karel Vervaeke made changes -
        Attachment WHIRR-258.patch [ 12489977 ]
        Karel Vervaeke added a comment -

        Commit notes for the patch:

        • Only a 1-deep hierarchy is supported (n monitors all report to one metad).
        • The metad is also part of the monitored cluster (i.e. it reports to itself).
        • The web frontend is installed on the metad node.
        • Only one instance can have the ganglia-metad role. Things will probably (certainly) break if there is more than one metad.
        • No real service tests yet (should we do an HTTP GET on the web frontend?).
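
The single-metad restriction in the notes above could be checked before a cluster is launched. The following is a minimal sketch, not code from the patch. It assumes the cluster layout is given in the usual `whirr.instance-templates` form (`<count> <role>+<role>,...`); the helper names `count_role` and `check_single_metad` and the example template string are hypothetical.

```python
def count_role(instance_templates: str, role: str) -> int:
    """Count how many instances carry `role` in a template string
    such as "1 ganglia-metad+ganglia-monitor,3 ganglia-monitor"."""
    total = 0
    for template in instance_templates.split(","):
        template = template.strip()
        if not template:
            continue
        # Each template is "<count> <role>+<role>+..."
        count_text, roles_text = template.split(None, 1)
        roles = [r.strip() for r in roles_text.split("+")]
        if role in roles:
            total += int(count_text)
    return total


def check_single_metad(instance_templates: str) -> None:
    """Raise ValueError unless exactly one instance has the ganglia-metad role."""
    metads = count_role(instance_templates, "ganglia-metad")
    if metads != 1:
        raise ValueError(
            f"expected exactly 1 ganglia-metad instance, found {metads}"
        )


# Example layout (an assumption for illustration, not taken from the patch).
check_single_metad("1 ganglia-metad+ganglia-monitor,3 ganglia-monitor")
```

Failing early like this gives a clearer error than letting several metads come up and break at runtime.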
        Andrei Savu added a comment -

        Nice work Karel!

        > No real service tests yet (should do HTTP get on the webfrontend?)

        Making a simple request just to check that the web frontend is running sounds like a good idea. To check Ganglia itself, you could do some simple asserts on the XML output generated by gmetad and gmond (e.g. http://code.google.com/p/embeddedgmetric/wiki/ParsingGmond).
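
As an illustration of such asserts, here is a minimal sketch in Python; it is not the test code from the patch. It assumes the usual gmond layout of `HOST` elements containing `METRIC` elements, each with a `NAME` attribute. The trimmed sample document, the host names, and the function name `hosts_and_metrics` are made up for the example.

```python
import xml.etree.ElementTree as ET

# Hand-written, heavily trimmed sample; real gmond output carries many
# more attributes and metrics per host.
SAMPLE = """<GANGLIA_XML VERSION="3.1.7" SOURCE="gmond">
  <CLUSTER NAME="example-cluster">
    <HOST NAME="node-1" IP="10.0.0.1">
      <METRIC NAME="load_one" VAL="0.10" TYPE="float"/>
    </HOST>
    <HOST NAME="node-2" IP="10.0.0.2">
      <METRIC NAME="load_one" VAL="0.25" TYPE="float"/>
      <METRIC NAME="cpu_idle" VAL="97.1" TYPE="float"/>
    </HOST>
  </CLUSTER>
</GANGLIA_XML>"""


def hosts_and_metrics(xml_text):
    """Map each reported host name to the set of metric names it reports."""
    root = ET.fromstring(xml_text)
    result = {}
    for host in root.iter("HOST"):
        result[host.get("NAME")] = {m.get("NAME") for m in host.iter("METRIC")}
    return result


report = hosts_and_metrics(SAMPLE)
# Simple asserts of the kind suggested above: every expected node shows
# up, and each one reports at least one metric.
assert set(report) == {"node-1", "node-2"}
assert all(metrics for metrics in report.values())
```

Checking host names rather than metric values keeps the test stable, since the values change from one poll to the next.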

        Karel Vervaeke added a comment -

        Thanks. I wasn't going to check the XML because the service is bound to localhost (or was it the internal IP?), but it's easy enough to do it over SSH.

        Karel Vervaeke added a comment -

        Updated patch - includes a test case for the Ganglia web frontend and the Ganglia XML.
        I didn't know how to SSH to the hosts to get the Ganglia XML, so I just opened up the port via the firewall rules. Perhaps we could expose 'getCompute' in the TestClusterController class?
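
With the port reachable, fetching the XML needs nothing more than a TCP connection: gmond writes its XML dump to any client that connects to its TCP accept channel (port 8649 by default) and then closes the connection. The following is a minimal sketch of that read, not the code from the patch; `read_gmond_dump` is a hypothetical helper name.

```python
import socket


def read_gmond_dump(host, port=8649, timeout=5.0):
    """Connect to a gmond TCP accept channel and return the XML it sends.

    gmond sends the whole document and then closes the connection, so
    we simply read until EOF.
    """
    chunks = []
    with socket.create_connection((host, port), timeout=timeout) as conn:
        while True:
            data = conn.recv(4096)
            if not data:  # empty read means the peer closed the socket
                break
            chunks.append(data)
    return b"".join(chunks).decode("utf-8", errors="replace")
```

The returned string can then be handed to an XML parser for the asserts. The timeout keeps a test from hanging if the firewall rule is missing.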

        Karel Vervaeke made changes -
        Attachment WHIRR-258.patch [ 12490240 ]
        Karel Vervaeke added a comment -

        Another patch. I'm now testing all gmond services, so both the ganglia-metad and ganglia-monitor services are tested.

        Karel Vervaeke made changes -
        Attachment WHIRR-258.patch [ 12490938 ]
        Karel Vervaeke added a comment -

        This patch adds ganglia-metad and ganglia-monitor roles.
        You can have only 1 ganglia-metad instance.

        Karel Vervaeke made changes -
        Status Open [ 1 ] Patch Available [ 10002 ]
        Affects Version/s 0.5.0 [ 12316248 ]
        Labels ganglia service
        Fix Version/s 0.7.0 [ 12317571 ]
        Tom White added a comment -

        +1

        Looks like there are some spurious changes in recipes/nodes-byon.yaml which should probably be removed.

        Andrei Savu made changes -
        Assignee Karel Vervaeke [ karel1980 ]
        Karel Vervaeke added a comment -

        Just to be sure: should I clean up the patch, or will you drop the changes to recipes/nodes-byon.yaml?

        Andrei Savu added a comment -

        Updated patch just for the record.

        Andrei Savu made changes -
        Attachment WHIRR-258.patch [ 12491320 ]
        Andrei Savu added a comment -

        I've just committed this. Thanks Karel! Integration tests pass on ec2 & cloudservers-us.

        Andrei Savu made changes -
        Status Patch Available [ 10002 ] Resolved [ 5 ]
        Resolution Fixed [ 1 ]

          People

          • Assignee:
            Karel Vervaeke
            Reporter:
            Andrei Savu
          • Votes:
            0
            Watchers:
            0

            Dates

            • Created:
              Updated:
              Resolved:
