Wednesday 30 April 2008

Trouble at Mill

With some unfortunate timing, it looks like the "Axis of Openness" webpages (SourceForge, Slashdot, Freshmeat, ...) have gone for a burton. There seems to be some networking problems with these sites, with web traffic timing out. Assuming traceroute output is valid, the problem appears soon after traffic leaves the Santa Clara location of the Savvis network [dead router(s)?]

This is a pain because we've just done the v0.10 release of MonAMI and both the website and the file download locations are hosted by SourceForge. Whilst SourceForge is down, no one can download MonAMI!

If you're keen to try MonAMI, in the mean-time, you can download the RPMs from the (rough and ready) dev. site:
http://monami.scotgrid.ac.uk/

The above site is generously hosted by the ScotGrid project [their blog].

Thanks guys!

Monday 28 April 2008

Version 0.10 has left the building

After many months of work, v0.10 has been tagged and source-/binary-RPMs and tar-balls are available.

This is a major release with many enhancements to MonAMI. Perhaps the two improvements that top the list are:
  • adaptive monitoring,
  • writing monitoring data into a database.
Some other note-worthy changes include:
  • New plugins:
  • Updates to existing plugins:
    • maui
      • support for QoS (a maui term) monitoring added,
      • added a timeout option (maui can take ages to reply sometimes).
    • Torque
      • better error handling (the library has a somewhat amusing way reporting problems),
      • enforce thread-safety (some torque library API isn't),
    • Ganglia
      • fixed gmond.conf parser,
      • transmission now less bursty (reduces likelihood of overloading gmond)
      • unicast support: sending data to just the one gmond, support for multiple gmonds (for failover in unicast deployments) pencilled in for the next release.
    • null
      • adjustable time delay (useful when playing with adaptive monitoring)
    • MySQL
      • added per-Table monitoring statistics (also can now act as a reporting plugin).
  • Other changes:
    • Added the "MonAMI by Example" tutorial (has been available from the web for a while)
    • MonAMI-core will use the recent history of a monitoring target's response time when estimating how long it future requests will take. This uses quite a nice algorithm, which responds quickly to a service suddenly taking a longer time to respond, but isn't fooled if a service responds very quickly.
    • Added per-Thread CPU profiling. This is so, if someone says "MonAMI is consuming vast amounts of CPU" we can figure out why.
    • Spring-clean of user-guide and tutorial: lots of effort has gone into this, mostly in ensuring a consistency in the typesetting. The document should look a lot nicer now and hopefully be easier to read.
You can download MonAMI from the SourceForge page:
http://sourceforge.net/project/showfiles.php?group_id=151885
or configure your YUM to download it automatically. Details are available here:
http://monami.scotgrid.ac.uk/

Enjoy!