Monitoring Opportunities

  1. (Central) Disconnecting the reporter from the problem: machine A can report a problem involving machine B, and any notification etc. will be issued for machine B.

    This will enable, e.g., a central web monitor, or intelligent reporting of an NFS server outage.

  2. (Central) Configurable reporting
  3. (Central) Configurable contacts, like pilot, but with plugins.
  4. (Central) Monitoring tools that work from home/everywhere (even for systems on p172)
  5. (Central) Event correlation
  6. (Central) Tie-in between the 'dashboard' display of events and the current ownership/status of an event (who's working on it, page sent and waiting for a call back, reviewed and downgraded, etc.)
  7. (Central) Authentication/authorization for who can see/change what.
  8. (Global) Built-in diagnostics, i.e. an app that produces a web page can embed explicit diagnostic comments instead of a probe parsing the output.
  9. (Local) Retain and report on the collected performance metrics, for capacity planning, trend analysis, and recent problem forensics.
  10. (Local) Proactive diagnostics, e.g. page space low triggering a hunt through the process list.
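
A rough sketch of how items 1 and 5 might fit together, in Python. Everything here is an assumption made for illustration: the Event record, its field names, the host names (clientA, clientB, nfs1) and the problem identifiers are hypothetical and not part of any existing tool. The idea shown is only that each event carries the reporter and the subject separately, so that several clients complaining about the same NFS server collapse into one notification for that server.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    reporter: str  # machine that observed the problem (hypothetical field)
    subject: str   # machine the problem is about (hypothetical field)
    problem: str   # short problem identifier, e.g. "nfs_timeout"


def correlate(events):
    """Group events by (subject, problem) and collect the distinct reporters."""
    grouped = defaultdict(set)
    for ev in events:
        grouped[(ev.subject, ev.problem)].add(ev.reporter)
    return grouped


def notifications(events):
    """Return one notification line per (subject, problem), in sorted order."""
    lines = []
    for (subject, problem), reporters in sorted(correlate(events).items()):
        lines.append(
            f"{subject}: {problem} (reported by {len(reporters)} host(s): "
            f"{', '.join(sorted(reporters))})"
        )
    return lines


if __name__ == "__main__":
    sample = [
        Event("clientA", "nfs1", "nfs_timeout"),
        Event("clientB", "nfs1", "nfs_timeout"),
        Event("clientA", "clientA", "page_space_low"),
    ]
    for line in notifications(sample):
        print(line)
```

With the sample data, the two NFS timeouts reported by clientA and clientB end up as a single notification against nfs1, while the page space problem stays with clientA. Real correlation would presumably also need time windows and dependency information, which this sketch leaves out.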