mon·i·tor·ing (transitive verb)
Monitoring is two things:
- Trend analysis for long-term capacity planning and postmortem analysis
- Fault detection or alerting
Without monitoring you don't know what your system is doing, which means you can't fix it and
you can't improve it. Feedback loops require data. Monitoring systems to collect metrics is
systems administration 101. However, systems are more complicated, there are more metrics—and correlation
is needed to troubleshoot problems or plan for growth. But unfortunately, as the
problem has gotten bigger, existing tools haven't gotten better. Anyone managing hundreds of machines has
felt the intense pain of collecting, correlating and making good sense of the information across
all of them. Circonus is a large-scale monitoring and trend analysis system designed to nip
these problems in the bud.
Collecting simple metrics like CPU, memory and disk utilization are obvious basics. Network
switch ports and host interfaces are commonly polled for traffic metrics. Applications expose
their own key metrics that can be trended: space, users, connections, requests or transactions
processed. A good administrator goes beyond these systemic metrics and starts looking into
business-level metrics like registered users, widgets sold, and credit card transaction volumes.
The sheer quantity of information is overwhelming and traditional RRDTool-based systems bust
at the seams. At Circonus, we’ve seen this break down, and we created a system that eliminates
this pain. Circonus is designed from the ground up to perform at scale.