I'm looking for good books on the art of monitoring computer
systems, covering topics such as correlation, alerting,
handling alarms from various sources, aggregation, root cause
analysis and the like. Any hints?

--
Ståle Johansen