[Top] [All Lists]

[nv-l] question: generating alerts for iferrors, discards, and utilizati

To: nv-l@lists.tivoli.com
Subject: [nv-l] question: generating alerts for iferrors, discards, and utilization
From: dmahler@telcordia.com
Date: Wed, 12 Feb 2003 11:13:12 -0500
Delivered-to: mailing list nv-l@lists.tivoli.com
Delivery-date: Wed, 12 Feb 2003 16:15:53 +0000
Envelope-to: nv-l-archive@lists.skills-1st.co.uk
In-reply-to: <OFCE7FCDE9.3B0D0E71-ON85256CC9.0001D7E0@LocalDomain>
List-help: <mailto:nv-l-help@lists.tivoli.com>
List-post: <mailto:nv-l@lists.tivoli.com>
List-subscribe: <mailto:nv-l-subscribe@lists.tivoli.com>
List-unsubscribe: <mailto:nv-l-unsubscribe@lists.tivoli.com>
Mailing-list: contact nv-l-help@lists.tivoli.com; run by ezmlm


We have been monitoring interface utilization, discards, and errors  for years now, generating alerts into Tivoli TEC from netview when they go over threshold.  We also track and graph them via mrtg/rrdtool.  

Recently, I have been having an internal debate as to the merits of this strategy.  I believe that it is useful to track all three and alert on them if they are over threshold.  Others think that only misbehaving links are of interest (errors/discards), and utilization does not matter (is not actionable) unless the link is "broken/impaired".   (I suppose it gets into how deeply one wants to react to possibly service affecting conditions)

we check in /out snmp variables every 10 min and alert as follows:

If% Discards        >25 % of inbound packets discarded  
If% Errors        >20 % of inbound packets with errors
If% Util        >95 % of packets received / bandwidth

Q: I was wondering what other people do for interface performance alerting?     do they focus mostly on interface up/down?  or node up/down?  
if you are polling and thresholding, what values are you using?   when do you consider a line to be sufficiently impaired that it time to call the carrier?

any comments appreciated


Don Mahler
Enterprise Management
<Prev in Thread] Current Thread [Next in Thread>
  • [nv-l] question: generating alerts for iferrors, discards, and utilization, dmahler <=

Archive operated by Skills 1st Ltd

See also: The NetView Web