To: | <nv-l@lists.us.ibm.com> |
---|---|
Subject: | RE: [nv-l] nvtecia still hanging or falling behind processing TEC_ITS.rs |
From: | "Van Order, Drew \(US - Hermitage\)" <dvanorder@deloitte.com> |
Date: | Tue, 14 Sep 2004 11:09:07 -0500 |
Delivery-date: | Tue, 14 Sep 2004 17:33:50 +0100 |
Envelope-to: | nv-l-archive@lists.skills-1st.co.uk |
Reply-to: | nv-l@lists.us.ibm.com |
Sender: | owner-nv-l@lists.us.ibm.com |
Thread-index: | AcSabqqz06PQhA7PQiGCpAxytBPOsgAAi7Pg |
Thread-topic: | [nv-l] nvtecia still hanging or falling behind processing TEC_ITS.rs |
Thank you for replying,
I
just finished researching the tracing capability you mention James--turning
it on now to see if it's going that far or not. I hope it doesn't impact
performance much as it's something that seems valuable to keep on 100%, much
like logging TEC events. This alone should tell us where the holdup is and cut
troubleshooting in half.
We
have two thresholds for how we react to not seeing the hourly HB in a timely
fashion. If it shows up before the next hourly heartbeat, it is deemed OK
(slowdown), but the Ops folks know they have to start relying more on Event
Browser until things sync back up. If the HB does not appear after the one hour,
we are pretty sure it is hung, my team is paged, and we cycle daemons. We
can't watch any longer than that because it's likely events have been
missed.
The
trap surges aren't anything near the sustained 6-8 traps/sec. that NV is rated
for. What we usually see are surges of traps over an hour, each
trap within 4-5 seconds of each other--usually when an ATM VC is flipping
between up/down. We think this should be easily handled, especially based on our
TEC_ITS load testing performed a few months ago. That testing showed we could
handle
Test Two—trapd and TEC_ITS flood. Success criteria: No degradation in console or command line performance during test. No loss of TEC events, and no more than a 2 minute delay before events are seen at TEC console
Our
stress testing used Interface Down traps, so it was much more strenuous. Cisco
traps don't even get to netview.rls correlation, they just use TEC_ITS to get to
TEC. It's bizarre how we could hammer it for hours with internal NV traps
but fewer external Cisco traps seem to choke it. The NV ruleset node used for
Cisco traps appears fine when you look at it.
Thanks again--Drew
|
<Prev in Thread] | Current Thread | [Next in Thread> |
---|---|---|
|
Previous by Date: | Re: [nv-l] nvtecia still hanging or falling behind processing TEC_ITS.rs, James Shanks |
---|---|
Next by Date: | [nv-l] Virtual Cluster Node, CMazon |
Previous by Thread: | Re: [nv-l] nvtecia still hanging or falling behind processing TEC_ITS.rs, James Shanks |
Next by Thread: | RE: [nv-l] nvtecia still hanging or falling behind processing TEC_ITS.rs, Van Order, Drew \(US - Hermitage\) |
Indexes: | [Date] [Thread] [Top] [All Lists] |
Archive operated by Skills 1st Ltd
See also: The NetView Web