James,
Thanks a lot for the
input. First, I checked for the ovwdb and you were right. The number in cache
was 10000 and the objects number was 12000. Therefore, I changed the number to
15000. Now, regarding the graph I changed the collection time to 10 min but
still it does not seem to work. What I have noticed in the log is maybe what
you are calling gaps. For many routers it says is not up.
Any other input regarding
the graph problem will be appreciated.
We are planning to
upgrade the version of Netview to the latest.
Thanks again,
Regards,
Ioannis
-----Original
Message-----
From: owner-nv-l@lists.us.ibm.com
[mailto:owner-nv-l@lists.us.ibm.com] On
Behalf Of James Shanks
Sent: Wednesday, April 28, 2004
4:16 PM
To: nv-l@lists.us.ibm.com
Subject: RE: [nv-l] graph and
daemon problems
OVs_NON_WELL_BEHAVED is not an error condition
but an indication that these daemons can stay up independently of the core
daemons like ovwdb, pmd, and trapd. The ones you cite will always be
OVs_NON_WELL_BEHAVED, as will others like nvpagerd.
You high ovwdb CPU rate on the other hand is not good.
Is your cache size properly adjusted? Do "ovobjprint -S"
and see what you object count is. Then make certain that your cache size
on ovwdb is at least 20% greater. Otherwise the daemon will spend a lot
of time allocating and reallocating storage to hold your object database in
memory.
And you really should plan to move to a supported level of
the code. 7.1.1 has not been supported ever since 7.1.3 came out over two
years ago. The current maintenance level on 7.1.3 is FixPack2, which
means that you are 5 version of the code downlevel so far. FixPack3 for
7.1.3 is in the works now.
But none of these things has anything to do with your
problem. graphing is done by xnmgraph on data collected by snmpCollect.
I am not an expert in this area but I do know that data will
not graph if there are gaps in the data, and I'll bet that if you look in the
snmpCollect log you will find lots of errors, producing those gaps.
Collections get deferred if an error occurs, which produces those gaps.
Offhand, I would guess that you cannot successfully poll 200 routers for
data in 3 minutes. If you quadruple the number of devices to collect
from, I'd suspect you have to more than quadruple the collection interval.
Anybody else?
James Shanks
Level 3 Support for Tivoli NetView for UNIX and Windows
Tivoli Software / IBM Software Group
"Ioannis Yerou"
<ioannisi@ieee.org>
Sent
by: owner-nv-l@lists.us.ibm.com
04/28/2004 08:11 AM
|
To
|
<nv-l@lists.us.ibm.com>
|
cc
|
|
Subject
|
RE: [nv-l] graph and deamon problems
|
|
Hi to all,
It has been some days now that the grapher stopped to display
the BandwidthUtilIn and BandwidthUtilOut (from MIB Data Collection) for the
interfaces of the routers selected. Even though I can see that the data is
collected for all the routers. I have changed the time interval from 1 min to 3
min but no effect. I should also mention that I was collecting data for about
50 routers without problems and the last month that number was increased to
200. I noticed also the following things which I don’t know if they are
related. Two deamons trapgend and mgragentd even though running are
OVs_NON_WELL_BEHAVED. The last thing is that the process running the ovwdb
takes about 99 to 100% of CPU continuously.
Any Ideas?
Netview
7.1.1
AIX
4.3
ARN
Nortel Routers
Regards,
Ioannis