|
|
| Author |
Message |
mlnospam
Joined: 27 Mar 2006 Posts: 3
|
Posted: Mon Mar 27, 2006 2:09 pm Post subject: ucd/net - CPU Usage - User -> NaN! |
|
|
Hello,
I am trying to get the CPU Usage graphed of various hosts on our network, unforutunately everything works fine except for the "User" value, user just gets NaN as result. So the user parameter doesn't get graphed but System and Nice are OK. Now to be sure it's not a problem with the server itself I tryed this with other servers running other version of net-snmp but still the same problem.
I am using the latest version of Cacti and RRDTool on Solaris 10 (SPARC), that's our monitoring host.
Does anyone have an idea what's the problem here ? Any help would be greatly appreciated.
Many thanks in advance
Best regards
PS: I have attached a sample graph so that you can see how it looks like with the missing "user" value.
| Description: |
|
| Filesize: |
31.74 KB |
| Viewed: |
7603 Time(s) |

|
|
|
| Back to top |
|
 |
gandalf Developer
Joined: 02 Dec 2004 Posts: 17009 Location: Muenster, Germany
|
Posted: Tue Mar 28, 2006 12:30 pm Post subject: |
|
|
Please check | Code: | snmpwalk -c xxxx -v 1 <target host> .1.3.6.1.4.1.2021.11
UCD-SNMP-MIB::ssIndex.0 = INTEGER: 1
UCD-SNMP-MIB::ssErrorName.0 = STRING: systemStats
UCD-SNMP-MIB::ssSwapIn.0 = INTEGER: 0
UCD-SNMP-MIB::ssSwapOut.0 = INTEGER: 0
UCD-SNMP-MIB::ssIOSent.0 = INTEGER: 3
UCD-SNMP-MIB::ssIOReceive.0 = INTEGER: 5
UCD-SNMP-MIB::ssSysInterrupts.0 = INTEGER: 4
UCD-SNMP-MIB::ssSysContext.0 = INTEGER: 4
UCD-SNMP-MIB::ssCpuUser.0 = INTEGER: 5
UCD-SNMP-MIB::ssCpuSystem.0 = INTEGER: 2
UCD-SNMP-MIB::ssCpuIdle.0 = INTEGER: 91
UCD-SNMP-MIB::ssCpuRawUser.0 = Counter32: 49101958
UCD-SNMP-MIB::ssCpuRawNice.0 = Counter32: 0
UCD-SNMP-MIB::ssCpuRawSystem.0 = Counter32: 21404990
UCD-SNMP-MIB::ssCpuRawIdle.0 = Counter32: 749970670
UCD-SNMP-MIB::ssCpuRawWait.0 = Counter32: 55037420
UCD-SNMP-MIB::ssCpuRawKernel.0 = Counter32: 21220636
UCD-SNMP-MIB::ssCpuRawInterrupt.0 = Counter32: 184354
UCD-SNMP-MIB::ssIORawSent.0 = Counter32: 300669432
UCD-SNMP-MIB::ssIORawReceived.0 = Counter32: 2145935664
UCD-SNMP-MIB::ssRawInterrupts.0 = Counter32: 2697872243
UCD-SNMP-MIB::ssRawContexts.0 = Counter32: 1533510369
UCD-SNMP-MIB::ssCpuRawSoftIRQ.0 = Counter32: 0
UCD-SNMP-MIB::ssRawSwapIn.0 = Counter32: 28
UCD-SNMP-MIB::ssRawSwapOut.0 = Counter32: 52 | to verify, that this host responds to the wanted OID UCD-SNMP-MIB::ssCpuRawUser.0
Reinhard
|
|
| Back to top |
|
 |
mlnospam
Joined: 27 Mar 2006 Posts: 3
|
Posted: Thu Mar 30, 2006 3:21 am Post subject: |
|
|
| Well it does respond yes. I think the problem is maybe due to the fact that this server uses multiple Pentium IV Xeon CPUs and maybe the value is too high because it reflects the values of not only one CPU but 2 or more. What do you guys think ?
|
|
| Back to top |
|
 |
gandalf Developer
Joined: 02 Dec 2004 Posts: 17009 Location: Muenster, Germany
|
Posted: Thu Mar 30, 2006 10:54 am Post subject: |
|
|
You may be correct. Check MAXIMUM values of the corresponding data source of the Data Template. If you plan to change this, remember that existing rrd files must be rrdtool tuned for this, too
Reinhard
|
|
| Back to top |
|
 |
NoogiE
Joined: 12 Jun 2006 Posts: 2
|
Posted: Mon Jun 12, 2006 5:49 pm Post subject: |
|
|
I've been experiencing these same problems on a dual Xeon machine.
Mlnospam - I think you are probably right in saying that a dual cpu machine can have a load up to 200% - which will break the graphing. Currently a server I manage is reporting 150% user cpu load.
To fix I changed the maximum calue of the data template, as lvm suggested, and also rrdtool tune'd the existing graph.
Graphs are all working again now and have done so for the last few weeks. I'm not sure how hyperthreading could affect these values, but I am yet to see load above 200%
Perhaps the associated data template should be adjusted in future cacti releases? (I dont see the problem with having a higher maxium cpu figure in the data template).
Greg.
|
|
| Back to top |
|
 |
gandalf Developer
Joined: 02 Dec 2004 Posts: 17009 Location: Muenster, Germany
|
Posted: Tue Jun 13, 2006 3:06 am Post subject: |
|
|
All of you may want to follow the discussion/FAQ on CPU reporting at net-snmp.org. But be prepared to get frustrated
Reinhard
|
|
| Back to top |
|
 |
foobie
Joined: 27 Jun 2006 Posts: 3
|
Posted: Mon Jul 17, 2006 7:47 am Post subject: |
|
|
Thanks for this discussion - my cpu graphs were looking really odd before
The commands I used, in case anyone is stuck:
| Code: |
rrdtool tune HOST_cpu_nice_46.rrd --maximum cpu_nice:200
rrdtool tune HOST_cpu_system_47.rrd --maximum cpu_system:200
rrdtool tune HOST_cpu_user_48.rrd --maximum cpu_user:200
|
obviously your HOST and the numbers will differ...
|
|
| Back to top |
|
 |
timvandijk
Joined: 20 Jul 2006 Posts: 3
|
Posted: Thu Jul 20, 2006 9:37 am Post subject: |
|
|
Hi,
I experience the same problem with regards to the cpu raw user graph. We have a two XEON processor system with Hyperthreading disabled. Suddenly the user portion of the graph disappeard with no obvious reason. The maximum value in de datasource was set automatic. We've had however a performance problem during the period that the user portion of the graph disappeared.
I'm absolutly sure that there must have been cpu user activity. Does anyone have an idea about this strange behaviour?
Kindest regards,
Tim
| Description: |
| Example of CactiGraph with missing cpu user graph |
|
| Filesize: |
8.66 KB |
| Viewed: |
6625 Time(s) |

|
|
|
| Back to top |
|
 |
gandalf Developer
Joined: 02 Dec 2004 Posts: 17009 Location: Muenster, Germany
|
Posted: Thu Jul 20, 2006 10:10 am Post subject: |
|
|
I'm quite sure this is due to clipping of high user cpu values. Perform a rrdtool info on that rrd file and see sth similar to
| Quote: | rrdtool info gandalf_cpu_user_10.rrd |more
filename = "gandalf_cpu_user_10.rrd"
rrd_version = "0001"
step = 300
last_update = 1153321503
ds[cpu_user].type = "COUNTER"
ds[cpu_user].minimal_heartbeat = 600
ds[cpu_user].min = 0.0000000000e+00
ds[cpu_user].max = 1.0000000000e+02
ds[cpu_user].last_ds = "268239"
ds[cpu_user].value = 4.1720000000e+01
ds[cpu_user].unknown_sec = 0
| With the max value set to 100 (=default). This may clip off data in multi-cpu systems. Change it using rrdtool tune
Reinhard
|
|
| Back to top |
|
 |
timvandijk
Joined: 20 Jul 2006 Posts: 3
|
Posted: Fri Jul 21, 2006 2:24 am Post subject: |
|
|
Hi Ivm,
The value for cpu_user (and of course cpu_nice en cpu_system) were indead set to a maximum of 100%:
ds[cpu_user].type = "COUNTER"
ds[cpu_user].minimal_heartbeat = 600
ds[cpu_user].min = 0.0000000000e+00
ds[cpu_user].max = 1.0000000000e+02
ds[cpu_user].last_ds = "30306547"
ds[cpu_user].value = 4.9091000000e+02
ds[cpu_user].unknown_sec = 0
I've changed this value using rrdtool tune. Fortunately the performance problems have been solved so I don't expect to see any abnormalities in the near future. I'll just have to wait and see.
Thanks for your input.
Kindest regards,
Tim
|
|
| Back to top |
|
 |
timvandijk
Joined: 20 Jul 2006 Posts: 3
|
Posted: Fri Jul 21, 2006 4:20 am Post subject: |
|
|
Hi Ivm,
Earlier than I expected I can now report the results are great. I now can see the "true" performance graphs. On the other hand I'll need to do a thourough perfomance analysis because I suspect a structural performance problem.
Thanks again!
Tim
|
|
| Back to top |
|
 |
pheezy Cacti User
Joined: 26 Oct 2006 Posts: 59
|
Posted: Mon Jan 15, 2007 5:07 pm Post subject: |
|
|
I'm having the same problem, however when I tuned the rrd, the following is now reported, and even setting it back 100 causes the same thing.
| Quote: | | ds[cpu_user].max = NaN |
|
|
| Back to top |
|
 |
gandalf Developer
Joined: 02 Dec 2004 Posts: 17009 Location: Muenster, Germany
|
Posted: Wed Jan 17, 2007 12:52 pm Post subject: |
|
|
Please post the exact command you've run to set it back to 100 and the resulting rrdtool info
Reinhard
|
|
| Back to top |
|
 |
|