Dell Poweredge Temperature Templates

Brettw
Posts: 26
Joined: Fri Mar 18, 2005 12:25 am

#1 Post by Brettw » Mon Jan 02, 2006 7:39 pm

Hi All,

Below are some temperature templates for Dell PowerEdge servers. They work on most servers running the latest OpenManage installation.

Cheers

Brett
Attachments
cacti_graph_template_dell_poweredge_temperature with data dependancies.zip
(3.86 KiB) Downloaded 4374 times
Dell Poweredge separate files.zip
(9.44 KiB) Downloaded 4969 times
temp.JPG
temp.JPG (33.92 KiB) Viewed 51652 times

chewymix
Posts: 10
Joined: Fri May 27, 2005 10:41 am

#2 Post by chewymix » Wed Feb 22, 2006 11:08 am

These are great. I don't suppose you have the same sort of template available for fans, do you?

Thanks

Brettw
Posts: 26
Joined: Fri Mar 18, 2005 12:25 am

#3 Post by Brettw » Sun Feb 26, 2006 9:58 pm


Discogalaxy
Posts: 20
Joined: Sat Feb 25, 2006 4:51 am
Location: Germany

#4 Post by Discogalaxy » Thu Mar 09, 2006 6:36 am

nice work!

thank you

tman
Cacti User
Posts: 97
Joined: Thu Oct 14, 2004 4:14 pm

#5 Post by tman » Fri Mar 10, 2006 10:39 am

You'll have to be careful with these, as the OIDs on Dell servers change depending on the server type, spec, etc.

For example, an OID in one PE server that references a CPU temp might reference the backplane temp in another.
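
One way to guard against this is to resolve the probe by its name rather than by a fixed index. The sketch below is only an illustration: the two name tables are hypothetical (they are not taken from any real server), and `index_for` is a made-up helper name.

```python
# Hypothetical probe-name tables for two servers, keyed by SNMP index.
# The same index points at different sensors on each machine.
SERVER_A = {"1.1": "CPU1 Temp", "1.2": "BP Temp"}
SERVER_B = {"1.1": "BP Temp", "1.2": "CPU1 Temp"}


def index_for(names, wanted):
    """Return the index whose probe name equals `wanted`.

    Raises LookupError if the name is absent or ambiguous, so a graph is
    never silently attached to the wrong sensor.
    """
    matches = [index for index, name in names.items() if name == wanted]
    if len(matches) != 1:
        raise LookupError("expected one probe named %r, found %d"
                          % (wanted, len(matches)))
    return matches[0]


print(index_for(SERVER_A, "CPU1 Temp"))  # prints 1.1
print(index_for(SERVER_B, "CPU1 Temp"))  # prints 1.2
```

An indexed data query works along the same lines: the graph follows the name, not the position.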

adrianmarsh
Cacti User
Posts: 435
Joined: Wed Aug 17, 2005 8:51 am
Location: UK

#6 Post by adrianmarsh » Sat Nov 04, 2006 1:42 pm

I had started to put together an SNMP index for Dell temps, but I've hit a snag.

Attached are the "in progress" results (be warned!)

They simply take an SNMP index result and use the name provided by the name OID. Unfortunately, this hits a snag when reading the CPU minimum temps.

Dell have decided not to give out CPU minimum values, so when I walk the tree I get:

# snmpwalk -On -v1 -cpublic serv1 .1.3.6.1.4.1.674.10892.1.700.20.1.12
.1.3.6.1.4.1.674.10892.1.700.20.1.12.1.2 = INTEGER: 80
.1.3.6.1.4.1.674.10892.1.700.20.1.12.1.3 = INTEGER: 80
.1.3.6.1.4.1.674.10892.1.700.20.1.12.1.4 = INTEGER: 80
.1.3.6.1.4.1.674.10892.1.700.20.1.12.1.5 = INTEGER: 80

In my example, the CPU is ...12.1.1, which the OpenManage GUI shows as N/A.
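
To spot gaps like this without reading the walk by eye, a short script can compare the indexes that came back against the ones you expect. This is a minimal sketch: the walk text is copied from the output above, the expected range 1.1 to 1.5 is an assumption based on this one host, and `parse_walk` and `missing_indexes` are made-up helper names.

```python
import re

# Walk output copied from the post above (one threshold column of the
# Dell temperature probe table).
WALK = """\
.1.3.6.1.4.1.674.10892.1.700.20.1.12.1.2 = INTEGER: 80
.1.3.6.1.4.1.674.10892.1.700.20.1.12.1.3 = INTEGER: 80
.1.3.6.1.4.1.674.10892.1.700.20.1.12.1.4 = INTEGER: 80
.1.3.6.1.4.1.674.10892.1.700.20.1.12.1.5 = INTEGER: 80
"""
COLUMN = ".1.3.6.1.4.1.674.10892.1.700.20.1.12"
LINE = re.compile(r"^(\S+) = INTEGER: (-?\d+)$")


def parse_walk(text, column):
    """Return {index: value} for every INTEGER row under `column`."""
    rows = {}
    prefix = column + "."
    for line in text.splitlines():
        match = LINE.match(line.strip())
        if match and match.group(1).startswith(prefix):
            rows[match.group(1)[len(prefix):]] = int(match.group(2))
    return rows


def missing_indexes(rows, expected):
    """Return the expected indexes that the agent did not return."""
    return [index for index in expected if index not in rows]


rows = parse_walk(WALK, COLUMN)
# Assumption: probes 1.1 to 1.5 exist on this host (1.1 is the CPU).
expected = ["1.%d" % n for n in range(1, 6)]
print(missing_indexes(rows, expected))  # prints ['1.1']
```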

The missing value seems to cause Cacti to think there's an SNMP timeout, and it then marks the whole host as offline.

11/04/2006 06:26:37 PM - SYSTEM STATS: Time:95.0280 Method:cactid Processes:1 Threads:1 Hosts:12 HostsPerProcess:12 DataSources:350 RRDsProcessed:174
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[364] WARNING: SNMP timeout detected [500 ms], ignoring host 'ubiq-serv1'
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[363] WARNING: SNMP timeout detected [500 ms], ignoring host 'ubiq-serv1'
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[362] WARNING: SNMP timeout detected [500 ms], ignoring host 'ubiq-serv1'
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[361] WARNING: SNMP timeout detected [500 ms], ignoring host 'ubiq-serv1'
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[360] WARNING: SNMP timeout detected [500 ms], ignoring host 'ubiq-serv1'
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[364] WARNING: SNMP timeout detected [500 ms], ignoring host 'ubiq-serv1'
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[363] WARNING: SNMP timeout detected [500 ms], ignoring host 'ubiq-serv1'
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[362] WARNING: SNMP timeout detected [500 ms], ignoring host 'ubiq-serv1'
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[361] WARNING: SNMP timeout detected [500 ms], ignoring host 'ubiq-serv1'
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[360] WARNING: SNMP timeout detected [500 ms], ignoring host 'ubiq-serv1'
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[364] WARNING: SNMP timeout detected [500 ms], ignoring host 'ubiq-serv1'
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[363] WARNING: Result from SNMP not valid. Partial Result: ...
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[362] WARNING: Result from SNMP not valid. Partial Result: ...
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[361] WARNING: Result from SNMP not valid. Partial Result: ...
11/04/2006 06:26:33 PM - CACTID: Poller[0] Host[4] DS[360] WARNING: Result from SNMP not valid. Partial Result: ...

So the question is: when reading an indexed SNMP query, how can you specify that a missing value isn't a failure?
Attachments
dell_temp.xml
XML file for snmp_queries
(1.81 KiB) Downloaded 1675 times
cacti_data_query_dell_poweredge_temperatures.xml
Data query inc. all elements (GT, DS, CDEF)
(25.75 KiB) Downloaded 1907 times

gandalf
Developer
Posts: 22375
Joined: Thu Dec 02, 2004 2:46 am
Location: Muenster, Germany

#7 Post by gandalf » Mon Nov 13, 2006 1:00 pm

Please post a walk of .1.3.6.1.4.1.674.10892.1.700.20.1
Reinhard

adrianmarsh
Cacti User
Posts: 435
Joined: Wed Aug 17, 2005 8:51 am
Location: UK

#8 Post by adrianmarsh » Wed Nov 15, 2006 8:09 pm

Here you go.. note the missing OIDs...

12.1.1
and
13.1.1
Attachments
walk.txt
(4.13 KiB) Downloaded 1830 times

adrianmarsh
Cacti User
Posts: 435
Joined: Wed Aug 17, 2005 8:51 am
Location: UK

#9 Post by adrianmarsh » Sat Dec 02, 2006 7:42 am

Any ideas??

BSOD2600
Cacti Moderator
Posts: 12160
Joined: Sat May 08, 2004 12:44 pm
Location: USA

#10 Post by BSOD2600 » Sat Dec 02, 2006 3:03 pm

Funny you should be working on Dell templates -- I just started on some this morning ;-).

Look in OpenManage, in the temperatures section. Is the CPU missing any threshold min/max values? If so, that's why they're not present via SNMP.

adrianmarsh
Cacti User
Posts: 435
Joined: Wed Aug 17, 2005 8:51 am
Location: UK

#11 Post by adrianmarsh » Mon Dec 04, 2006 8:04 am

new job... new toys... same old tools...

Yeah, OpenManage lists them as N/A, but the real question is: how can we get Cacti to take this into account?

BSOD2600
Cacti Moderator
Posts: 12160
Joined: Sat May 08, 2004 12:44 pm
Location: USA

#12 Post by BSOD2600 » Mon Dec 04, 2006 12:19 pm

I've noticed that too now. My SNMP data query has all the thresholds, but of course many of them are not present on some devices. Actually, all I notice in my Cacti log is a warning about the SNMP result being invalid, NOT Cacti getting an SNMP timeout.

Have you tried increasing the timeouts for that device? Don't forget about the ones in php.ini too.

I don't think there really is any easy way for Cacti to take this lack of data into account.

adrianmarsh
Cacti User
Posts: 435
Joined: Wed Aug 17, 2005 8:51 am
Location: UK

#13 Post by adrianmarsh » Tue Dec 05, 2006 6:42 pm

One way would be to have an "exception" list per DS, so that by default all DSs are treated as normal, but where you want to ignore certain DSs you could "blacklist" those OIDs. Maybe. I'm not sure how that would work if the indexes shifted.
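
As a rough illustration of the exception-list idea (this is not Cacti code, and nothing in this thread says Cacti offers such an option), the sketch below returns RRDtool's unknown marker `U` for OIDs on an ignore list and treats any other missing OID as a real failure. `sample_for` and the `ignore` set are hypothetical names. The two ignored OIDs are the ones reported missing earlier in the thread.

```python
UNKNOWN = "U"  # RRDtool's notation for an unknown sample


def sample_for(oid, walked, ignore=frozenset()):
    """Return the value to store for `oid` as a string.

    walked: {oid: value} as returned by the agent.
    ignore: OIDs that are known to be absent on this host.
    A missing OID on the ignore list becomes 'U'; any other missing OID
    raises LookupError so genuine failures are still noticed.
    """
    if oid in walked:
        return str(walked[oid])
    if oid in ignore:
        return UNKNOWN
    raise LookupError("no value returned for %s" % oid)


BASE = ".1.3.6.1.4.1.674.10892.1.700.20.1"
walked = {BASE + ".12.1.2": 80}
ignore = {BASE + ".12.1.1", BASE + ".13.1.1"}

print(sample_for(BASE + ".12.1.2", walked, ignore))  # prints 80
print(sample_for(BASE + ".12.1.1", walked, ignore))  # prints U
```

Because the list is keyed by full OID, it would go stale if the indexes shifted, which is exactly the worry raised above.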

adrianmarsh
Cacti User
Posts: 435
Joined: Wed Aug 17, 2005 8:51 am
Location: UK

#14 Post by adrianmarsh » Sat Dec 09, 2006 1:37 pm

Tried increasing the timeout to 2000, but still no difference. I'm now experimenting with yours...

AndrewRobinson
Posts: 13
Joined: Mon Dec 04, 2006 6:18 am

#15 Post by AndrewRobinson » Wed Nov 14, 2007 4:33 pm

Trying to use this on a PowerEdge 2650, latest version of OpenManage, using Cacti 0.8.7, MySQL 5.0.45, PHP 5.2.5, Apache 2.2.6

Getting the following error from RRDTool:

RRDTool Command:

c:/cacti/rrdtool.exe graph - \
--imgformat=PNG \
--start=-86400 \
--end=-300 \
--title="LANDISA - Temperature" \
--base=1000 \
--height=120 \
--width=500 \
--alt-autoscale-max \
--lower-limit=0 \
--vertical-label="Temperature (Celsius)" \
--slope-mode \
--font TITLE:10:c:/windows/fonts/calibri.ttf \
--font AXIS:8:c:/windows/fonts/calibri.ttf \
--font LEGEND:8:c:/windows/fonts/calibri.ttf \
--font UNIT:8:c:/windows/fonts/calibri.ttf \
DEF:a="C\:/htdocs/cacti/rra/landisa_pege_amb_temp_83.rrd":pege_amb_temp:AVERAGE \
DEF:b="C\:/htdocs/cacti/rra/landisa_pege_cpu1_temp_84.rrd":pege_cpu1_temp:AVERAGE \
DEF:c="C\:/htdocs/cacti/rra/landisa_pege_cpu2_temp_85.rrd":pege_cpu2_temp:AVERAGE \
DEF:d="C\:/htdocs/cacti/rra/landisa_pege_riser_temp_87.rrd":pege_riser_temp:AVERAGE \
DEF:e="C\:/htdocs/cacti/rra/landisa_pege_planar_temp_86.rrd":pege_planar_temp:AVERAGE \
LINE1:a#FF0000:"Ambient" \
GPRINT:a:LAST:"Last\:" \
GPRINT:a:AVERAGE:"Average\:" \
GPRINT:a:MAX:"Max\:\n" \
LINE1:b#FF7D00:"CPU1" \
GPRINT:b:LAST:" Last\:" \
GPRINT:b:AVERAGE:"Average\:" \
GPRINT:b:MAX:"Max\:\n" \
LINE1:c#FF5700:"CPU2" \
GPRINT:c:LAST:" Last\:" \
GPRINT:c:AVERAGE:"Average\:" \
GPRINT:c:MAX:"Max\:\n" \
LINE1:d#B90054:"Riser" \
GPRINT:d:LAST:" Last\:" \
GPRINT:d:AVERAGE:"Average\:" \
GPRINT:d:MAX:"Max\:\n" \
LINE1:e#FF5576:"Planar" \
GPRINT:e:LAST:" Last\:" \
GPRINT:e:AVERAGE:"Average\:" \
GPRINT:e:MAX:"Max\:\n"
RRDTool Says:

ERROR: bad format for GPRINT in 'Last:'
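
For context, RRDtool expects each GPRINT format string to contain a floating-point conversion such as `%lf`, and the legend strings in the command above (for example `"Last\:"`) contain none, which is consistent with this error. The thread does not establish why the conversion is missing from the imported template. The sketch below is a simplified approximation of that check, not RRDtool's actual parser, and `has_value_conversion` is a made-up helper name.

```python
import re

# Simplified pattern for a GPRINT value conversion: optional flags, width
# and precision, then "l" followed by e, f or g (for example %8.2lf).
CONVERSION = re.compile(r"%[-+ 0#]*\d*(?:\.\d+)?l[efg]")


def has_value_conversion(fmt):
    """Return True if `fmt` contains a conversion for the printed value."""
    # "%%" is a literal percent sign, so drop it before searching.
    return bool(CONVERSION.search(fmt.replace("%%", "")))


print(has_value_conversion(r"Last\:"))         # prints False
print(has_value_conversion(r"Last\:%8.2lf"))   # prints True
```

Running the legend strings of a failing graph through a check like this shows which GPRINT items still need a format attached in the graph template.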
