Monday 23 July 2012

Zenoss Filesystems Monitoring

Have came across many zenoss users asking about changing the default thresholds for filesystem, disk utilisation. In many cases, the issue is about the fact that the default template calculates percentage utilisation and and a threshold like (95%) may not be appropriate for large (hundreds of terabytes) filesystems.

There are a couple of things that can be done.
1. Modify the default template
2. Use a transform in /Events/Perf/Filesystem that calculate actual figures, then decide the fate of events based on these figures (you can obtain such a transform here)

If the first approach above sounds good, I may be able to upload a zenpack which adds a second graph on the template, and also contains a modified copy of FileSystem.py

Thursday 5 July 2012

GPFS Highly Available (HA) SNMP monitoring configuration

The available/documented GPFS SNMP implementation (By IBM) is not designed to be highly available.
There is only one SNMP collector node at a time. If that node fails the SNMP monitoring does not fail over to any other node and thus complete loss of cluster monitoring/reporting.

This blog post offers a simple implementation of gpfs snmp monitoring failover. The failover scheme uses a callback mechanism triggered by a quorumNodeLeave event and the eventNode is its only parameter.

First create a folder to contain your callbacks (if you already have a location for your callbacks, use that instead)
Download/copy the following script into the callbacks location and make it executable.


Modify the script to indicate your available collector nodes by substituting "quorum" and "quorum_node_2" with the hostnames of your gpfs quorum nodes.

Note that in this case the callbacks location is /callback so you may have to modify the script accordingly.

Copy the modified script to all quorum nodes

Add the callback (run once from any quorum node);
mmaddcallback NodeDownCallback --command  /callback/snmp_collector_failover.sh --event quorumNodeLeave --parms %eventNode

if you want monitoring to be reverted back to the default prefered collector node after it comes back online, you may consider adding a node join callback;
mmaddcallback NodeJoinCallback --command  /callback/snmp_collector_failover.sh --event quorumNodeJoin --parms %eventNode

Unix --> Linux --> Android

What is Unix?
[From Wikipedia] Unix (officially trademarked as UNIX, sometimes also written as Unix) is a multitasking, multi-user computer operating system originally developed in 1969 by a group of AT&T employees at Bell Labs, including Ken Thompson, Dennis Ritchie, Brian Kernighan, Douglas McIlroy, Michael Lesk and Joe Ossanna. The Unix operating system was first developed in assembly language, but by 1973 had been almost entirely recoded in C, greatly facilitating its further development and porting to other hardware. Today's Unix system evolution is split into various branches, developed over time by AT&T as well as various commercial vendors, universities (such as University of California, Berkeley's BSD), andnon-profit organizations.
The Open Group, an industry standards consortium, owns the UNIX trademark. Only systems fully compliant with and certified according to the Single UNIX Specification are qualified to use the trademark; others might be called Unix system-like or Unix-like, although the Open Group disapproves[1] of this term. However, the term Unix is often used informally to denote any operating system that closely resembles the trademarked system.
During the late 1970s and early 1980s, the influence of Unix in academic circles led to large-scale adoption of Unix (particularly of the BSD variant, originating from the University of California, Berkeley) by commercial startups, the most notable of which are SolarisHP-UX and AIX, as well as Darwin, which forms the core set of components upon whichApple's OS XApple TV, and iOS are based.[2][3] Today, in addition to certified Unix systems such as those already mentioned, Unix-like operating systems such as MINIXLinuxAndroid, and BSD descendants (FreeBSDNetBSD,OpenBSD, and DragonFly BSD) are commonly encountered. The term traditional Unix may be used to describe an operating system that has the characteristics of either Version 7 Unix or UNIX System V
What are other unix-like operating systems?
Linux (including all its flavours eg, Redhat, Suse, Ubuntu, Centos, Debial etc)
Android
BSD (OpenBSD, FreeBSD etc)


Apple Mac OS (including iOS)
Apple's operating systems are derived from Unix, therefore they share a lot of features with Linux and other Unix/Unix-like operating systems.


Where is Linux used
Linux is, and has been the main driver of Internet (servers) for decades. It is the most used operating system in Servers (behind the scenes stuff) with Windows being the most popular for Client systems (Personal computers and desktops)
There are some upcoming popular Linux Desktops eg. Ubuntu


Linux, just as its parent Unix, has given birth to its own set of various derivatives like Android
Because Linux is highly customization, a lot of hardware manufacturers have used it as the operating system in their electronic products. Because these products are specialised (one task) computers, this operating system within them is generally know as the firmware


Firmware for many networking (internet) capable hardware devices use Linux
eg. Routers, TVs, Portable Media Players, Soft Phones all use some form of Linux at the core of its operations.
Some sensor devices like the Zigbee sensors also use some form of Linux as their central system
A majority of mobile phones (both feature and smartphones) use a derivative of Unix/Linux as their operating systems


Why Linux is so popular, whilst little known
Linux is like underwear in our life. Its necessary and everyone subtly uses one, without thinking about it, but there are great companies producing these undergarments around the clock. You have freedom to wear the one you like, that suits your needs, or even to modify at will. You have choice to have the very cheap (or free) yet functional ones, or flashy brands according to your taste. Most people who use Linux say the major thing about is "Freedom" Whatever that means, Linux is a basic component of our modern technologies.