IBM®
Skip to main content
    Country/region [select]      Terms of use
 
 
    
     Home      Products      Services & solutions      Support & downloads      My account     
 
developerworks > My developerWorks >  Dashboard > AIX > ... > AIXV53Howtos > AIXV53dump
developerWorks
Log In   View a printable version of the current page.
Overview Connect Spaces Forums Wikis
AIXV53dump
Added by OneSkyWalker, last edited by OneSkyWalker on Nov 13, 2009  (view change)
Labels: 
(None)

How to monitor for issues with dump space on AIX V5.3

The /usr/lib/ras/dumpcheck command can be used to confirm that AIX system dump space is large enough to hold a stand-alone dump if one were taken at the time dumpcheck is run. If dumpcheck finds issues with dump space, it adds an entry to the AIX error log. Since the dump space requirement tends to grow as the system gets busier, configure dumpcheck to run regularly at a time when the system is likely to be fairly heavily loaded. Use crontab -l to confirm that root's crontab is configured to run dumpcheck regularly at an appropriate time and, if not, use crontab -e to update root's crontab.

If dumpcheck logs an error, please consider that dumpcheck will not necessarily be run at a time when the dump space requirement is at its peak. So allocate more space than dumpcheck says will be required. An uplift of at least 20% is recommended, and more if there is plenty of space available in rootvg.

The AIX error notification facility can be used to monitor for errors logged by dumpcheck.

A sample Korn shell script is available which configures an AIX error notification exit so that a note is sent to specified email addresses when dumpcheck adds an entry to the AIX error log. The email addresses are specified in a file with a suffix of .emailaddrs, residing the same directory and with the same base name as the shell script. (That is, if the shell script is named dumperr, then the address file must be named dumperr.emailaddrs.) It is possible to modify the sample shell script to take other actions instead of (or in addition to) sending a note.

Important note

To configure AIX so that the shell script can send a note to a user on another host, follow instructions on the How to configure AIX V5.3 to send mail to users on other hosts web page.

The shell script accesses a configuration file (dumperr.emailaddrs) in the directory in which it resides, so it is best to put the script is a directory dedicated to error notification (eg, /usr/local/errnotify) rather than a directory such as /usr/local/bin.

When invoked with no parameters, the shell script produces help text documenting the flags it supports:

surveyor:/ # /usr/local/errnotify/dumperr
Usage: dumperr {-e | -d | -q}
       -e - enable notification
       -d - disable notification
       -q - query to determine if notification is enabled
surveyor:/ #

Note

The Mirroring the root volume group article says, "Mirrored dump devices are supported in AIX® 4.3.3 or later." so mirroring a dump device is supported, but there is a reason to avoid mirroring a dump device. If access to an hdisk is temporarily lost, the hdisk ends up in the "missing" state:

root@ec04.dfw.ibm.com / # lsvg -p rootvg
rootvg:
PV_NAME           PV STATE          TOTAL PPs   FREE PPs    FREE DISTRIBUTION
hdisk0            active            511         66          00..00..00..00..66
hdisk1            missing           511         66          02..13..00..00..51
root@ec04.dfw.ibm.com / #

Once an hdisk is in the "missing" state, an attempt to sync the volume group containing the hdisk will fail even if access to the hdisk is restored:

root@ec04.dfw.ibm.com / # syncvg -p hdisk1
0516-068 lresyncpv: Unable to completely resynchronize volume. Run
        diagnostics if neccessary.
0516-936 /usr/sbin/syncvg: Unable to synchronize physical volume hdisk1.
root@ec04.dfw.ibm.com / #

The only way to get the hdisk back to the active state is to issue the varyonvg command (even though the volume group is already varied on):

root@ec04.dfw.ibm.com / # varyonvg rootvg
0516-1774 varyonvg: Cannot varyon volume group with an active dump device on
        a missing physical volume. Use sysdumpdev to temporarily replace the
        dump device with /dev/sysdumpnull and try again.
root@ec04.dfw.ibm.com / #

But, as shown above, the varyonvg fails if there is an active dump device on a missing physical volume.

So when AIX rootvg is mirrored, it is better to allocate two unmirrored dump devices (one defined as primary and one as secondary) rather than a single mirrored dump device. That way, the dump device on the missing physical volume can be deconfigured while still allowing a dump to be taken to the other dump device should a system crash occur.

But there is an issue with that approach, as well. The /usr/lib/ras/dumpcheck command will confirm only that the larger of the two dump devices is large enough. If one of the dump devices is too small, if access is lost to the hdisk on which the larger dump device resides, and AIX then crashes, the system dump will fail and diagnostic information will not be collected which is required to determine the cause of the crash.

So when two dump devices are allocated, care must be taken to make sure the primary and secondary dump devices are the same size.

Testing modifications

If the shell script is modified, it is prudent to test the script before putting it into production. The shell script can be tested by using the AIX sysdumpdev command to allocate an AIX dump space which is too small and then running /usr/lib/ras/dumpcheck. When dumpcheck adds an entry to the AIX error log, the script should send a note to the email addresses it finds in the hdwerr.emailaddrs file. If no note is received, please note that it is possible to enable tracing in the shell script by modifying two lines within it:

# Uncomment next two lines to enable trace
# set -x
# exec >>$logfile 2>&1 # Log everything to log file

The contents of this web page solely reflect the personal views of the authors and do not necessarily represent the views, positions, strategies or opinions of IBM or IBM management. Please use the Add Comment link at the bottom of the page to provide feedback. Note: Until you log in (using the link in the upper right corner of this web page), you will not see the Add Comment link and you can not add a comment. If you do not already have an IBM ID, use the Register Now link on the sign in page to obtain one. Registration is quick and easy.


 
    About IBM Privacy Contact