Welcome to the Data Center Automation Blog, where you can read the perspectives from storage management experts. This Blog provides insights into the data center automation solution, as well as technical details about specific IBM products.
Ever tried to bring a resource offline only for it to result in a state of "Stuck online" ? A "Stuck online" situation could also prevent a move request (failover) since the first step of a move is to stop/offline all resources that are in the scope of the move. Your first sign of a "Stuck online" situation will likely be from the output of the 'lssam' command. Here is some sample lssam output : Stuck online IBM.ResourceGroup:App-rg Request=Move Control=MemberInProblemState Nominal=... [More]
A Network TieBreaker is a popular configuration option for a Tivoli System Automation for Multiplatforms (TSA MP) managed environment. But what is a TieBreaker and why is it needed ? To understand the what and why, you first need to understand the concept of "quorum" ... please see my blog titled "TSA Blog Series: High Availability Concepts - What is Quorum ?" https://www.ibm.com/developerworks/community/blogs/d6a38b59-943a-434b-a473-b408ed64847d/entry/what_is_quorum?lang=en ... in a nutshell, a group of nodes is... [More]
Why are you using the Tivoli System Automation for Multiplaforms (TSAMP) product with your DB2 HADR or DB2 HA Shared Disk environment ? The answer should be to keep the DB2 service highly available (and provide some operational convenience via some basic automation TSAMP can provide). But how do you know your HA environment is prepared for that unexpected problem. In my mind, its similar to a "backup" you take to help you recover from an unexpected event ... how do you know the backup is good and will restore when you need it? ... [More]
Hi, I'm also including the URL to the official guide Configuring an IBM Tivoli Storage Manager cluster with IBM Tivoli System Automation for Multiplatforms that shows how to set up and operate an SA MP cluster to keep TSM highly available.
Here are some power-user commands mainly for servicing SA MP which I have gathered over time and are worth sharing. They can be made into shell aliases or just used as is. Have fun! Get SAMP version samversion -b Get RSCT version ctversion -b Check operator commands cat /var/ct/IBM.RecoveryRM.log Check operator commands on every node in the cluster for NODE in `lsrpnode -x | cut -f1 -d' '` ; do ssh $NODE cat /var/ct/IBM.RecoveryRM.log; done Periodically check RSCT subsystems (watch) - LINUX while [ 1 -eq 1 ]; do clear;... [More]
We're very close to our first Open Program Call on Tivoli System Automation Application Manager(https://ibm.biz/BdxKhQ) Main focus of this call is the new User Interface we intend to ship based on that new platform in a v3.next release. We'll post a recording on our Open Program developerWorks community (https://ibm.biz/BdxSCL)- stay tuned!
This self-paced audio-visual course provides an overview of the System Automation for z/OS 3.4 functional differences as they relate to implementation and administration. This is is the first of three courses in a set of courses that cover implementation and administration differences. The other two courses provide demonstrations and additional details on specific topics. The functional differences associated with operational commands are covered in a separate set of courses. Link: http://youtu.be/VsipoR6n52Q
Today we added a new utility to the SA Application Manager WIKI which allows to model tasks as SA resources. This utility can be used together with the Agentless Adapter of the SA Application Manager as well as for the instrumentation of resources in SA for Multiplatforms. What are task resources? With System Automation it is possible to
manage the availability of all resources in a datacenter. System Automation
ensures the availability 24x7 by constantly monitoring the availability of defined
resources and automatically reacts on... [More]
It is officially live! The IT Service Management ( Tivoli) support and development staff's are daily contributing Q&A in IBM's dWAnswers forum. Check it out at https://developer.ibm.com/answers/questions/index.html Simply type in a "tag" - which is usually your products acronym to see what topics exist. For example: IBM Tivoli Monitoring version 6 is simply ITMv6
I wrote a new tool I call "db2tsacheck" to help avoid unwanted surprises due to configuration problems. Its sole purpose is to look for as many configuration problems as possible for TSAMP managed DB2 HADR or DB2 HA Shared Disk environments. Its a tool you can run on a periodic basis to validate all is well with your configuration. To download the new version, please visit the following URL: http://www.ibm.com/support/docview.wss?uid=swg21685755 Although the first two releases provided good coverage of known configuration problems and... [More]
A new paper has been released on the System Automation Application Manager WIKI: Paper: Integrating OSLC It contains information and examples (in Java® and PERL) on how to control End-To-End resources via REST calls. For scripting this eliminates the necessity to log on to the node hosting the SA Application Manager and does not need to start a JVM (like eezcs does). As always we are very interested in your feedback and any nice solutions (like handy scripts) you are developing. - Sebastian Wegmann
Hi, since it it not easy to find, I'm including the URL to the official TWS integration guide here. Tivoli Workload Scheduler - Integrating with other products Chapter 6 of this guide explains how to set up and operate a cluster with SA MP to keep TWS highly available.
Hi, there is a new cool video on youtube showing the (old) integration of SA Application Manager with ITM to add policy-based automation on resources of a datacenter being monitored by ITM. This video also impressively shows the integration of SA and ITM widgets in one combined dashboard. Have a look: http://youtu.be/_5OHhZ0czdU cheers, Josi.
Given a 2 node domain, what option do you have to allow a single node to obtain Operational Quorum after bringing one node offline and knowing your TieBreaker device will not be accessible from the surviving node ? Firstly, if you want to know more about "Quorum", check out one of my earlier blogs: https://www.ibm.com/developerworks/community/blogs/d6a38b59-943a-434b-a473-b408ed64847d/entry/what_is_quorum There are a number of scenarios where clients have ended up with a surviving node unable to obtain quorum during... [More]
To understand how things are organized in a TSAMP/RSCT environment, we start with the idea that almost everything is considered a resource. Of course there are different kinds of resources and that is where we introduce the concept of a resource "class". Then there are different Resource Managers, each responsible for managing or controlling resources that belong to a particular set of resource classes. The following diagram shows the mapping of three key Resource Managers to some Resource Classes they manage and then to some example... [More]