Welcome to the Data Center Automation Blog, where you can read perspectives from storage management experts. This blog provides insights into the data center automation solution, as well as technical details about specific IBM products.
Ever tried to bring a resource offline, only for it to end up in a "Stuck online" state? A "Stuck online" situation can also prevent a move request (failover), since the first step of a move is to stop (offline) all resources that are in the scope of the move. Your first sign of a "Stuck online" situation will likely come from the output of the 'lssam' command. Here is some sample lssam output: Stuck online IBM.ResourceGroup:App-rg Request=Move Control=MemberInProblemState Nominal=... [More]
A Network TieBreaker is a popular configuration option for a Tivoli System Automation for Multiplatforms (TSA MP) managed environment. But what is a TieBreaker and why is it needed? To understand the what and why, you first need to understand the concept of "quorum". Please see my blog titled "TSA Blog Series: High Availability Concepts - What is Quorum ?" https://www.ibm.com/developerworks/community/blogs/d6a38b59-943a-434b-a473-b408ed64847d/entry/what_is_quorum?lang=en ... in a nutshell, a group of nodes is... [More]
Why are you using the Tivoli System Automation for Multiplatforms (TSAMP) product with your DB2 HADR or DB2 HA Shared Disk environment? The answer should be to keep the DB2 service highly available (and to gain some operational convenience from the basic automation TSAMP can provide). But how do you know your HA environment is prepared for that unexpected problem? In my mind, it's similar to a "backup" you take to help you recover from an unexpected event ... how do you know the backup is good and will restore when you need it? ... [More]
Hi, I'm also including the URL to the official guide Configuring an IBM Tivoli Storage Manager cluster with IBM Tivoli System Automation for Multiplatforms that shows how to set up and operate an SA MP cluster to keep TSM highly available.
Here are some power-user commands mainly for servicing SA MP which I have gathered over time and are worth sharing. They can be made into shell aliases or just used as is. Have fun!
Get SAMP version:
    samversion -b
Get RSCT version:
    ctversion -b
Check operator commands:
    cat /var/ct/IBM.RecoveryRM.log
Check operator commands on every node in the cluster:
    for NODE in `lsrpnode -x | cut -f1 -d' '` ; do ssh $NODE cat /var/ct/IBM.RecoveryRM.log; done
Periodically check RSCT subsystems (watch) - LINUX:
    while [ 1 -eq 1 ]; do clear;... [More]
We're very close to our first Open Program Call on Tivoli System Automation Application Manager (https://ibm.biz/BdxKhQ). The main focus of this call is the new user interface we intend to ship based on that new platform in a v3.next release. We'll post a recording on our Open Program developerWorks community (https://ibm.biz/BdxSCL). Stay tuned!
This self-paced audio-visual course provides an overview of the System Automation for z/OS 3.4 functional differences as they relate to implementation and administration. This is the first of a set of three courses that cover implementation and administration differences. The other two courses provide demonstrations and additional details on specific topics. The functional differences associated with operational commands are covered in a separate set of courses. Link: http://youtu.be/VsipoR6n52Q
Today we added a new utility to the SA Application Manager WIKI which allows you to model tasks as SA resources. This utility can be used together with the Agentless Adapter of the SA Application Manager as well as for the instrumentation of resources in SA for Multiplatforms. What are task resources? With System Automation it is possible to manage the availability of all resources in a datacenter. System Automation ensures availability 24x7 by constantly monitoring the availability of defined resources and automatically reacts on... [More]
It is officially live! The IT Service Management (Tivoli) support and development staff are contributing Q&A daily in IBM's dWAnswers forum. Check it out at https://developer.ibm.com/answers/questions/index.html Simply type in a "tag", which is usually your product's acronym, to see what topics exist. For example, the tag for IBM Tivoli Monitoring version 6 is simply ITMv6.
I wrote a new tool I call "db2tsacheck" to help avoid unwanted surprises due to configuration problems. Its sole purpose is to look for as many configuration problems as possible in TSAMP managed DB2 HADR or DB2 HA Shared Disk environments. It's a tool you can run on a periodic basis to validate that all is well with your configuration. To download the new version, please visit the following URL: http://www.ibm.com/support/docview.wss?uid=swg21685755 Although the first two releases provided good coverage of known configuration problems and... [More]
A new paper has been released on the System Automation Application Manager WIKI: Paper: Integrating OSLC It contains information and examples (in Java® and PERL) on how to control End-To-End resources via REST calls. For scripting this eliminates the necessity to log on to the node hosting the SA Application Manager and does not need to start a JVM (like eezcs does). As always we are very interested in your feedback and any nice solutions (like handy scripts) you are developing. - Sebastian Wegmann
Hi, since it is not easy to find, I'm including the URL to the official TWS integration guide here. Tivoli Workload Scheduler - Integrating with other products Chapter 6 of this guide explains how to set up and operate a cluster with SA MP to keep TWS highly available.
Hi, there is a cool new video on YouTube showing the (old) integration of SA Application Manager with ITM to add policy-based automation to resources of a datacenter being monitored by ITM. This video also impressively shows the integration of SA and ITM widgets in one combined dashboard. Have a look: http://youtu.be/_5OHhZ0czdU cheers, Josi.
Given a 2-node domain, what options do you have to allow a single node to obtain Operational Quorum after bringing one node offline, knowing that your TieBreaker device will not be accessible from the surviving node? Firstly, if you want to know more about "Quorum", check out one of my earlier blogs: https://www.ibm.com/developerworks/community/blogs/d6a38b59-943a-434b-a473-b408ed64847d/entry/what_is_quorum There are a number of scenarios where clients have ended up with a surviving node unable to obtain quorum during... [More]
To understand how things are organized in a TSAMP/RSCT environment, we start with the idea that almost everything is considered a resource. Of course there are different kinds of resources and that is where we introduce the concept of a resource "class". Then there are different Resource Managers, each responsible for managing or controlling resources that belong to a particular set of resource classes. The following diagram shows the mapping of three key Resource Managers to some Resource Classes they manage and then to some example... [More]
Find System Automation at Pulse2014... The following is a compilation of sessions about System Automation Application Manager.
2/24 Monday
- 6PM-7PM BME-2437 Managing Your System Automation Landscape with Mobile Devices
2/25 Tuesday
- 12.15PM-1.00PM SZM-2436 The "New Face" of IBM Tivoli System Automation Application Manager
- 2.15PM-3.15PM SZM-1213 What's new in Tivoli System Automation 4.1 and what it means for your cloud
2/26 Wednesday
- 10.45AM-11.45AM SZM-1798 Generali's perspective on Tivoli System Automation Application... [More]
Hi, after the pre-announcement for the EOTC2014 in Boeblingen/Germany - here you find more information and the link for enrollment: EOTC2014 This conference addresses the needs of System, Network and Automation Administrators. Sessions are presented by representatives from IBM software development teams and YOU, the product exploiters. Interaction and feedback is both expected and encouraged. Topics cover a range of IBM system management products, including IBM Tivoli System Automation Family (SA z/OS, SA for Multiplatforms... [More]
The simple answer is RSCT (the "cluster" software) trace data from key daemons (sub-systems) like IBM.ConfigRM, cthags, cthats, and possibly even RMCd. IBM Support would use this trace data to confirm whether or not it was RSCT that forced the reboot, and if so, why. However, what isn't always that simple is collecting these trace files before they wrap. The trace files for all the RSCT and TSAMP core daemons and Resource Managers are set up to be a fixed size and First-In, First-Out (FIFO). Some of these trace files are... [More]
IBM Education Assistant or "IEA" has released 135 videos and links to YouTube videos so far this year. What is IEA? The IBM Education Assistant (IEA) integrates narrated presentations, Teach Me Demonstrations, tutorials, and resource links to help you successfully configure and use IBM software products. In detail, the information provided in IEA is role and task based to demonstrate how to accomplish "specific" tasks in your software product(s). The term coined for this is aptly named "Just in Time"... [More]
We are pleased to announce the European Operations Technical Conference (EOTC) 2014. The conference will be held from Monday, March 17 through Friday, March 21, 2014 in the IBM Lab in Boeblingen, Germany. Conference language will be English. The conference is a two-track event:
• First track is focused on Mainframe (z/OS) themes with IBM Tivoli System Automation for z/OS. This track will be held from Monday, March 17 (1 PM) through Wednesday, March 19 (5 PM).
• ... [More]
"The Majority Rules!" In a nutshell, you have Quorum if you have the majority. Quorum is the number of "operational" nodes in a cluster that are required to control the resources, modify the cluster definition, or perform certain cluster operations. The main goals of quorum operations:
- identify who has the majority when a cluster is broken up into sub-clusters
- keep data consistent, especially when shared file systems are being used
- protect critical resources … maintain HA control
If critical resources are... [More]
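To make the "majority rules" idea concrete, here is a minimal shell sketch. The function name has_quorum and its arguments are hypothetical and are not part of TSA MP or RSCT. It assumes quorum means a strict majority of the defined nodes, and that in an exact split a sub-cluster only gets quorum if it won the tiebreaker; the tie handling is an assumption for illustration, not the product's actual algorithm.

```shell
#!/bin/sh
# Hypothetical helper (not an RSCT/TSA MP command).
# Usage: has_quorum ONLINE_NODES TOTAL_NODES [TIEBREAKER_WON]
# Prints "yes" if the sub-cluster holds a strict majority of the nodes,
# or holds exactly half and won the tiebreaker (assumed tie rule).
has_quorum() {
    online=$1
    total=$2
    tiebreaker=${3:-no}
    if [ $((online * 2)) -gt "$total" ]; then
        # More than half of the defined nodes: majority.
        echo yes
    elif [ $((online * 2)) -eq "$total" ] && [ "$tiebreaker" = yes ]; then
        # Exact split: only the side that won the tiebreaker proceeds.
        echo yes
    else
        echo no
    fi
}
```

For example, `has_quorum 2 3` prints yes, `has_quorum 1 2` prints no, and `has_quorum 1 2 yes` prints yes, which is the two-node case where a tiebreaker matters.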
In response to: October 15th, 3 PM CET: Open Program Demo Call Date blocked so far. Just a question on behalf of TK: when will we see the policy editor (either a new one, or the old one in "new clothes" ;-) )? What is planned here so far?
In a previous blog, I talked about the importance of collecting and providing diagnostic data. For the Tivoli System Automation for Multiplatforms (TSA MP) product, this means running its automated data collection utility called "getsadata". However, there are details about a problem situation that cannot be obtained by a tool, script, or bunch of commands. The most obvious is the problem description itself. So what does a good problem description entail? Well, a timeline for one. Let's say Support staff have to dig into the log and... [More]
If you're planning on asking IBM Support for help, more than likely there will be a minimum amount of detail they will need up front. The most obvious is details about your environment, such as platform/OS and product versions. Now, if you need the root cause for some event that has since passed, keep in mind that someone providing "remote" support will likely need historical log or trace data before they will be able to offer you anything significant. For the Tivoli System Automation for Multiplatforms (TSAMP)... [More]
We've created a "landing page" for a collection of support resources pertaining to the Tivoli System Automation for Multiplatforms (TSA MP) product. Think of it as the home page for your initial Support needs. Here's the direct URL: http://www.ibm.com/support/docview.wss?uid=swg27024950 This page is permanently linked from TSAMP's IBM Support Portal site, under "Featured documents". The landing page is divided into a collection of categories, each with its own home page. For example: -... [More]
IBM Tivoli System Automation for Multiplatforms - Introduction & Overview - A software solution that leverages policy-based, self-healing and cross-platform application automation capabilities to realize monitoring and operation of a multitiered environment, manage outages, automatic recovery and movement of resources. (Tivoli System Automation for Multiplatforms will be abbreviated as SAMP in this article.) Terms High availability : A system that is continuously available and self-healing to prevent downtime caused by system... [More]
"Consolidation drives value." "Manage less and do more." These initiatives, and many others like them, make sense from a financial standpoint, but can often lead to sleepless nights for operations managers once they realize the majority of their business, sometimes as much as 85%, is now run out of a single datacenter or a single platform. What happens when the system goes down? What happens when my datacenter loses connectivity? How quickly can I recover from an outage? Do I even... [More]
We just published a solution in our product wiki on how to set up an SA MP end-to-end automation adapter with a non-root user: https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/Tivoli%20System%20Automation/page/SAMP%20Adapter%20non-root%20setup Feedback very welcome!
Are you interested in a sneak preview of content we intend to deliver with the next releases of SA Application Manager and SA Multiplatforms? Have you always wanted to know how you can integrate IBM Tivoli Workload Scheduler with SA Application Manager? Have you ever wondered what our new user interface looks like? Or what a ServerGroup is? Or how you can easily understand what's going on in an SA MP cluster? Yes?? Then our Tivoli System Automation Open Program developerWorks group will link you to answers. We... [More]