The questions started with the simple: What's this "steal" CPU metric column?
In response to the original question, a wiki page for measuring stolen CPU cycles was developed and posted on developerWorks. Turns out this was just the simple "get your feet wet" introduction.
And more questions came in.
- In one case, someone only uses vmstat. The averaging of the individual CPU metrics gets confusing.
- In another case, there's a regular user of nmon, with the question of how nmon works with the steal metrics.
- What happens when partitions are sharing more CPU resources on the larger systems? Can steal go over 100%?
- How does a customer monitor and reconcile CPU metrics across AIX, SLES, and Red Hat partitions systems running on a single physical system?
- And we quickly discovered that the various terms of CPUs, cores, SMT threads, virtual processor, logical CPUs, Linux's lparcfg, Linux terms across platforms, AIX terms, HMC terms all get muddled together
So work is proceeding. Several more Linux wiki pages are being developed. First we're settling on the terms and how tools like vmstat and nmon report on the "steal" metrics.
In this forum thread, we'll report on the progress being made, while trolling for more questions. We expect to post some draft wiki pages to solicit comment and feedback in the coming days.