News
Abstract
The previous versions of PowerVP allowed you to record the PowerVP performance statistics in a file on the client where you were running the GUI. While this was very useful, it required you to have the GUI running in order to record the information so it would be available at a later time to investigate problems. With version 1.1.3, you can now configure the PowerVP agents to “record” the performance statistics to a file on the agent partition. This allows you to have the PowerVP information recorded 24x7 and you only need to use the GUI to view the information when you are investigating a problem.
Content
Configuring the Data Store
This function is not enabled by default. When sampling every second for 24 hours, the size of these files can get very large, for example, on a Power 8 system with 24 cores, the file size for 24 hours was 455,626,157 bytes. Also, if the partition has a maximum file size, you will need to configure PowerVP so that it doesn’t exceed that maximum. Therefore, you need to plan for the PowerVP data store files. This configuration is done with 5 directives in the configuration file. The configuration file is located in /QIBM/UserData/PowerVP/powervp.conf on IBM i and in /etc/opt/ibm/powervp/powervp.conf on AIX/VIOS and Linux. If you change the directives in the configuration file, you will need to stop the PowerVP agent and start it again to pick up the changes. If the PowerVP agent encounters an error due to running out of space or hitting the maximum file size, it will simply stop logging data, but continue to run. This will enable the PowerVP GUI to still display live performance data after encountering problems writing the data store.
LogData directive
The LogData directive directs the PowerVP agent if it is to do logging and what information is logged. The default for this directive is No. A value of Yes will enable the PowerVP agent to log both system level and partition level information to the file. A value of System will enable the PowerVP agent to log only the system level performance data. A value of Partition will enable the PowerVP agent to log only the partition level performance data.
On the partition level agents, you simply need to decide if you want to have the partition level information logged in a file. This information is the basic partition CPU utilization, Disk I/O, Ethernet I/O, and the cycles per instruction (CPI) information you see on the partition drilldown. If you decide you want this information to be logged, use the Partition value for the LogData directive on your partition level agent.
On your system level agent, you will need to decide if you want to log the partition level information in addition to the system level information. If you are running the system level agent on your VIOS, the partition drill down information is more than likely not necessary to retain. Therefore, you will probably use the System value for the LogData directive on your system level agent. If your system level agent isn’t a VIOS, you have the same decision you have on the partition level agents, as to whether you want the partition information logged.
SampleInterval directive
The PowerVP agent gets the performance metrics based on the value of the SampleInterval directive. It defaults to 1 second, which means the agent will retrieve new metrics every second and if you are using the data store, it will write that information to the file every second. You may want to consider using a larger sample interval to help reduce the size of the data store file. The SampleInterval directive has just one value, numeric, in seconds, with valid values from 1 to 120.
LogFilePath directive
The location of the data store is configurable through the LogFilePath directive. The default location on IBM i is /QIBM/UserData/PowerVP/logs and on AIX/VIOS and Linux, the default location is /opt/ibm/powervp/logs. The PowerVP agent can create the log file in any location that the operating system can access, so you can put the file where you want to.
LogFileRotation
The LogFileRotation directive controls when the PowerVP agent will close the current data store file and start logging to a new data store file. The PowerVP agent will always rotate files at midnight. If you want more frequent rotation, you can provide a value from 1 to 24 hours. You can instead provide a file size in megabytes or gigabytes for the LogFileRotation directive which will cause the PowerVP agent to rotate the file when it reaches the configured size. The value is numeric for both options. It is followed by an H for hours, M for megabytes, or G for gigabytes, for example, 12H or 40M or 8G. If you use more frequent log file rotation, you can move the old log files off of your partition more frequently to conserve space. You may find it beneficial to rotate on size if you want to analyze the data using a spreadsheet application. Smaller files will load quicker and take less time to process.
LogFileArchive
The LogFileArchive directive controls when the PowerVP agent will archive or delete old data store files. This directive defaults to 7 days. At midnight, the PowerVP agent will check its LogFilePath directory for log files that are older than the configured number of days and delete files older than that. Again, you can specify this in days, using a smaller value to cause PowerVP to reclaim the disk used by old files. This will also give you time to move the files to another location if you want to retain them longer.
Data Store Data
The data store file is a standard comma separated values (CSV) format, allowing you to import the data into spreadsheet applications for further manipulation. You can sort the records, and then apply your own formulas to generate graphs and charts for your system utilization over time.
The first part of every file contains information records that describe what is contained in the data records. Column 1 is used as the row type. Column 2 is the “occurrence” count, and an occurrence count of 0 indicates the record is column labels defining the data records. Column 3 of every row except the AAA and ZZZZ rows is the time column which correlates the data record to the time it occurred. Let’s take a look at each of the records. We will include a brief description of the record followed by a table that contains the column headings and one or more data records so you can get an idea of what the data will look like.
AAA – General information records
These records are always located at the beginning of each file. These records consist of 3 columns. Column 2 contains the type of information and column 3 contains the value.
|
AAA |
progname |
PowerVP |
|
AAA |
Version |
V1R1M3 |
|
AAA |
Operating system |
IBM i |
|
AAA |
Operating system version |
V7R2M0 |
|
AAA |
Agent type |
system |
|
AAA |
Agent hostname |
MYSYSTEM.Com |
|
AAA |
Processor version |
POWER8 |
|
AAA |
Sample rate in seconds |
1 |
|
AAA |
Clock frequency in MHz |
3591 |
|
AAA |
Timebase frequency in MHz |
8 |
|
AAA |
System serial number |
12CD95T |
|
AAA |
System model |
42A |
ZZZZ – Time records.
There is a time record for every sample interval. Column 2 is the time “id” which is used in the other data records so they can be correlated to a time. All of the other data records have this time “id” in column 3.
|
ZZZZ |
T00000 |
Military time in HH:MM:SS |
Date in DD/MM/YYYY |
seconds since 00:00 hours Jan 1 1970 UTC |
timebase register |
|
ZZZZ |
T00001 |
0:00:00 |
7/21/2015 |
1437454800 |
338908111659353 |
|
ZZZZ |
T00002 |
0:00:01 |
7/21/2015 |
1437454801 |
338909117204676 |
|
.. |
|||||
|
ZZZZ |
T09623 |
2:45:04 |
7/21/2015 |
1437464704 |
343978974075930 |
The rest of the entries use column 2 as an occurrence count, for example, if the data record is “by core”, that data record will occur once for each core per time interval. The occurrence count is incremented for each core.
SYSTOP – System topology record.
There is normally only 1 system topology record in the file. If the system topology changes (the number of nodes or chips or cores or virtual processors changes), this record will be rewritten to show the change.
|
SYSTOP |
0 |
T00000 |
Number of nodes |
Number of chips |
Number of cores |
Number of virtual processors |
|
SYSTOP |
1 |
T00001 |
2 |
4 |
24 |
144 |
CHIPTOP – Chip topology records.
There is one chip topology record per chip. If the system topology changes, these records will be rewritten to show the change. Note that this record is 30 columns and since that doesn’t fit well on this page, we are showing it as continued.
|
CHIPTOP |
0 |
T00000 |
Physical chip id |
Hardware chip id |
Fabric chip id |
Hardware node id |
A (ABC) bus width |
|
X (WXYZ) bus width |
GX bus width |
MC bus width |
PHB bus width |
A0 (A) link attached |
A0 (A) link endpoint node id |
A1 (B) link attached |
|
|
A1 (B) link endpoint node id |
A2 (C) link attached |
A2 (C) link endpoint node id |
X0 (W) link attached |
X0 (W) link endpoint node id |
X1 (X) link attached |
X1 (X) link endpoint node id |
|
|
X2 (Y) link attached |
X2 (Y) link endpoint node id |
X3 (Z) link attached |
X3 (Z) link endpoint node id |
PHB0 (GX0) bus attached |
PHB1 (GX1) bus attached |
PHB2 bus attached |
|
|
PHB3 bus attached |
MC0 bus attached |
MC1 bus attached |
MC2 bus attached |
MC3 bus attached |
|||
|
CHIPTOP |
1 |
T00001 |
0 |
0 |
0 |
0 |
8 |
|
8 |
0 |
0 |
0 |
No |
255 |
Yes |
|
|
2 |
Yes |
2 |
No |
255 |
Yes |
1 |
|
|
No |
255 |
No |
255 |
Yes |
Yes |
No |
|
|
No |
No |
No |
No |
Yes |
CORETOP – Core topology records
There is one core topology record per physical core. If the system topology changes, these records will be rewritten to show the change. Note that this record is 15 columns and since that doesn’t fit well on this page, we are showing it as continued.
|
CORETOP |
0 |
T00000 |
Core id |
Chip id |
Module id |
Node id |
Core State |
|
Logical Processor Id |
Processor identification register |
Assigned partition |
Nominal frequency in MHZ |
Current frequency in MHZ |
Primary affinity domain |
Secondary affinity domain |
|
|
CORETOP |
1 |
T00001 |
0 |
0 |
0 |
0 |
Dedicated |
|
0 |
32 |
7 |
0 |
0 |
0 |
0 |
REGLPARS – Registered partition records
There is one per partition level agent registered to the system level agent and column 2 is used to count the number of partitions. These records are rewritten whenever a partition registers or de-registers from the system level agent. Note that this record is 12 columns and since that doesn’t fit well on this page, we are showing it as continued.
|
REGLPARS |
0 |
T00000 |
partition id |
version |
operating system |
agent type |
|
Processor version |
authentication type |
host name |
ip address |
recording |
||
|
REGLPARS |
1 |
T00001 |
6 |
3 |
IBM i |
system |
|
POWER8 |
system |
MYHOST2.COM |
10.5.1.2 |
Yes |
||
|
REGLPARS |
2 |
T00001 |
8 |
3 |
AIX |
partition |
|
POWER8 |
system |
MYHOST3.COM |
10.5.1.3 |
No |
AFFDTOP – Affinity information by domain
Affinity information domain records define the affinity domains on the system. There are one of these for each affinity domain on the system. If the system topology changes, these records will be rewritten to show the change. Note that this record is 11 columns and since that doesn’t fit well on this page, we are showing it as continued.
|
AFFDTOP |
0 |
T00000 |
primary domain |
secondary domain |
total processor units |
|
free dedicated processor units |
free shared processor union |
total memory |
free memory |
number of partitions in domain |
|
|
AFFDTOP |
1 |
T00001 |
0 |
0 |
600 |
|
0 |
0 |
128 |
0 |
11 |
AFFPTOP – Affinity information by partition
Affinity information by partition defines the affinity for each partition. Each partition has AFFPELE records associated with it. If the system topology changes, these records will be rewritten to show the change.
|
AFFPTOP |
0 |
T00000 |
partition id |
assignment order |
placement spread |
affinity score (0-100) |
number of affinity elements |
|
AFFPTOP |
1 |
T00001 |
1 |
1024 |
2 |
100 |
2 |
AFFPELE – Affinity elements for the partition
The affinity elements go with the AFFPTOP affinity information by partition records. There can be several of these per partition. If the system topology changes, these records will be rewritten to show the change. Note that this record is 11 columns and since that doesn’t fit well on this page, we are showing it as continued.
|
AFFPELE |
0 |
T00000 |
partition id |
primary affinity domain |
secondary domain index |
|
dedicated processor units allocated |
dedicated memory allocated default |
dedicated memory allocated reserved 1 |
dedicated memory allocated reserved 2 |
dedicated memory allocated 16 GB pages |
|
|
AFFPELE |
1 |
T00001 |
1 |
2 |
2 |
|
600 |
124 |
0 |
0 |
0 |
AFFVPROC – Affinity domain information by virtual processor
The Affinity domain information by virtual processor records define the virtual processor affinity of the system. If the system topology changes, these records will be rewritten to show the change.
|
AFFVPROC |
0 |
T00000 |
partition id |
virtual processor index |
physical processor index |
primary affinity domain index |
secondary affinity domain index |
|
AFFVPROC |
1 |
T00001 |
1 |
0 |
12 |
2 |
2 |
The next entries are data records that are written on the system level agent each sample interval.
SCPUBC – CPU utilization by core
The CPU utilization by core records contain the information on CPU utilization for each physical core. There is a row for each core. PowerVP computes the core utilization on POWER 7 by dividing the number in column 5 by the number in column 9 and on POWER 8 by dividing the number in column 5 by the number in column 6. Then to determine system CPU utilization, it adds the core utilizations for all the cores together.
|
SCPUBC |
0 |
T00000 |
core id |
user plus kernel PURR delta |
unfiltered PURR delta |
run instructions delta |
total run cycles delta |
timebase delta for this sample |
current core frequency in MHz |
|
SCPUBC |
1 |
T00001 |
0 |
5452 |
253935 |
5529186 |
22 |
227970280 |
1729 |
SCYCBC – Time base cycle utilization by core
The time base cycle utilization records contain the CPU cycles delta and timebase delta for each core for the sample period. There is a row for each core.
|
SCYCBC |
0 |
T00000 |
core id |
timebase cycles delta |
timebase delta |
|
SCYCBC |
1 |
T00001 |
0 |
527393129 |
527389555 |
SBUSBCH – bus utilization by chip
The bus utilization by chip records contain the bus utilization percent for each power bus on the system for each chip. There is a row for each chip. Note that this record is 31 columns and since that doesn’t fit well on this page, we are showing it as continued.
|
SBUSBCH |
0 |
T00000 |
chip id |
A0 (A) bus utilization percent |
A1 (B) bus utilization percent |
A2 (C) bus utilization percent |
|
X0 (W) bus utilization percent |
X1 (X) bus utilization percent |
X2 (Y) bus utilization percent |
X3 (Z) bus utilization percent |
inbound PHB0 or GX0 bus utilization percent |
inbound PHB0 or GX0 bus rate |
|
|
inbound PHB1 or GX1 bus utilization percent |
inbound PHB1 or GX1 bus rate |
inbound PHB2 bus utilization percent |
inbound PHB2 bus rate |
inbound PHB3 bus utilization percent |
inbound PHB3 bus rate |
|
|
outbound PHB0 or GX0 bus utilization percent |
outbound PHB0 or GX0 bus rate |
outbound PHB1 or GX1 bus utilization percent |
outbound PHB1 or GX1 bus rate |
outbound PHB2 bus utilization percent |
outbound PHB2 bus rate |
|
|
outbound PHB3 bus utilization percent |
outbound PHB3 bus rate |
MC0 bus utilization percent |
MC1 bus utilization percent |
MC2 bus utilization percent |
MC3 bus utilization percent |
|
|
SBUSBCH |
1 |
T00001 |
0 |
5 |
3 |
0 |
|
2 |
5 |
0 |
10 |
14 |
5 |
|
|
0 |
0 |
0 |
0 |
0 |
0 |
|
|
0 |
0 |
10 |
12 |
0 |
0 |
|
|
0 |
0 |
0 |
21 |
17 |
0 |
SCPUBP – CPU utilization by partition
The CPU utilization by partition records contain the CPU utilization for each partition. There is a row for each partition. PowerVP uses this information to calculate the LPAR CPU utilization by adding the capped cycles and uncapped cycles, subtracting the idle cycles, and dividing the result by the entitled capacity.
|
SCPUBP |
0 |
T00000 |
partition id |
entitled capacity |
capped cycles |
uncapped cycles |
donated cycles |
idle cycles |
|
SCPUBP |
5 |
T00001 |
6 |
99.99 |
3.32 |
0 |
0 |
1.43 |
SMETRICBP – partition metrics records
The partition metrics record contain some metrics for each partition. There is a row for each partition on the system. Note that this record is 16 columns and since that doesn’t fit well on this page, we are showing it as continued.
|
SMETRICBP |
0 |
T00000 |
partition id |
performance data version |
delta timebase cycles waiting for entitlement |
delta number of times waited on entitlement |
|
delta timebase cycles waiting for a physical processor |
delta number of times lpar dispatched to run |
delta home processor dispatches |
delta primary affinity domain dispatches |
delta secondary affinity domain dispatches |
delta remote dispatches |
|
|
delta dedicated donate processor dispatches |
instruction count delta |
timebase cycles delta |
||||
|
SMETRICBP |
5 |
T00001 |
6 |
8 |
0 |
0 |
|
17454 |
0 |
66 |
125 |
0 |
0 |
|
|
0 |
82237456 |
20216239 |
The next entries are data records that are written for partition level data each sample interval.
PSTAT – partition configuration information for the partition
The partition configuration information record contains specific configuration information for the partition. This is partition specific information, so there is only one record per sample interval.
|
PSTAT |
0 |
T00000 |
partition id |
dedicated or shared |
capped or uncapped |
donate enabled |
entitled capacity |
active processors in shared pool |
partition name |
|
PSTAT |
1 |
T00001 |
6 |
shared |
uncapped |
No |
1 |
7 |
partition1 |
PCPI – partition cycles per instruction information
The partition CPI (cycles per instruction) information is retrieved from the PMU. PowerVP collects event counts for the following groups on POWER8: pm_utilization, pm_cpi_stack2, pm_cpi_stack4, pm_cpi_stack15, pm_cpi_stack18, pm_dsource1, pm_dsource4, pm_dsource5, pm_dsource6, pm_dsource7, and pm_dsource8; and for the following groups on POWER7: pm_dlatencies3, pm_cpi_stack1, pm_cpi_stack2, pm_cpi_stack7, pm_dsource1, pm_dsource2, pm_dsource3, pm_dsource4, pm_dsource5, pm_dsource6, pm_psource10, pm_dsource12, and pm_prefetch2. Note that this record is 10 columns and since that doesn’t fit well on this page, we are showing it as continued.
|
PCPI |
0 |
T00000 |
group name |
event 1 count |
event 2 count |
event 3 count |
|
event 4 count |
event 5 count |
event 6 count |
||||
|
PCPI |
1 |
T00057 |
pm_utilization |
5167095267 |
265248469 |
208611066 |
|
154403910 |
154404030 |
265248862 |
||||
|
PCPI |
2 |
T00057 |
pm_cpi_stack2 |
2895997 |
85212474 |
6151185 |
|
11442943 |
73865727 |
162055910 |
||||
|
PCPI |
3 |
T00057 |
pm_cpi_stack4 |
20068825 |
6529613 |
0 |
|
789891 |
122718306 |
245905097 |
||||
|
PCPI |
4 |
T00057 |
pm_cpi_stack15 |
11868 |
165849 |
59573 |
|
0 |
85662264 |
180491237 |
||||
|
PCPI |
5 |
T00057 |
pm_cpi_stack18 |
230683 |
4973615 |
777788 |
|
2571942 |
69545632 |
145341703 |
||||
|
PCPI |
6 |
T00057 |
pm_dsource1 |
517490 |
424380 |
36668 |
|
10103 |
87454950 |
178839097 |
||||
|
PCPI |
7 |
T00057 |
pm_dsource4 |
252 |
2343 |
0 |
|
0 |
73672003 |
159681410 |
||||
|
PCPI |
8 |
T00057 |
pm_dsource5 |
483212 |
24381 |
6 |
|
33829 |
135258799 |
250535399 |
||||
|
PCPI |
9 |
T00057 |
pm_dsource6 |
14098 |
30 |
7 |
|
2453 |
91641172 |
193962053 |
||||
|
PCPI |
10 |
T00057 |
pm_dsource7 |
114283 |
120053 |
0 |
|
3064 |
118343117 |
266909340 |
||||
|
PCPI |
11 |
T00057 |
pm_dsource8 |
102121038 |
33487 |
151260 |
|
223914 |
102121038 |
196340232 |
PENET – Ethernet throughput
The Ethernet throughput records show the throughput for each Ethernet interface on the partition.
|
PENET |
0 |
T00000 |
name |
kilobytes sent |
kilobytes received |
|
PENET |
1 |
T00001 |
ETHLINE |
974180 |
732290 |
PDISK – disk I/O information
The disk I/O information records show I/O information for each disk on the partition.
|
PDISK |
0 |
T00000 |
name |
kilobytes read |
kilobytes written |
|
PDISK |
1 |
T00002 |
DD001 |
104200 |
207360 |
|
PDISK |
2 |
T00002 |
DD002 |
0 |
0 |
|
PDISK |
3 |
T00002 |
DD003 |
253840 |
368640 |
|
PDISK |
4 |
T00002 |
DD004 |
0 |
0 |
PCPU – CPU cycles
The CPU cycles records show the CPU used by the partition. This information is the same information as shown in the SCPUBP record, but only for the specific partition.
|
PCPU |
0 |
T00000 |
entitled cycles |
capped cycles |
uncapped cycles |
donated cycles |
idle cycles |
|
PCPU |
1 |
T00002 |
299.99 |
299.99 |
0 |
0 |
293.56 |
Was this topic helpful?
Document Information
Modified date:
19 December 2019
UID
ibm11128633