IBM Support

PowerVP version 1.1.3 Agent Data Store

News


Abstract

The previous versions of PowerVP allowed you to record the PowerVP performance statistics in a file on the client where you were running the GUI. While this was very useful, it required you to have the GUI running in order to record the information so it would be available at a later time to investigate problems. With version 1.1.3, you can now configure the PowerVP agents to “record” the performance statistics to a file on the agent partition. This allows you to have the PowerVP information recorded 24x7 and you only need to use the GUI to view the information when you are investigating a problem.

Content

Configuring the Data Store

This function is not enabled by default. When sampling every second for 24 hours, the size of these files can get very large, for example, on a Power 8 system with 24 cores, the file size for 24 hours was 455,626,157 bytes. Also, if the partition has a maximum file size, you will need to configure PowerVP so that it doesn’t exceed that maximum. Therefore, you need to plan for the PowerVP data store files. This configuration is done with 5 directives in the configuration file. The configuration file is located in /QIBM/UserData/PowerVP/powervp.conf on IBM i and in /etc/opt/ibm/powervp/powervp.conf on AIX/VIOS and Linux. If you change the directives in the configuration file, you will need to stop the PowerVP agent and start it again to pick up the changes. If the PowerVP agent encounters an error due to running out of space or hitting the maximum file size, it will simply stop logging data, but continue to run. This will enable the PowerVP GUI to still display live performance data after encountering problems writing the data store.

LogData directive

The LogData directive directs the PowerVP agent if it is to do logging and what information is logged. The default for this directive is No. A value of Yes will enable the PowerVP agent to log both system level and partition level information to the file. A value of System will enable the PowerVP agent to log only the system level performance data. A value of Partition will enable the PowerVP agent to log only the partition level performance data.

On the partition level agents, you simply need to decide if you want to have the partition level information logged in a file. This information is the basic partition CPU utilization, Disk I/O, Ethernet I/O, and the cycles per instruction (CPI) information you see on the partition drilldown. If you decide you want this information to be logged, use the Partition value for the LogData directive on your partition level agent.

On your system level agent, you will need to decide if you want to log the partition level information in addition to the system level information. If you are running the system level agent on your VIOS, the partition drill down information is more than likely not necessary to retain. Therefore, you will probably use the System value for the LogData directive on your system level agent.  If your system level agent isn’t a VIOS, you have the same decision you have on the partition level agents, as to whether you want the partition information logged.

SampleInterval directive

The PowerVP agent gets the performance metrics based on the value of the SampleInterval directive. It defaults to 1 second, which means the agent will retrieve new metrics every second and if you are using the data store, it will write that information to the file every second. You may want to consider using a larger sample interval to help reduce the size of the data store file. The SampleInterval directive has just one value, numeric, in seconds, with valid values from 1 to 120.

LogFilePath directive

The location of the data store is configurable through the LogFilePath directive. The default location on IBM i is /QIBM/UserData/PowerVP/logs and on AIX/VIOS and Linux, the default location is /opt/ibm/powervp/logs. The PowerVP agent can create the log file in any location that the operating system can access, so you can put the file where you want to.

LogFileRotation

The LogFileRotation directive controls when the PowerVP agent will close the current data store file and start logging to a new data store file. The PowerVP agent will always rotate files at midnight. If you want more frequent rotation, you can provide a value from 1 to 24 hours. You can instead provide a file size in megabytes or gigabytes for the LogFileRotation directive which will cause the PowerVP agent to rotate the file when it reaches the configured size. The value is numeric for both options. It is followed by an H for hours, M for megabytes, or G for gigabytes, for example, 12H or 40M or 8G. If you use more frequent log file rotation, you can move the old log files off of your partition more frequently to conserve space. You may find it beneficial to rotate on size if you want to analyze the data using a spreadsheet application. Smaller files will load quicker and take less time to process.

LogFileArchive

The LogFileArchive directive controls when the PowerVP agent will archive or delete old data store files. This directive defaults to 7 days. At midnight, the PowerVP agent will check its LogFilePath directory for log files that are older than the configured number of days and delete files older than that. Again, you can specify this in days, using a smaller value to cause PowerVP to reclaim the disk used by old files. This will also give you time to move the files to another location if you want to retain them longer.

Data Store Data

The data store file is a standard comma separated values (CSV) format, allowing you to import the data into spreadsheet applications for further manipulation. You can sort the records, and then apply your own formulas to generate graphs and charts for your system utilization over time.

The first part of every file contains information records that describe what is contained in the data records.  Column 1 is used as the row type. Column 2 is the “occurrence” count, and an occurrence count of 0 indicates the record is column labels defining the data records. Column 3 of every row except the AAA and ZZZZ rows is the time column which correlates the data record to the time it occurred. Let’s take a look at each of the records. We will include a brief description of the record followed by a table that contains the column headings and one or more data records so you can get an idea of what the data will look like.

AAA – General information records

These records are always located at the beginning of each file. These records consist of 3 columns. Column 2 contains the type of information and column 3 contains the value. 

AAA

progname

PowerVP

AAA

Version

V1R1M3

AAA

Operating system

IBM i

AAA

Operating system version

V7R2M0

AAA

Agent type

system

AAA

Agent hostname

MYSYSTEM.Com

AAA

Processor version

POWER8

AAA

Sample rate in seconds

1

AAA

Clock frequency in MHz

3591

AAA

Timebase frequency in MHz

8

AAA

System serial number

 12CD95T

AAA

System model

 42A

ZZZZ – Time records.

There is a time record for every sample interval. Column 2 is the time “id” which is used in the other data records so they can be correlated to a time. All of the other data records have this time “id” in column 3.

ZZZZ

T00000

Military time in HH:MM:SS

Date in DD/MM/YYYY

seconds since 00:00 hours Jan 1 1970 UTC

timebase register

ZZZZ

T00001

0:00:00

7/21/2015

1437454800

338908111659353

ZZZZ

T00002

0:00:01

7/21/2015

1437454801

338909117204676

..

ZZZZ

T09623

2:45:04

7/21/2015

1437464704

343978974075930

The rest of the entries use column 2 as an occurrence count, for example, if the data record is “by core”, that data record will occur once for each core per time interval. The occurrence count is incremented for each core.

SYSTOP – System topology record.

There is normally only 1 system topology record in the file. If the system topology changes (the number of nodes or chips or cores or virtual processors changes), this record will be rewritten to show the change.

SYSTOP

0

T00000

Number of nodes

Number of chips

Number of cores

Number of virtual processors

SYSTOP

1

T00001

2

4

24

144

CHIPTOP – Chip topology records.

There is one chip topology record per chip. If the system topology changes, these records will be rewritten to show the change. Note that this record is 30 columns and since that doesn’t fit well on this page, we are showing it as continued.

CHIPTOP

0

T00000

Physical chip id

Hardware chip id

Fabric chip id

Hardware node id

A (ABC) bus width

X (WXYZ) bus width

GX bus width

MC bus width

PHB bus width

A0 (A) link attached

A0 (A) link endpoint node id

A1 (B) link attached

A1 (B) link endpoint node id

A2 (C) link attached

A2 (C) link endpoint node id

X0 (W) link attached

X0 (W) link endpoint node id

X1 (X) link attached

X1 (X) link endpoint node id

X2 (Y) link attached

X2 (Y) link endpoint node id

X3 (Z) link attached

X3 (Z) link endpoint node id

PHB0 (GX0) bus attached

PHB1 (GX1) bus attached

PHB2 bus attached

PHB3 bus attached

MC0 bus attached

MC1 bus attached

MC2 bus attached

MC3 bus attached

CHIPTOP

1

T00001

0

0

0

0

8

8

0

0

0

No

255

Yes

2

Yes

2

No

255

Yes

1

No

255

No

255

Yes

Yes

No

No

No

No

No

Yes

CORETOP – Core topology records

There is one core topology record per physical core. If the system topology changes, these records will be rewritten to show the change. Note that this record is 15 columns and since that doesn’t fit well on this page, we are showing it as continued.

CORETOP

0

T00000

Core id

Chip id

Module id

Node id

Core State

Logical Processor Id

Processor identification register

Assigned partition

Nominal frequency in MHZ

Current frequency in MHZ

Primary affinity domain

Secondary affinity domain

CORETOP

1

T00001

0

0

0

0

Dedicated

0

32

7

0

0

0

0

REGLPARS – Registered partition records 

There is one per partition level agent registered to the system level agent and column 2 is used to count the number of partitions. These records are rewritten whenever a partition registers or de-registers from the system level agent. Note that this record is 12 columns and since that doesn’t fit well on this page, we are showing it as continued.

REGLPARS

0

T00000

partition id

version

operating system

agent type

Processor version

authentication type

host name

ip address

recording

REGLPARS

1

T00001

6

3

IBM i

system

POWER8

system

MYHOST2.COM

10.5.1.2

Yes

REGLPARS

2

T00001

8

3

AIX

partition

POWER8

system

MYHOST3.COM

10.5.1.3

No

AFFDTOP – Affinity information by domain

Affinity information domain records define the affinity domains on the system. There are one of these for each affinity domain on the system. If the system topology changes, these records will be rewritten to show the change. Note that this record is 11 columns and since that doesn’t fit well on this page, we are showing it as continued.

AFFDTOP

0

T00000

primary domain

secondary domain

total processor units

free dedicated processor units

free shared processor union

total memory

free memory

number of partitions in domain

AFFDTOP

1

T00001

0

0

600

0

0

128

0

11

AFFPTOP – Affinity information by partition

Affinity information by partition defines the affinity for each partition. Each partition has AFFPELE records associated with it. If the system topology changes, these records will be rewritten to show the change.

AFFPTOP

0

T00000

partition id

assignment order

placement spread

affinity score (0-100)

number of affinity elements

AFFPTOP

1

T00001

1

1024

2

100

2

AFFPELE – Affinity elements for the partition

The affinity elements go with the AFFPTOP affinity information by partition records. There can be several of these per partition. If the system topology changes, these records will be rewritten to show the change. Note that this record is 11 columns and since that doesn’t fit well on this page, we are showing it as continued.

AFFPELE

0

T00000

partition id

primary affinity domain

secondary domain index

dedicated processor units allocated

dedicated memory allocated default

dedicated memory allocated reserved 1

dedicated memory allocated reserved 2

dedicated memory allocated 16 GB pages

AFFPELE

1

T00001

1

2

2

600

124

0

0

0

AFFVPROC – Affinity domain information by virtual processor

The Affinity domain information by virtual processor records define the virtual processor affinity of the system. If the system topology changes, these records will be rewritten to show the change.

AFFVPROC

0

T00000

partition id

virtual processor index

physical processor index

primary affinity domain index

secondary affinity domain index

AFFVPROC

1

T00001

1

0

12

2

2

The next entries are data records that are written on the system level agent each sample interval.

SCPUBC – CPU utilization by core

The CPU utilization by core records contain the information on CPU utilization for each physical core. There is a row for each core. PowerVP computes the core utilization on POWER 7 by dividing the number in column 5 by the number in column 9 and on POWER 8 by dividing the number in column 5 by the number in column 6. Then to determine system CPU utilization, it adds the core utilizations for all the cores together.

SCPUBC

0

T00000

core id

user plus kernel PURR delta

unfiltered PURR delta

run instructions delta

total run cycles delta

timebase delta for this sample

current core frequency in MHz

SCPUBC

1

T00001

0

5452

253935

5529186

22

227970280

1729

SCYCBC – Time base cycle utilization by core

The time base cycle utilization records contain the CPU cycles delta and timebase delta for each core for the sample period. There is a row for each core. 

SCYCBC

0

T00000

core id

timebase cycles delta

timebase delta

SCYCBC

1

T00001

0

527393129

527389555

SBUSBCH – bus utilization by chip

The bus utilization by chip records contain the bus utilization percent for each power bus on the system for each chip. There is a row for each chip.  Note that this record is 31 columns and since that doesn’t fit well on this page, we are showing it as continued.

SBUSBCH

0

T00000

chip id

A0 (A) bus utilization percent

A1 (B) bus utilization percent

A2 (C) bus utilization percent

X0 (W) bus utilization percent

X1 (X) bus utilization percent

X2 (Y) bus utilization percent

X3 (Z) bus utilization percent

inbound PHB0 or GX0 bus utilization percent

inbound PHB0 or GX0 bus rate

inbound PHB1 or GX1 bus utilization percent

inbound PHB1 or GX1 bus rate

inbound PHB2 bus utilization percent

inbound PHB2 bus rate

inbound PHB3 bus utilization percent

inbound PHB3 bus rate

outbound PHB0 or GX0 bus utilization percent

outbound PHB0 or GX0 bus rate

outbound PHB1 or GX1 bus utilization percent

outbound PHB1 or GX1 bus rate

outbound PHB2 bus utilization percent

outbound PHB2 bus rate

outbound PHB3 bus utilization percent

outbound PHB3 bus rate

MC0 bus utilization percent

MC1 bus utilization percent

MC2 bus utilization percent

MC3 bus utilization percent

SBUSBCH

1

T00001

0

5

3

0

2

5

0

10

14

5

0

0

0

0

0

0

0

0

10

12

0

0

0

0

0

21

17

0

SCPUBP – CPU utilization by partition

The CPU utilization by partition records contain the CPU utilization for each partition. There is a row for each partition. PowerVP uses this information to calculate the LPAR CPU utilization by adding the capped cycles and uncapped cycles, subtracting the idle cycles, and dividing  the result by the entitled capacity.

SCPUBP

0

T00000

partition id

entitled capacity

capped cycles

uncapped cycles

donated cycles

idle cycles

SCPUBP

5

T00001

6

99.99

3.32

0

0

1.43

SMETRICBP – partition metrics records

The partition metrics record contain some metrics for each partition. There is a row for each partition on the system. Note that this record is 16 columns and since that doesn’t fit well on this page, we are showing it as continued.

SMETRICBP

0

T00000

partition id

performance data version

delta timebase cycles waiting for entitlement

delta number of times waited on entitlement

delta timebase cycles waiting for a physical processor

delta number of times lpar dispatched to run

delta home processor dispatches

delta primary affinity domain dispatches

delta secondary affinity domain dispatches

delta remote dispatches

delta dedicated donate processor dispatches

instruction count delta

timebase cycles delta

SMETRICBP

5

T00001

6

8

0

0

17454

0

66

125

0

0

0

82237456

20216239

The next entries are data records that are written for partition level data each sample interval.

PSTAT – partition configuration information for the partition

The partition configuration information record contains specific configuration information for the partition. This is partition specific information, so there is only one record per sample interval.

PSTAT

0

T00000

partition id

dedicated or shared

capped or uncapped

donate enabled

entitled capacity

active processors in shared pool

partition name

PSTAT

1

T00001

6

shared

uncapped

No

1

7

partition1

PCPI – partition cycles per instruction information

The partition CPI (cycles per instruction) information is retrieved from the PMU. PowerVP collects event counts for the following groups on POWER8: pm_utilization, pm_cpi_stack2, pm_cpi_stack4, pm_cpi_stack15, pm_cpi_stack18, pm_dsource1, pm_dsource4, pm_dsource5, pm_dsource6, pm_dsource7, and pm_dsource8; and for the following groups on POWER7: pm_dlatencies3, pm_cpi_stack1, pm_cpi_stack2, pm_cpi_stack7, pm_dsource1, pm_dsource2, pm_dsource3, pm_dsource4, pm_dsource5, pm_dsource6, pm_psource10, pm_dsource12, and pm_prefetch2. Note that this record is 10 columns and since that doesn’t fit well on this page, we are showing it as continued.

PCPI

0

T00000

group name

event 1 count

event 2 count

event 3 count

event 4 count

event 5 count

event 6 count

PCPI

1

T00057

pm_utilization

5167095267

265248469

208611066

154403910

154404030

265248862

PCPI

2

T00057

pm_cpi_stack2

2895997

85212474

6151185

11442943

73865727

162055910

PCPI

3

T00057

pm_cpi_stack4

20068825

6529613

0

789891

122718306

245905097

PCPI

4

T00057

pm_cpi_stack15

11868

165849

59573

0

85662264

180491237

PCPI

5

T00057

pm_cpi_stack18

230683

4973615

777788

2571942

69545632

145341703

PCPI

6

T00057

pm_dsource1

517490

424380

36668

10103

87454950

178839097

PCPI

7

T00057

pm_dsource4

252

2343

0

0

73672003

159681410

PCPI

8

T00057

pm_dsource5

483212

24381

6

33829

135258799

250535399

PCPI

9

T00057

pm_dsource6

14098

30

7

2453

91641172

193962053

PCPI

10

T00057

pm_dsource7

114283

120053

0

3064

118343117

266909340

PCPI

11

T00057

pm_dsource8

102121038

33487

151260

223914

102121038

196340232

PENET – Ethernet throughput

The Ethernet throughput records show the throughput for each Ethernet interface on the partition.

PENET

0

T00000

name

kilobytes sent

kilobytes received

PENET

1

T00001

ETHLINE

974180

732290

PDISK – disk I/O information

The disk I/O information records show I/O information for each disk on the partition.

PDISK

0

T00000

name

kilobytes read

kilobytes written

PDISK

1

T00002

DD001

104200

207360

PDISK

2

T00002

DD002

0

0

PDISK

3

T00002

DD003

253840

368640

PDISK

4

T00002

DD004

0

0

PCPU – CPU cycles

The CPU cycles records show the CPU used by the partition. This information is the same information as shown in the SCPUBP record, but only for the specific partition.

PCPU

0

T00000

entitled cycles

capped cycles

uncapped cycles

donated cycles

idle cycles

PCPU

1

T00002

299.99

299.99

0

0

293.56

[{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SSXLMV","label":"PowerVP Standard Edition"},"Component":"","Platform":[{"code":"PF002","label":"AIX"},{"code":"PF012","label":"IBM i"},{"code":"PF016","label":"Linux"}],"Version":"1.1.3","Edition":"","Line of Business":{"code":"LOB57","label":"Power"}}]

Document Information

Modified date:
19 December 2019

UID

ibm11128633