Configuration considerations for EventMonitor
service
The IBM® Sterling Control Center
Monitor
EventMonitor
service only runs in the Controller Event Processor (CEP) and is
responsible for passing all events created by all Event Processors (EPs) to the SLC (and
FileAgent
) service.
TIP
When running Control Center Monitor with a single EP, set engine property
BYPASS_EVENT_MONITOR_FOR_EVENTS
to TRUE
. When
BYPASS_EVENT_MONITOR_FOR_EVENTS
is TRUE
, events will go directly
to the SLC, and FileAgent
, service instead of indirectly via the
EventMonitor
service, and contention on the IBM Sterling Control Center
Monitor
EVENTS
database table will be decreased.
Gathering Diagnostic statistics
To know how the EventMonitor
service is performing
look in the Engine log for the metrics it outputs once every
hour.
Example prior to Control Center Monitor v6.1.2.1
|
Example from ICC 6.1.2.1 iFix 01
|
processEvents
was invoked relative to how many times it was invoked without
waiting. When processEvents
is invoked without waiting, it is an indication that
the EventMonitor
service is unable to keep up with the current load of events. And
this may, or may not, be related to poor database performance. processEvents
is invoked matches the times processEvents
is
invoked without waiting it means the EventMonitors
service was not keeping up with
the number of events generated by the system at any point during the hour prior to the metric being
logged. TIP
EventMonitor
is currently behind in getting events to the SLC service, and by how far:SELECT LAST_EVENT_SERIAL_NUM_PROC FROM CC_CONTROLLER
The query result returns the serial number value of the event the
EventMonitor
last processed. It is a unique value assigned by the database as data is inserted into theEVENTS
tableSELECT MAX (SERIAL_NUM) FROM EVENTS
The query result returns the serial number of the last event in
EVENTS
that needs to be processed.
The difference between those two values is approximately how far behind the
EventMonitor
logic is. The values should be close and within a range of thousand or
two.
ACTION_COMPLETED
and
when an event was reconstituted was added. For
example,
|
Since the ACTION_COMPLETED
value represents the time in milliseconds an EP
completed processing of an event, this metric gives a user an approximation of the delay between
when an event was processed by one EP and its delivery to other services, including the
SLCService
, by the CEP. When this delay is long, and SLC schedule tolerances are
small, erroneous SLC events are likely to occur. This maximum value and average should be as small
as possible.
Average time to retrieve event data
Average time to construct notifications
Average time to broadcast notifications
Milliseconds running SQL to retrieve events
Milliseconds broadcasting SCCNotifications
constructing SCCNotifications
When the values for Average time to retrieve event data
or Milliseconds
running SQL to retrieve events
are large – queries should take just a few milliseconds,
poor database performance is causing a negative impact on the performance of the
EventMonitor
service.
When the values for Average time to construct notifications
or
constructing SCCNotifications
are large, CPU limitations might have a negative
impact on the performance of the EventMonitor
service. Note that constructing
notifications also requires database I/O. Increasing the value for the engine property
EVENT_MONITOR_THREADS
, described below, may improve this performance.
When the values for Average time to broadcast notifications
or
Milliseconds broadcasting SCCNotifications
are large – more than a few
milliseconds, it could be an indication that Alert 0
actions are in use. Erroneous
use of Alert 0
actions can be a major source of performance problems in IBM Sterling Control Center
Monitor. See, Rule, Actions and performance issues for more details.