Threshold Manager
The data types that display when you click the Data Source Type list box are for the types of monitoring agents and data collectors that are installed in your managed environment. Select the data type for which you want to create or view thresholds.
- New opens the Threshold Editor for defining a threshold for the selected data type.
- Select a threshold and click Edit to open the Threshold Editor for editing the definition.
- Select a threshold that you no longer want and click Delete. After you confirm that you want to delete the threshold, it is removed from the list and from any resource groups that it was assigned to. Any open events for the threshold are closed.
- For a long list, you can click inside the filter text box and type the beginning of the value to filter by. As you type, the rows that do not fit the criteria are filtered out. To clear the filter, click the in the filter box or press the Backspace key.
For more information about the predefined thresholds and custom thresholds that are displayed in the table and the significance of the resource group assignment (or lack thereof), see Background information. For a quick hands-on lesson, see Tutorial: Defining a threshold.
Threshold Editor
After you click New or select a threshold and click Edit, the Threshold Editor is displayed with the following fields:- Name
- Enter a unique name for the threshold. The name must begin with a letter and can be up to 31
letters, numbers, and underscores, such as
Average_Processor_Speed_Warning
. The threshold name is displayed in the Application Performance Dashboard Events tab and in certain dashboard tables. - Description
- Optional. A description is useful for recording the purpose of the threshold that users can see in the Threshold Manager.
- Severity
- Select the appropriate event severity from the list: Fatal, Critical, Minor, Warning, or Unknown.
- Forward EIF Event?
- If you configured event forwarding in the Event Manager), open events are forwarded by default to the event destinations that you configured, for example, EIF event targets, or to Cloud Event Management, or Alert Notification. Change the setting to No if you do not want to forward events for this threshold to any event destinations. page (
- Interval
- Enter or select the time to wait between taking data samples in HHMMSS
format, such as 00 15 00 for 15 minutes. For sampled-event thresholds, the minimum interval is
000030 (30 seconds) and the maximum is 235959 (23 hours, 59 minutes, and 59 seconds).
A value of 000000 (six zeros) indicates a pure event threshold. Pure events are unsolicited notifications. Thresholds for pure events have no sampling interval, thus they have no constant metric that can be monitored for current values. Pure events are closed after 24 hours or as set in the Advanced Configuration page Pure Event Close Time field in category Event Manager.
- Required consecutive samples
- Specify how many consecutive threshold samples must evaluate to true before an event is generated: For any threshold with a setting of 1 and a sample that evaluates to true, an event is generated immediately; a setting of 2 means that two consecutive threshold samples must evaluate to true before an event is opened.
- Data set
- Select the data set (attribute group) for the type of data to be sampled. The attributes that
are available for inclusion in the condition are from the chosen data set. If the threshold has
multiple conditions, they must all be from the same data set.
To get a short description of a data set, hover the mouse over the name. You can get the complete description of the data set and attributes by clicking the
Learn more
link in the hover help. You can also click or in the navigation bar, and open the help or download the reference for the monitoring agent.Some agents are categorized as multi-node agents, which have subnodes for monitoring multiple agent resources. A multi-node agent might have data sets that can be used in a threshold but any events opened for the threshold do not display in the Application Performance Dashboard. A message notifies you of the limitation. Such events can be forwarded to the IBM® Netcool/OMNIbus event manager.
- Display item
- Optional. For multiple row data sets only. After a row evaluation causes an event to open, no more events can be opened for this threshold on the monitored system until the event is closed. By selecting a display item, you enable the threshold to continue evaluating the other rows in the data sampling and open more events if other rows qualify. As well, the display item is shown in the Events tab of the Application Performance Dashboard so that you can easily distinguish among the rows for which events were opened. The list contains only the attributes that you can designate as display items.
- Logical Operator
- Ignore this field if your threshold has only one condition. If you are measuring multiple
conditions, select one of the following operators before you click
New to add a second or third (or more) condition:
- And (&) if the previous condition and the next condition must be met for the threshold to be breached
- Or (|) if either of them can be met for the threshold to be breached
A mix of logical operators is not supported; use either all And operators or all Or operators. The threshold can have up to nine conditions when the Or operator is used; up to 10 conditions when the And operator is used.
If you are using the Missing function (described later in the Operator section), you can use only the And operator in the formula.
- Conditions
- The threshold definition can logically include multiple simultaneous thresholds or conditions.
- Group assignment
- Assign a resource group to distribute the threshold to the managed systems of the same type
within the resource group. The resource groups that are available are the user
defined groups that you have Modify permission for and the system groups (for the agent type) that
you have View permission for. The available system groups are also limited to those that are
suitable for the chosen data set.
A threshold with no group assigned is distributed to no monitored systems and remains stopped until it is distributed to a resource group.
A system group, such as Linux OS or HTTP Server, distributes the threshold to all managed systems where that agent is installed. By default, every predefined threshold is assigned to the system group for that agent. (You can disable all predefined thresholds in the Advanced Configuration page, as described in Thresholds Enablement.)
The exception is managed systems from the IBM Tivoli® Monitoring domain: Managed systems from the Tivoli Monitoring domain must be monitored with situations that were distributed in your Tivoli Monitoring environment.
To assign groups to the threshold, select the check box of one or more resource groups. If the list of assigned groups is long, you can select Show only selected groups.
If you do not see a resource group that you want to assign the threshold to, you can save the threshold definition, and click OK when prompted to confirm that you want to save the threshold without assigning it to a group. You can then create a new group in the Resource Group Manager, and assign a threshold to the new group in the Resource Group Editor. For more information, see Resource Group Manager.
- Execute command
- After an event is opened for a threshold that evaluates to
true, you can have a command or script of commands run automatically. For example, you might want to
log information, trigger an audible beep, or stop a job that is overusing resources when an event is
opened. The command or script is run on the system of the monitoring agent that opened the event.The command uses the following syntax:
where data_set is the data set name and attribute is the attribute name as shown in the Threshold Editor. If the data set or attribute name contains a space, replace with an underscore. The data_set must be the same data set that you select in the Data set selection field.&{data_set.attribute}
The following example shows how you can pass the disk name parameter to your managed resource:
You can pass in one or more attributes from the data set. If specified, multiple attributes are passed into the command in order ($1, $2, and so on)./scripts/clean_logs.sh &{KLZ_Disk.Disk_Name}
You must ensure the script or programs executed by the command are installed on the agent system since Cloud APM does not provide a mechanism to distribute scripts or programs. The command runs from the command line with the same user account that the agent was started with. Ensure the user that starts the agent has permission to execute the command. For example, if the agent is running as root, then root runs the command on the managed system.
The following options control how often the command is run:- Select On first event only if the data set returns multiple rows and you want to run the command for only the first event occurrence in the data sample. Clear the check box to run the command for every row that causes an event.
- Select For every consecutive true interval to run the command every time the threshold evaluates to true. Clear the check box to run the command when the threshold is true, but not again until the threshold evaluates to false, followed by another true evaluation in a subsequent interval.