Working with Resource Alerts

Overview of Alerts

Alerts are a user-defined type of event that indicates a critical condition in a selected resource. When an alert occurs, it must be acknowledged, and alerts are listed until they are marked as being fixed. You can define escalation schemes to identify the actions that happen if an alert is not fixed within a specified time.
Alerts are logged by the agents and all events throughout the entire deployment are displayed in the Alert Center. To access the Alert Center, select Analyzethen selectEvent Center.
alert center
You can filter the alerts, for example, so that only the most recent ones or the ones of a specified type are displayed. Click the icon to acknowledge an alert. Select the check box next to an alert, and then click Fixed to fix the alert. You can also click on the entry in the Alert Definition column to display the Alert Details page, where you can view details about the alert and mark the alert as fixed (with comments).
alert detail

Defining an Alert

To define an alert, follow these steps:
  1. Select Resourcesthen selectBrowse or use a dashboard portlet to locate the resource for which you want to create an alert. Click on the resource name to view the Detail page for the resource.
  2. On the Resource Detail page, select Alerts to view the alerts for the resource.
  3. Click Configure to view the Alert Configuration page for the resource. Click New to display the New Alert page.
    alert definition
  4. In the Alert Properties area, specify the name and priority for the alert and whether it is active.
  5. In the Condition Set area, specify the conditions that must be met in order for the alert to be triggered. You can specify up to three conditions for each alert. Use these fields to specify the condition that triggers the alert
    Metric
    specifies that the alert is triggered based on the value of a metric that is monitored for the resource. You can specify that the condition is based on comparison to a fixed value, a percentage of a value, or a change in value. If you want to create an alert for a metric that is not listed, you must first enable collection of that metric.
    Update the metric collection settings for the resource type (choose Monitoring Defaults from the Manage page) or for the specific resource (click Metrics on the resource’s Monitor page).
    Inventory Property
    specifies that the alert is triggered based on a change in the value of a resource property (such as version number). This condition is available only for certain types of resources (such as platforms and SAS Metadata Servers).
    Control Action
    specifies that the alert is triggered when an action meets a specified condition (such as the action of stopping a failed server ). This condition is available only for servers that can be controlled through control actions. See Controlling Resources Using Control Actions for more information.
    Events/Logs Level
    specifies that the alert is triggered when a selected string (such as Error) appears in the log.
    Config changed
    specifies that the alert is triggered when a configuration file changes (you can choose to specify the name of the configuration file).
  6. In the Enable Action field, specify whether the alert is triggered only once, or periodically as long as the alert condition persists.
  7. Click OK to define the alert and display the Alert Configuration page.
    alert list
  8. On the Alert Configuration page, you can specify an escalation scheme and identify the users and roles that should be notified when the alert occurs. To create an escalation scheme, see Defining an Escalation Scheme.
  9. Click Return to Alert Definitions when you are finished.

Defining an Escalation Scheme

An escalation scheme is a series of actions that take place when an alert is not acknowledged or fixed within a certain period of time. An escalation scheme can be applied to multiple alerts. You can define an escalation scheme to perform any of these actions:
  • send an e-mail or SMS message
  • make an entry in a system log
  • issue an SNMP notification
To define an escalation scheme, select Managethen selectEscalation Schemes Configuration (in the Server Settings area).
new escalation scheme
For information about the information required when defining an escalation scheme, refer to the Help for the page.

Example: Defining an Alert for SAS Work Directory Space

This example provides information for setting up an alert to be triggered whenever the SAS Work directory reaches 90% of its capacity. The alert should be issued once every two hours until the condition is cleared. When the alert is triggered, users with the Operations role should be notified.
  1. Locate the service SAS Home Directory 9.4 SAS work directory. The service is under the SAS Home Directory 9.4 server.
  2. Navigate to the Resource Detail page for the service. On the Detail page, select Alertthen selectConfigure to display the Alert Configuration page. Click New to display the New Alert Configuration page.
  3. Name the alert, select the priority, and specify that the alert should be active.
    alert name
  4. In the If Condition area, select the Metric radio button, then select Use Percent in the Metric field.
  5. To specify 90% capacity, enter .9 in the absolute value field. To specify that the alert is triggered whenever the used capacity exceeds 90%, specify and select > (Greater than) from the comparison menu.
    alert condition
  6. In the Enable Action(s) field, specify 1 for the number of times the alert is issued, 2 for the timer period, and select hours for the time period units. These values specify that the alert is issued one time every two hours while the alert conditions are met.
    alert timer
  7. Click OK to define the alert and display the Configuration page for the new alert.
  8. Select Notify Roles, and then select Add to List.
  9. Select the check box beside Operations in the Roles list and use the arrow control to move the role to the Add Role Notification list.
  10. Click OK to close the Role Selection page and then Return to Alert Definitions to complete the process of defining the alert.

Example: Defining an Alert for a Log File

This example provides information for setting up an alert to be triggered whenever an I/O Subsystem error appears in the log of the SAS Metadata Server. The alert should be issued every time an error appears in the log.
  1. Locate the server SASMeta – SAS Metadata Server in the Resource page.
  2. Navigate to the Resource Detail page for the server. On the Detail page, select Alertthen selectConfigure to display the Alert Configuration page. Click New to display the New Alert Configuration page
  3. Name the alert, select the priority, and specify that the alert should be active.
    new alert name
  4. In the If Condition area, select the Event/Logs Level radio button, then select Error in the Event/Logs Level field.
    In the match substring field, enter I/O Subsystem. These values specify that an alert is issued whenever the string “I/O Subsystem” appears in an error message in the log
    alert for logs
  5. In the Enable Action(s) area, select the Each time conditions are met radio button. This specifies that the alert is triggered each time an I/O Subsystem error appears in the log.
  6. Click OK to define the alert.

Example: Defining an Alert for Available Memory

This example provides information for setting up an alert to be triggered whenever the free memory on a SAS Web Application Server falls below 90% of 1.3 GB. The alert should be issued once every fifteen minutes until the condition is cleared.
  1. Locate the server in the Resource page. The Resources page lists SAS Web Application Server using the server type SpringSource tc Runtime.
  2. Navigate to the Resource Detail page for the server. On the Detail page, select Alertthen selectConfigure to display the Alert Configuration page. Click New to display the New Alert Configuration page
  3. Name the alert, select the priority, and specify that the alert should be active.
  4. In the If Condition area, select the Metric radio button, then select Heap Memory Free in the Metric field.
  5. To specify the memory condition for the alert, enter 90 in the % field and then select 1.3 GB (Min Value). Select < (Less than) from the comparison menu.
    alert for free memory
  6. In the Enable Action(s) field, specify 1 for the number of times the alert is issued, 15 for the timer period, and select minutes for the time period units. These values specify that the alert is issued one time every 15 minutes while the alert condition is met.
    alert time interval
  7. Click OK to define the alert.