Alerts and Actions

Un= derstanding Alerts

When a problem occurs at a datacenter, Application, or SL= A, the Monitoring Station can send alerts to users. Alerts are notification= s that inform users who are configured to receive alerts of the problem. Th= e notification message contains the following information:

the type of notification, either Problem or Recovery
the date and time when the problem occurred
the name of the host on which the problem occurred
the status of the host (see Understanding the Status of Serv= ices for more information)
the name of the service that is experiencing the probl= em
the current state of the service
any output from the monitor

Whenever the status of an Element changes (for example, f= rom Critical to Warning), Uptime Infrastructure Monitor sends an alert.

You can also configure alert esca= lations that occur if a warning is sent and is not acted upon. For exa= mple, if an alert is sent to a system administrator, and the administrator = does not attend to the problem within a specified amount of time, then the = alert is sent to the administrator=E2=80=99s manager.

Uptime Infrastructure Monitor can send alert to a phone, = pager, or one or more email addresses.

The following is a sample email alert:

Notific=
ation type: Problem  1/12/2008 10:52
Host: filter
Host State: N/A
Service: FS Capacity - Filter
Service State: WARN/
Output: /var is 92% full

The following is a sample pager alert:

subject=
:  CRIT Alert
content:
5/7/2005 13:22
Type: Problem
Service: FTP (CRIT)
Host: filter (CRIT)

Un= derstanding the Alert Flow

Alerts in Uptime Infrastructure Monitor follow a specific= flow. When Uptime Infrastructure Monitor detects a problem with a host, it= issues an alert. Uptime Infrastructure Monitor then continues to check the= host at specific intervals and reports on the status of the host.

Considering the following example:

Uptime Infrastructure Monitor checks the host system e= very 15 minutes
alerts are sent continually every check interval until= Uptime Infrastructure Monitor detects a change in the state of the host sy= stem
whenever an error is encountered, Uptime Infrastructur= e Monitor rechecks the system every minute
if all rechecks up to the maximum number of rechecks f= ails, Uptime Infrastructure Monitor issues an alert

Uptime Infrastructure Monitor encounters a critical error= on a host. Uptime Infrastructure Monitor performs three rechecks at one mi= nute intervals=E2=80=93all of which return a critical error=E2=80=93an= d then sends an alert after the third recheck.

Uptime Infrastructure Monitor then checks the host every = two hours. While Uptime Infrastructure Monitor encounters two critical erro= rs, it does not send an alert. Then, the status of the host changes from Cr= itical to Warning. When this change is detected, Uptime Infrastructure Moni= tor sends an alert informing recipients of the change in status. When the s= tatus of the host changes to OK, Uptime Infrastructure Monitor issues an al= ert informing recipients that the host has recovered.

This alert flow is illustrated in the following diagram:<= /p>

=3D""

All service monitors have a common set of Monitor Alert Settings that configure aspects of the alert flow.

Alert Profiles=

Alert Profiles are templates that tell Uptime Infrastruct= ure Monitor how to react to various alerts that are generated by service ch= ecks. Alert Profiles enable Uptime Infrastructure Monitor to execute a seri= es of actions in response to the failure of a service check or when a thres= hold is exceeded. The following diagram illustrates how an Alert Profile wo= rks:

3D""

An Alert Profile can send an alert via email, or to a pag= er or a cell phone. You can configure any or all of these actions to occur = simultaneously by associating the Alert Profile to multiple Notification Gr= oups. For example, if a Web server process stops responding, both the syste= m administrator and Web server administrator can be notified.

Custom Alert Formats and Alert Scripts

Alert Profiles include standard message templates for ema= ils and pagers, which are well suited for most alerting needs. However, you= can customize the format of the alert using predefined variables. When cre= ating or configuring an Alert Profile, selecting the Custom Format<= /strong> option provides you with a template to modify, and override the me= ssage template for the alert type you have selected:

See Custom Alert Message Variables for more information.

In addition to sending alert messages, Uptime Infrastruct= ure Monitor can also execute an alert script. When an outage occurs, the sc= ript is run on the Monitoring Station, once for each user who receives noti= fication. Like custom alert messages, alert scripts use predefined variable= s to represent outage-specific information; these variables are passed to t= he script at the time of the outage.

For information on alert script variables, see Script Alert Variables.= For more information on alert scripts, see the IDERA Knowledge Base articl= e, Creating Custom Alert Scr= ipts in Uptime Infrastructure Monitor Alert Profiles.

Creati= ng Alert Profiles

To create Alert Profiles, do the following:

On the Uptime Infrastructure Monitor tool bar, click= Services.

In the tree panel, click Add Alert Profile.
The Add Alert Profile window appears.

Type a descriptive name for the profile in the Name of Alert Profile field.

In the Start alerting on notification number= field, enter the number of times an error must occur before Uptim= e Infrastructure Monitor sends an alert notification.

Enter the number of times to re-send the notificatio= n in the End alerting on notification number field.
You can also select the Never Stop Notifying check box to have Uptime Infrastructure Monitor send notifications i= ndefinitely.

Select one or more of the following notification opt= ions:

Email Alert
Sends the alert to the email addresses of the members of a Notification Gr= oup.

Pager Alert
Sends the alert to the pagers of the members of a Notification Group.
<= /li>
Script Alert
Executes an alert script on the Monitoring Station, once for each user who= receives notification of the alert.
Because this alert option relies on a script or batch file, enter its name= and path in the Script Path field (for example, on Linux,= /usr/local/uptime/scripts/scriptAlert.sh).

If you are using an email or pager notification, and want to use a cust= om message instead of the standard template, click the Custom Forma= t check box to begin creating a custom alert message. Use the foll= owing steps to create the message: =20

To expedite message creation, select a Short Template,= Medium Template, or Long Template, then = click Fill.

Optionally modify the alert subject header.

Optionally modify the alert message body.

For information on custom alert message variables, see Custom Alert Message Variables= .

Select one or more Notification Groups that receive th= e notifications.

Optionally attach this alert profile to one or more existing Se= rvice Monitors.

Click Save.

Viewing= Alert Profiles

To view Alert Profiles, do the following:

On the Uptime Infrastructure Monitor tool bar, click= Services.

In the tree panel, click View Alert Pro= files.
The Alert Profiles subpanel appears. The subpanel display= s the settings that you configured when you created the profile, as well as= a list of the services that are attached to the profile.

To test whether the profile sends alerts, click = Test Alert Profile.
A popup window appears, and the alert is sent using the notification metho= d (email, pager, or script) that is specified in the profile. The fol= lowing is an example of an email alert:
Notification type: Problem= 27/4/2006 09:19
Host: Test Host (OK)
Ser= vice: Test Monitor
Service State: OK
Outp= ut: This is a test notification; please ignore.
When the alert is sent, the message Alert Profile Tested appe= ars in the popup window. If an error message appears in the popup window, e= dit the profile and test it again.

Editing= Alert Profiles

To edit Alert Profiles, do the following:

On the Uptime Infrastructure Monitor tool bar, click= Services.

In the tree panel, click View Alert Profiles= .

Click the Edit Alert Profile icon b= eside the name of the profile that you want to edit.
The Edit Alert Profile window appears.

Edit the Alert Profile fields as de= scribed in the section, Creating Alert Profiles.

Associating Alert Profiles to Elements

You can associate an Alert Profile to any Service Monitor= , Application, or SLA if their state changes from OK to Warning or Critical= . Alert Profiles are normally associated with any of these monitored items = at the time of their configuration; you can also modify Alert Profile assoc= iations using existing service monitor definitions.

See Usin= g Service Monitors, Working with Applica= tions, and Adding and Editing SLA Definit= ions for more information about configuring Service Monitors, Applicati= ons, and SLAs, respectively.

Action Profil= es

Action Profiles are templates that direct Uptime Infrastr= ucture Monitor when it encounters a problem on a monitored system. You can = associate an Action Profile to any Service Monitor, Application, or SLA if = their state changes from OK to Warning or Critical. Action Profiles are nor= mally associated with any of these monitored Elements at the time of their = configuration; Action Profile associations can also be changed when you are= modifying existing service monitor definitions.

See Usin= g Service Monitors, Working with Applica= tions, and Adding and Editing SLA Definit= ions for more information about configuring Service Monitors, Applicati= ons, and SLAs, respectively.

Actions include one of the following tasks:

write an entry to a log file

run a recovery script that can reboot a non-responsive= server; or restart an application, process, or service

stop, start, or restart a Windows server

initiate a VMware vCenter Orchestrator workflow

send an SNMP trap to a specific trap host and trap com= munity

As templates, Action Profiles can be reused for any numbe= r of Service Monitor configurations. This means you can create a series of = them as standard actions used to respond to typical types of problems you m= ay encounter, depending on what role a Service Monitor is playing (for exam= ple, availability or performance).

VMware vCenter Orchestrator Workflow Actions

If an administrator has integrated Uptime Infrastructure = Monitor with VMware vCenter Orchestrator (see VMware vCenter Orchestrato= r Integration), you can configure Action Profiles to initiate Orchestra= tor workflows.

Orchestrator is a VMware vCenter Server add-on that allow= s its administrators to create workflows that automate vCenter management t= asks. These Orchestrator workflows are open ended: all vCenter actions are = available for automation through the processing of parameters and runtime a= rguments. Uptime Infrastructure Monitor Action Profiles can be configured t= o provide input parameters to specific workflows, thus integrating vCenter = management with Uptime Infrastructure Monitor=E2=80=99s monitoring and aler= ting capabilities.

For example, if Uptime Infrastructure Monitor is monitori= ng memory, CPU, and hard disk use for a virtualized server, the passing of = performance thresholds can trigger an Action Profile that, in turn, trigger= s an Orchestrator workflow that creates a new virtual machine to alleviate = resource strain. In a converse example, if Uptime Infrastructure Monitor is= monitoring a virtualized server for long periods of inactivity, a triggere= d Action Profile can initiate an Orchestrator workflow that shuts down the = instance to free up resources.

By tightly integrating Uptime Infrastructure Monitor=E2= =80=99s monitoring and alerting with VMware vCenter Orchestrator=E2=80=99s = automated virtual environment administration, you can accelerate your organ= ization=E2=80=99s reaction time with virtual systems management, and map es= tablished policies to automated actions.

When configuring Action Profiles, Uptime Infrastructure M= onitor communicates with Orchestrator and dynamically produces a list of al= l available workflows. (This includes any third-party workflow packages tha= t are installed on the Orchestrator server, including the Uptime Infrastruc= ture Monitor Orchestrator package.)

When a workflow is selected, and the Get Paramete= rs button is clicked, the corresponding input parameter fields are= dynamically displayed, allowing you to specify parameter values required t= o completely configure the workflow for execution should an Uptime Infrastr= ucture Monitor alert initiate it.

Orchestrator Input Parameter Variables

When configuring a VMware vCenter Orchestrator workflow, = you have at your disposal a set of Uptime Infrastructure Monitor-specific v= ariables that can be entered as parameter variables, and whose ensuing runt= ime values are passed to the Orchestrator workflow during execution. The va= riables available to you are those that are used when creating a custom ale= rt format. See = Custom Alert Message Variables for information.

SNMP Trap Ac= tions

You can also configure an Action Profile to send an SNMP = trap to a particular host. An SNMP trap is a notification issued by a syste= m that is running SNMP when a problem occurs. The host to which the SNMP tr= ap is sent must be running an SNMP trap listener.
If you use SNMP traps,= the trap message is sent in the format specified by the Uptime Infrastruct= ure Monitor MIB. This MIB is found in the /scripts directory. = The Uptime Infrastructure Monitor enterprise OID is .1.3.6.1.4.1.242= 16.

Creat= ing Action Profiles

To create Action Profiles, do the following:<= /p>

On the Uptime Infrastructure Monitor tool bar, click Services.

In the tree panel, click Add Action Profile.
The Add Action Profile window appears.

Enter a name for this profile in the Name of Action Profile field.

Specify the number of times an error must occur before Uptime Infrastru= cture Monitor sends a notification in the Start action on notificat= ion number field.

Specify the number of times actions are carried out in the End = action on notification number field.
Optionally, select the Never Stop Notifying option to con= tinually carry out the action in this profile until the problem is resolved= .

If VMware vCenter Orchestrator integration is enabled, and you want the= Action Profile to drive an Orchestrator workflow, use the following steps:=

In the Select Workflow field, input a workflow to conf= igure.
You can either scroll through and select the workflow from the drop-down l= ist, or begin typing the workflow=E2=80=99s name.

Click Get Parameters.
Uptime Infrastructure Monitor retrieves information from the Orchestrator = server and dynamically display configuration fields for the chosen workflow= =E2=80=99s input parameters.

Configure the input parameter fields for the workflow.
For information on the specific configuration parameters available for the= chosen workflow, consult the appropriate developer=E2=80=99s documentation= .

If you want the Action Profile to write to a log, in the Log Fi= le field, enter the name and path to a log file on the Monitoring = Station to which error information is written.

If you want the Action Profile to run a recovery script, in the Recovery Script field, enter the name and path to a script that r= eboots a server, or restart an application, process, or service.
The recovery script also has the following information appended to it:

the date and time on which the error occurred

the type of error notification that was sent

the name of the host on which the error occurred

the state of the host

the name of the service that threw the error

the state of the service

the output that was generated by the error
for example:
"/usr/local/uptime/recover.sh" "24/12/2014 5:01:05" = "Problem" "printserver" "null" "WinSrv-Print Spooler" "CRIT/threshold error= " "servicestatus: Not Running does not match Running (Service 'Print Spoole= r' found, status: Not Running, took 12ms)"

For information on predefined variables that can be used in Action = Profile scripts, see

You can also use the recovery script to file trouble tickets with a syst= em like Remedy, or to interact with third-party software packages.

If you are setting up an Action Profile for a Windows server, you can a= lso leave the Windows Service as Agent, and complete the following fields:<= br>

Windows Host
The name of the host on which the service is running.

Enter $HOSTNAME$ in this field to create a dynamic hostname= . For failing services that call this Action Profile, the corresponding hos= tname is used when this action runs.

You can use this dynamic hostname in conjunction with service groups, wh= ere an issue can originate from one of many hosts.

Agent Port
The port on which the Uptime Infrastructure Monitor agent that is installe= d on the system is listening. The default is 9998.

Use SSL
Select this option if Uptime Infrastructure Monitor securely communicates = with the host using SSL (Secure Sockets Layer).

Agent Password
Enter the password that is required to access the agent that is running on= the monitored system. For information on setting the agent password, see t= he Uptime Infrastructure Monitor Knowledge Base article, What is the password for the Windows agent?

Windows Service
The display name of the specific Windows service to which the Action Profi= le applies. The display name of a service appears in the Name column of the Services Control Panel, or in the Description column of the Windows Task Manager Services<= /strong> tab.

The service display name must be entered verbatim, including spaces, oth= erwise it is not correctly processed. Double-clicking a service name in the= Services Control Panel opens a properties window where yo= u can highlight and copy the service Display name.

Action
Select one of the following actions:

None

Start

Stop

Restart

If you are setting up an Action Profile for a Windows server that is us= ing a WMI implementation, you can also select the Windows Service as WMI, a= nd complete the following fields:

WMI Host:
The name of the host on which the service is running.

Windows Domain:
The Windows domain in which WMI is implemented.

Username:
The name of the account with access to WMI on the Windows domain.

Password:
The password for the account with access to WMI on the windows domain.
Windows Service
The display name of the specific Windows service to which the Action Profi= le applies. The display name of a service appears in the Name column of the Services Control Panel, or in the Description column of the Windows Task Manager Services<= /strong> tab.

The service display name must be entered verbatim, including spaces, oth= erwise it is not correctly processed. Double-clicking a service name in the= Services Control Panel opens a properties window where yo= u can highlight and copy the service Display name.

Action
Select one of the following actions:

None

Start

Stop

Restart

If you want to send SNMP traps to a particular host, complete the follo= wing fields:

SNMP Trap Host
The name of the host that monitors SNMP traps.

SNMP Trap Port
The port number on the trap host to which the SNMP trap is sent.

SNMP Trap Community
The name which acts as a password for sending trap notifications to the tr= ap host.

SNMP Trap OID (optional)
The object identifier (OID) that identifies the SNMP trap - for example, <= code>.1.3.6.1.2.1.34.4.1.7.

If Splunk integration is enabled, and you want the Action Profile to wr= ite to the Splunk log, complete the following fields:

Splunk Hostname
The host name of the server on which Splunk is running.

Logging Port
The port on which the Splunk server is listening for logging requests. Thi= s port is configured in Splunk, and you must contact the Splunk administrat= or for this information.

Click the Use SSL option to securely access the Splunk server using SSL= .

For more information on Splunk integration, see Splunk Settings.

Optionally attach this alert profile to one or more existing Service Mo= nitors.

Click Save.

Viewin= g Action Profiles

To view Action Profiles, do the following:

On the Uptime Infrastructure Monitor tool bar, click= Services.

In the Tree panel, c= lick View Action Profiles.
The Action Profiles subpanel appears, disp= laying the settings that you configured when you created the profile, as we= ll as a list of the services that are attached to the profile.

To test whether the profile works, click the Test Action Profile button.
A popup window appears, and the Monitoring Station tries to carry out the = action defined in the profile. When the action is completed, the message Action Profile tested appears in the popup windo= w.
If an error message appears in the popup window, edit the profile and test= it again.

Editin= g Action Profiles

To edit Action Profiles, do the following:

On the Uptime Infrastructure Monitor tool bar, click= Services.

In the tree panel, click View Action Profile= s.

Click the Edit Action Profile icon = beside the name of the profile that you want to edit.
The Edit Action Profile window appears.

Edit the Action Profile fields as described in the s= ection Creating Action= Profiles.

Monitoring= Periods

Monitoring Periods are the times over which a service mon= itor is actively monitoring a host. The Monitoring Periods also apply to th= e times when Uptime Infrastructure Monitor sends alerts

Uptime Infrastructure Monitor comes with the following Mo= nitoring Periods:

24x7 =E2=80=93 Monitoring is performed 24 hours a day,= seven days a week.

9am to 5pm weekdays =E2=80=93 Monitoring is performed = from 9 a.m. to 5 p.m., Monday to Friday.

Never =E2=80=93 No monitoring is carried out.

You can add Monitoring Periods that suit your needs. For = example, you can create a Monitoring Period called " Weekends" that only mo= nitors a host from 12:00 a.m. on Saturday to 11:59 p.m. on Sunday.

Addi= ng Monitoring Periods

To add Monitoring Periods, do the following:<= /p>

On the Uptime Infrastructure Monitor tool ba= r, click Services.
In the tree panel, click Add Monitoring Peri= od.
The Add Monitoring Periods window appears.
Type a name in the Monitoring Period Name field.

In the Definition section, enter on= e or more time period expressions that combine to create a full Monitoring = Period definition.
See Time Period Definitions for information on the = types of time period expressions that are valid in Uptime Infrastructure Mo= nitor.

Click Save.