Installation Health Event Monitor Reference Guide

Installation Health Event Monitor

Checks the overall health of the system where FrameFlow is installed.

Overview

The Installation Health Event Monitor watches the health of the system where your FrameFlow instance is running. It can check system health metrics like disk, ping, CPU, and bandwidth. It can also alert on monitoring actions that are taking excessive time. This monitor is recommended for your main console and, in multi-site configurations, for all remote nodes.

Use Cases

  • Maintaining the health of your FrameFlow installation
  • Alerting about monitoring actions that take longer than expected to complete

Monitoring Options

This event monitor provides the following options:

Alert with [Info/Warning/Error/Critical] if monitoring queue items take a long time to complete

Use this option to receive alerts when any item in your monitoring queue takes longer than usual to complete.

Alert with [Info/Warning/Error/Critical] if the update service is not running (remote node only)

This option will alert you if the update service for your remote nodes is found to be stopped.

Alert with [Info/Warning/Error/Critical] if posting data to the master console takes too long to complete (remote node only)

Enable this option to receive an alert if posting to your master console takes too long.

Alert based on the percentage of memory used

Use this option to receive alerts on used memory.

Alert based on the percentage of disk space available

This option will alert you based on the percentage of disk space still available.

Drives to ignore

Enter the names of the drives to ignore, separated by commas (e.g. "C, D"). To see the drives, leave this field blank and run the event monitor once. Check the resulting event history record to see the name of each drive.

Alert if the average CPU usage is greater than a specified percentage

Enter percentages that, when exceeded, will trigger each desired level of alert.

Alert if the average CPU usage on an individual core is greater than a specified percentage

This option will alert you if the CPU usage on a single core exceeds the specified percentages.

Alert if the incoming bandwidth on any interface exceeds a specified rate

This will alert you if the incoming bandwidth on any single interface exceeds the rates you specify.

Alert if the outgoing bandwidth on any interface exceeds a specified rate

Use this option to receive notification if the outgoing bandwidth on a single interface exceeds a specified rate.

Alert if the total bandwidth on any interface exceeds a specified rate

This option will alert if the total bandwidth on any interface exceeds a specified rate.

Interfaces to ignore

Enter the names of the interfaces to ignore, separated by commas (e.g. "Realtek,Marvell"). Note that interface names can be a partial match. To see all interface names, leave this field blank and run the event monitor once. Check the resulting event history record to see the name of each interface that was detected.

Authentication and Security

This event monitor does not require authentication.

Data Points

Data Point Description
CPU Usage Detected CPU usage in percentage.
Incoming/Outgoing/Total Bandwidth Detected bandwidth rates.
Number of Checks in Progress The number of checks occurring at the time the event monitor ran.
Number of Checks Pending The number of pending checks detected when the event monitor ran.
Space Available The amount of space left available.
Used Memory % The amount of memory currently used, expressed as a percent.

Sample Output

Tutorial

To view the tutorial for this event monitor, click here.

Back to Library

Comments

There are no user-contributed comments for this page. Be the first to submit a comment!

Add a comment