Prime InfraMonitoring

This module monitors any IP-enabled device over a wide range of protocols. It uses Telegraf, whose hundreds of input plugins cover all commonly used protocols. Warning and critical thresholds can be set on the gathered metrics to trigger alarms. Any alarming method can be added through scripting; the most commonly used, and already implemented, are email and browser notifications.
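The threshold idea can be illustrated with a minimal sketch. According to the feature list, the module itself alarms via Prometheus and Alertmanager; the Python below only demonstrates how a warning/critical pair maps a metric value to a state. The names `Threshold` and `evaluate` are hypothetical, and the sketch assumes that higher values are worse.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Threshold:
    """Hypothetical warning/critical pair for one metric (not the module's schema)."""
    warning: float
    critical: float


def evaluate(value: float, threshold: Threshold) -> str:
    """Map a metric value to 'ok', 'warning' or 'critical'.

    Assumption: higher values are worse (e.g. CPU load, link utilisation).
    """
    if value >= threshold.critical:
        return "critical"
    if value >= threshold.warning:
        return "warning"
    return "ok"


# Example: CPU usage in percent with assumed limits of 80 and 95.
cpu = Threshold(warning=80.0, critical=95.0)
states = [evaluate(v, cpu) for v in (42.0, 85.5, 99.0)]
print(states)
```

Checking the critical limit first ensures that a value exceeding both limits is reported with the more severe state.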

Remote pollers can also be used to query metrics from distant devices behind tightly locked-down firewalls. They are automatically reprovisioned when new sensors are created on the main instance.

In addition, devices can be monitored passively via SNMP traps. Traps and their contents can be matched to identify OK, warning, or critical states. OK traps can automatically acknowledge non-OK traps.
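The trap matching and auto-acknowledge behaviour described above could look roughly like the following sketch. It is not the module's implementation: `TrapRule`, `TrapEvent`, `classify`, and `receive` are hypothetical names, rules are assumed to be regular expressions over the trap text, and an OK trap is assumed to acknowledge every open non-OK trap from the same source address.

```python
import re
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class TrapRule:
    """Hypothetical rule: a regex over the trap text and the state it signals."""
    pattern: str
    state: str  # "ok", "warning" or "critical"


@dataclass
class TrapEvent:
    source: str
    state: str
    acknowledged: bool = False


def classify(text: str, rules: List[TrapRule]) -> Optional[str]:
    """Return the state of the first matching rule, or None if nothing matches."""
    for rule in rules:
        if re.search(rule.pattern, text):
            return rule.state
    return None


def receive(source: str, text: str, rules: List[TrapRule],
            events: List[TrapEvent]) -> Optional[TrapEvent]:
    """Classify an incoming trap, store it, and let OK traps acknowledge older ones."""
    state = classify(text, rules)
    if state is None:
        return None  # unmatched traps are ignored in this sketch
    if state == "ok":
        # Assumption: match earlier traps by source address only.
        for earlier in events:
            if earlier.source == source and earlier.state != "ok":
                earlier.acknowledged = True
    event = TrapEvent(source, state)
    events.append(event)
    return event


rules = [TrapRule(r"linkDown", "critical"), TrapRule(r"linkUp", "ok")]
events: List[TrapEvent] = []
receive("10.0.0.1", "IF-MIB::linkDown ifIndex=3", rules, events)
receive("10.0.0.2", "IF-MIB::linkDown ifIndex=1", rules, events)
receive("10.0.0.1", "IF-MIB::linkUp ifIndex=3", rules, events)
print([(e.source, e.state, e.acknowledged) for e in events])
```

In this example only the `linkDown` trap from 10.0.0.1 is acknowledged; the one from 10.0.0.2 stays open because no OK trap arrived from that address. A real setup would likely also compare trap contents such as the interface index.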

Sensors and traps are both visualized on a geographical map and a logical map.

1.1. Content

| Feature | State | Details |
|---|---|---|
| Monitor metrics using a multitude of protocols | | using Telegraf |
| Alarm based on gathered metrics | | via Prometheus and Alertmanager |
| Retrieve only a subset of metrics | | e.g. monitor only one interface of the SNMP interface MIB |
| Get alarm notifications | | email / browser notifications, extendable via scripting |
| Remote pollers (retrieve metrics from other instances) | | for tightly locked-down firewalls |
| Receive SNMP traps, evaluate their contents and alarm on matches | | |
| Receive SNMP traps via remote trap receivers | | |
| Display state of device on geographical and logical maps | | |
| Acknowledge sensor alarms / SNMP traps | | |
| Display list of historical alarms | | |
| Locate devices and maps corresponding to alarms | | |
| Create new netelements based on templates, automatically adding corresponding sensors and thresholds | | |
| Adjust default metric visualization | | via Grafana dashboards |
| Duplicate netelements including their sensors and thresholds | | e.g. adding a device of the same type with a different IP address |
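
The template and duplication features in the list above can be pictured with a small data-model sketch. This is an illustration under assumptions, not the module's actual schema: `Sensor`, `NetElement`, `from_template`, and `duplicate` are hypothetical names, and thresholds are assumed to be stored per sensor.

```python
import copy
from dataclasses import dataclass, field
from typing import List


@dataclass
class Sensor:
    """Hypothetical sensor with its warning/critical thresholds."""
    name: str
    warning: float
    critical: float


@dataclass
class NetElement:
    """Hypothetical netelement: a monitored device plus its sensors."""
    name: str
    ip: str
    sensors: List[Sensor] = field(default_factory=list)


def from_template(template: List[Sensor], name: str, ip: str) -> NetElement:
    """Create a netelement whose sensors are independent copies of the template's."""
    return NetElement(name=name, ip=ip, sensors=copy.deepcopy(template))


def duplicate(element: NetElement, name: str, ip: str) -> NetElement:
    """Clone a netelement with all sensors and thresholds under a new name and IP."""
    clone = copy.deepcopy(element)
    clone.name = name
    clone.ip = ip
    return clone


switch_template = [Sensor("cpu_percent", 80.0, 95.0), Sensor("temperature_c", 60.0, 75.0)]
sw1 = from_template(switch_template, "switch-01", "192.0.2.10")
sw2 = duplicate(sw1, "switch-02", "192.0.2.11")

# Changing a threshold on the copy leaves the original untouched.
sw2.sensors[0].warning = 70.0
print(sw1.sensors[0].warning, sw2.sensors[0].warning)
```

Deep copies are used so that later threshold changes on one device do not silently affect the template or the device it was duplicated from.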