Troubleshooting

Alert not firing despite threshold being exceeded

Cause: The evaluation period has not elapsed, the notification channel is misconfigured, or the alert rule is in a silenced state.Resolution:

Verify the rule’s evaluation period — the condition must persist for the full duration before the alert fires:
Check alert rule configuration
```
monitoring alert rule show <RULE_NAME>
```
Check Monitor Center > Monitoring (Alert Channels, admin view) to confirm the notification channel is active and credentials are valid
Check Monitor Center > Monitoring (Silences, admin view) — confirm no active silence covers the alert:
List active silences
```
monitoring alert silence list --status active
```
Verify the metric has data in the time window — navigate to the Dashboards and check whether the metric panel shows values above the threshold

An alert rule evaluating polystack_compute_cpu_utilization > 90 will only fire if the metric is above 90% for the ENTIRE evaluation period. Brief spikes that resolve within the period will not trigger the alert.

Metrics not appearing in dashboards

Cause: The monitoring agent on the target host is not running, or the host is not registered with Monitoring.Resolution:

Check agent registration

monitoring agent list --status all

Look for hosts with status offline or unknown. If metrics are missing, contact your administrator. They can verify the monitoring agent status through the deployment console.

Log ingestion delayed or missing

Cause: Log collector is not configured for the service, the log file path has changed, or the collector is experiencing a backlog.Resolution:

Navigate to Monitor Center > Logging (Log Sources, admin view) and verify the log source configuration for the affected service
Confirm the file path pattern matches the current log file location
Check the collector queue depth:
Check ingestion queue depth
```
monitoring log ingest-status
```

Log ingestion uses file-based collection. If a service rotates logs to a new path after an update, the collector configuration must be updated to match. Contact your monitoring administrator to update log source configurations. Your administrator can configure this through the deployment console.

Dashboard shows 'No data' for a metric

Cause: The scrape target is down, the agent is offline, or the metric name has changed after a software update.Resolution:

Check the target health: navigate to Monitor Center > Monitoring (Scrape Targets, admin view) and look for targets in DOWN state
Verify the agent is active for that host:
Check agent status
```
monitoring agent list --node <HOSTNAME>
```
Search for the metric to verify it exists and find the correct name:
Search metrics by prefix
```
monitoring metric search --prefix polystack_compute_cpu
```

Alert notifications not delivered

Cause: The notification channel configuration is invalid, credentials have expired, or the destination is temporarily unreachable.Resolution:

Navigate to Monitor Center > Monitoring (Alert Channels, admin view) and use the Test button to send a test notification
If the test fails, review the channel configuration:
Check channel configuration
```
monitoring alert channel show <CHANNEL_NAME>
```
For email channels: verify SMTP credentials and server reachability
For webhook channels: verify the URL is accessible from the Monitoring server
For PagerDuty: verify the integration key has not been rotated

Send a test notification immediately after creating or modifying a channel. Do not rely on a real alert event to discover that a channel is broken.

Issue	First Step
Alert not firing	`monitoring alert rule show <RULE_NAME>`
Agent offline	Contact your administrator to verify agent status via the deployment console
Missing metric	`monitoring metric search --prefix <METRIC_PREFIX>`
Log ingestion backlog	`monitoring log ingest-status`
Channel test	Use Test button in Dashboard or `monitoring alert channel test <NAME>`

Monitoring Admin Guide

Infrastructure-level Monitoring administration and agent configuration

Metrics & Alerts

Review and adjust alert rule configurations

Dashboards

Verify metric availability in dashboard panels

Support

Contact Polystack support for issues requiring platform-level investigation

Core Services

Other Services

Overview

Common Issues

Diagnostics Reference

When to Contact Your Administrator

Next Steps

Monitoring Admin Guide

Metrics & Alerts

Dashboards

Support

Core Services

Other Services

Documentation Index

​Overview

​Common Issues

​Diagnostics Reference

​When to Contact Your Administrator

​Next Steps

Monitoring Admin Guide

Metrics & Alerts

Dashboards

Support

Overview

Common Issues

Diagnostics Reference

When to Contact Your Administrator

Next Steps