We’d like to apologize to all FME Cloud customers who were affected by incorrect alerts for their instances.
On March 2nd, 2023 Safe Software noticed that incorrect alerts were being triggered for FME Cloud instances for disk space and memory usage. Internal investigation showed that the composite data for disk space was missing and memory data was wrong. We reached out to our service provider for metrics and alerting (Librato) and updated the Safe Software status page to show degraded performance for FME Cloud Dashboards.
Overnight it appeared to recover, so the status was moved to monitoring while we waited for a response or confirmation from Librato.
On March 8th Librato reported that one of their internal services responsible for metrics was occasionally failing to keep up with realtime traffic, and as a result a subset of metrics were being impacted. This is fixed now.
Safe Software has not had any incorrect alerts since March 3rd. In addition to the response from Librato we are confident this issue has been resolved.
The status of FME Cloud Dashboards has returned to operational and the incident is resolved on status.safe.com.