Developer & DevOps
Apr 10, 16:59 EDT Resolved - Between approximately 9:42 AM and 10:38 AM ET on April 10, we observed delivery failures for customers in the US1, AP1, and AP2 regions. During this window, notifications to the following integrations were delayed or temporarily undelivered: PagerDuty, Slack, Microsoft Teams, Webhooks, Jira, ServiceNow, OpsGenie, VictorOps, BigPanda, Zendesk, and Sumologic. All delayed notifications have been replayed with the following exceptions: PagerDuty pages and Slack no...
Mar 27, 15:30 EDT Resolved - This incident has been resolved. Mar 27, 15:21 EDT Monitoring - A fix has been implemented and we are monitoring the results. Mar 27, 15:17 EDT Identified - The issue has been identified and a fix is being implemented. Mar 27, 14:57 EDT Investigating - We are investigating increased latency in processing and storing Traces in APM. As a result of this issue, some users may see missing or delayed traces in APM Trace Search since 5pm UTC. They may also experience delay ...
Mar 26, 15:56 EDT Resolved - This incident has been resolved. Mar 26, 15:42 EDT Monitoring - A fix has been implemented and we are monitoring the results. Mar 26, 15:07 EDT Update - We are continuing to work on a fix for this issue. Mar 26, 14:38 EDT Update - We are continuing to work on a fix for this issue. Mar 26, 13:28 EDT Update - We are continuing to work on a fix for this issue. Mar 26, 12:48 EDT Identified - The issue has been identified and a fix is being implemented. Mar 26, 12:11 EDT ...
Mar 18, 22:53 EDT Resolved - This incident has been resolved. Mar 18, 22:39 EDT Monitoring - A fix has been implemented and we are monitoring the results. Mar 18, 22:04 EDT Update - We are continuing to investigate the issue and will provide updates as available. Mar 18, 21:27 EDT Update - We are continuing to investigate the issue and will provide updates as available. Mar 18, 20:54 EDT Investigating - We are investigating an issue causing some metric monitors to intermittently report "No Data"...
Mar 13, 06:30 EDT Resolved - This incident has been resolved. Mar 13, 06:05 EDT Monitoring - A fix has been implemented and we are monitoring the results. Mar 13, 06:00 EDT Identified - The issue has been identified and a fix is being implemented. Mar 13, 05:52 EDT Update - We are continuing to investigate this issue. Mar 13, 05:32 EDT Update - We are continuing to investigate this issue. Mar 13, 05:27 EDT Update - We are continuing to investigate this issue. Mar 13, 05:26 EDT Investigating - We...
Feb 24, 02:08 EST Resolved - This incident has been resolved. Feb 24, 00:52 EST Monitoring - We have deployed a fix and we are monitoring the results. We will provide another update once the issue is fully resolved. Feb 24, 00:29 EST Identified - We have identified the underlying issue and are working on a fix. It is important to note that no data has been lost, and it will be backfilled and available once the service is operational again. Feb 23, 23:31 EST Investigating - We are investigating i...
Feb 18, 15:37 EST Resolved - This incident has been resolved. Feb 18, 14:30 EST Monitoring - We are investigating delays in a subset of Distribution Metrics and Monitor Evaluations, which began at 18:33 UTC.
Feb 11, 21:08 EST Resolved - This incident has been resolved. Feb 11, 20:49 EST Update - The remaining impact is limited to AWS Streams metrics. We are continuing to monitor for any further issues. Feb 11, 20:32 EST Monitoring - A fix has been implemented and we are monitoring the results. Feb 11, 20:20 EST Identified - The issue has been identified and a fix is being implemented. Feb 11, 19:57 EST Investigating - We are investigating increased latency processing Traces in APM Trace Search and A...
Feb 11, 18:44 EST Resolved - This incident has been resolved. Feb 11, 18:28 EST Monitoring - A fix has been implemented and we are monitoring the results. Feb 11, 17:28 EST Update - Teams continue to work to mitigate the impact of this issue. At this time, APM Trace Processing and APM Trace Monitors are still delayed. Distribution Metrics monitors were delayed between 21:35 and 22:15 UTC. Feb 11, 16:51 EST Update - Teams continue to work to mitigate the impact of this issue. At this time, APM Trace ...
Feb 5, 12:43 EST Resolved - This incident has been resolved. Feb 5, 12:23 EST Monitoring - We have observed full recovery of monitors and will continue to monitor. Feb 5, 12:01 EST Update - We are observing recovery for the vast majority of monitors and are continuing to work on a full fix for this issue. Feb 5, 11:50 EST Update - We are continuing to work on a fix for this issue. Feb 5, 11:31 EST Identified - The issue has been identified and a fix is being implemented. Feb 5, 11:28 EST Up...
Jan 29, 14:55 EST Resolved - This incident has been resolved. Jan 29, 14:09 EST Update - We are continuing to monitor the fix and will continue to provide regular updates. Jan 29, 13:36 EST Monitoring - We have deployed a fix and we are monitoring the results. We will continue to provide regular updates. Jan 29, 13:06 EST Update - We are continuing to work on a fix for this issue. It is important to note that no data has been lost, and evaluations will be caught up once the service is operationa...
Jan 28, 18:36 EST Resolved - This incident has been resolved. Jan 28, 18:13 EST Monitoring - A fix has been implemented and we are monitoring the results. Jan 28, 17:08 EST Identified - The issue has been identified and a fix is being implemented. Jan 28, 16:21 EST Update - We are continuing to investigate this issue. Jan 28, 15:30 EST Investigating - We are investigating delays in service check monitor evaluations, which began at 20:26 UTC on 1/28/2026.
Jan 22, 14:27 EST Resolved - This incident has been resolved. Jan 22, 14:13 EST Monitoring - A fix has been implemented and we are monitoring the results. Jan 22, 14:01 EST Update - We are continuing to investigate this issue. Jan 22, 13:51 EST Investigating - We are investigating loading issues on our web application. As a result, some users might be getting errors when loading the web application. Please note that data processing and alerts are not affected by this incident.
Jan 18, 09:24 EST Resolved - This incident is resolved. Event processing is no longer delayed, and there is no remaining impact on the event stream, event-based widgets, or event-based monitors. Jan 18, 08:26 EST Update - Recovery is in progress and the new estimated time of recovery is 14:30 UTC. Jan 18, 07:35 EST Update - We have identified the issue and scaled up for recovery, which is estimated to complete around 14:30 UTC. We'll continue to give updates as recovery progresses. Jan 18, 07:33 E...
Dec 12, 18:43 EST Resolved - This incident has been resolved. Dec 12, 18:31 EST Monitoring - A fix has been implemented and we are monitoring the results. Dec 12, 17:41 EST Identified - The issue has been identified and a fix is being implemented. Dec 12, 16:53 EST Update - We are continuing to investigate this issue. Dec 12, 16:49 EST Investigating - We are investigating increased latency processing Processes data. As a result of this issue, some users may see delays or gaps for data based on P...
Dec 12, 16:52 EST Resolved - All impact related to APM metrics has been resolved. A separate incident has been created to track the remaining impact in live process data. Dec 12, 15:53 EST Identified - We have identified the issue causing ingestion delays in APM and process metrics and are working on recovery. Dec 12, 14:26 EST Update - We are currently investigating lag in ingesting APM and process metrics, which affects monitor evaluation and in some cases led to incorrect monitor alerts. De...
Dec 9, 16:08 EST Resolved - This incident has been resolved. Live data is being processed normally and gaps in distribution metrics on graphs will be backfilled within the next hour. Dec 9, 16:00 EST Monitoring - Live distribution metrics are available and being evaluated for all monitors. Gaps in graphs from the beginning of the incident are in the process of being backfilled. Dec 9, 15:36 EST Identified - The issue has been identified and a fix is being implemented. Dec 9, 15:16 EST Invest...
Nov 19, 14:44 EST Resolved - This incident has been resolved as of 2:32 PM ET. Nov 19, 14:37 EST Monitoring - A fix has been implemented and we are monitoring the results. Nov 19, 14:28 EST Update - We continue to investigate the issue with the web application. Data processing and alerting remain operational. Nov 19, 14:08 EST Investigating - We are investigating loading issues on our web application. As a result, some users might be getting errors when loading the web application.
Nov 18, 08:53 EST Resolved - This incident has been resolved. Notification delays affected only our internal monitoring and were due to the ongoing Cloudflare incident: https://www.cloudflarestatus.com/incidents/8gmgl950y3h7/. Nov 18, 08:17 EST Investigating - We are investigating delays in RUM-based Monitor Notifications, which began at 11:30 AM UTC.
Nov 17, 12:20 EST Resolved - All errors stopped as of 12:02 ET. This incident has been resolved. Nov 17, 12:12 EST Monitoring - The rollout of a fix is in progress and we are no longer seeing errors. We are monitoring the incident as recovery continues. Nov 17, 11:40 EST Identified - The issue has been identified and we are taking measures to mitigate it while working on a fix. Nov 17, 11:23 EST Investigating - We are investigating loading issues on the dashboar...