Datadog
All Systems Operational

About This Site

The status of Datadog.
If you would like to see the status of third party integrations you might have enabled with Datadog check out https://datadogintegrations.statuspage.io
If you are a customer running in our EU region check out https://status.datadoghq.eu

Alerting Engine Operational
API Operational
API Crawlers Operational
APM ? Operational
Corporate Site (www.datadoghq.com) ? Operational
Daily and Weekly Reports Operational
Event Pipeline Operational
Historical Data Operational
Logs Operational
Metrics Pipeline Operational
Package Repositories Operational
Processes Operational
Synthetics Operational
Web Application Operational
Operational
Degraded Performance
Partial Outage
Major Outage
Maintenance
Past Incidents
Mar 5, 2021

No incidents reported today.

Mar 4, 2021

No incidents reported.

Mar 3, 2021

No incidents reported.

Mar 2, 2021

No incidents reported.

Mar 1, 2021
Resolved - This incident has been resolved.
Mar 1, 14:26 EST
Update - We are continuing to monitor for any further issues.
Mar 1, 14:25 EST
Update - AWS crawled metrics are up to date. Alerts have been reenabled for all customers.
Mar 1, 14:15 EST
Monitoring - We have implemented a fix and are monitoring recovery. Customers should see only slight delays in crawled AWS metrics.
Mar 1, 14:08 EST
Identified - The issue has been identified and a fix is being implemented.
Mar 1, 13:58 EST
Investigating - AWS crawled metrics are currently delayed. To prevent false positives, monitor evaluation for these metrics has been paused.
Mar 1, 13:38 EST
Resolved - Intake has fully recovered. All events monitors have been re-enabled.
Mar 1, 10:13 EST
Update - Less than 1% of events are lagging. To prevent false positives event alerts remain disabled while intake recovers.
Mar 1, 09:50 EST
Identified - Event processing is currently delayed for some customers. As a result you might experience a delay with the event timeline, event based widgets or event based monitors. All event alerts are disabled while we implement a fix.
Mar 1, 09:26 EST
Feb 28, 2021

No incidents reported.

Feb 27, 2021

No incidents reported.

Feb 26, 2021
Resolved - This incident has been resolved.
Feb 26, 08:18 EST
Update - We are continuing to monitor for any further issues.
Feb 26, 07:51 EST
Monitoring - A fix has been implemented and we are monitoring the results.
Feb 26, 07:50 EST
Identified - The issue has been identified and a fix is being implemented.
Feb 26, 07:05 EST
Investigating - We’re actively investigating increased latencies collecting Azure and GCP metrics. As an effect, there might be delays in graphs displaying these metrics. To avoid spurious alerts, we’ve temporarily disabled “no data” alerts for these metrics.
Feb 26, 06:31 EST
Feb 25, 2021

No incidents reported.

Feb 24, 2021
Resolved - This incident has been resolved.
Feb 24, 00:56 EST
Monitoring - A fix has been implemented and we are monitoring the results.
Feb 24, 00:36 EST
Identified - We're investigating elevated error rates on web app pages including Logs, Trace Analytics, RUM and Network Monitoring.

It is important to note that data is still being ingested and processed, data will be available once the incident is resolved.
Feb 24, 00:31 EST
Feb 23, 2021
Resolved - This incident has been resolved.
Feb 23, 11:32 EST
Update - All impacted products are now returning results as expected (Logs, Trace Analytics, RUM, NPM). We are investigating possible degraded performance for some specific Logs queries that impact a subset of customers and continue monitoring status until complete resolution of this incident.
Feb 23, 10:42 EST
Monitoring - At this time error rates are down across all products, including web queries accessing historical data for Logs, Trace Analytics, RUM or NPM. We are monitoring the status until full resolution.
Feb 23, 09:53 EST
Update - We are continuing to work on a fix for this issue.
Feb 23, 08:58 EST
Update - We are continuing to work on a fix for this issue.
Feb 23, 08:25 EST
Update - We are continuing to work on a fix for this issue.
Feb 23, 07:49 EST
Update - We are continuing to work on this issue, customers might still experience errors on web app pages including Logs, Trace Analytics, RUM and Network Monitoring.
It is important to note that data is still being ingested and processed, data will be available once the incident is resolved.
Feb 23, 07:01 EST
Update - We have deployed additional resources to mitigate the impact and the web app errors are subsiding. At this point some queries might still return errors. We are monitoring the status until full resolution.
Feb 23, 06:18 EST
Update - We are continuing to work on a fix for this issue, the error rates continue to be elevated on web app pages including Logs, Trace Analytics, RUM and Network Monitoring.
It is important to note that data is still being ingested and processed, data will be available once the incident is resolved.
Feb 23, 05:42 EST
Update - We are continuing to work on this issue and mitigate the web app impact. At this point, the error rate is still high on web app queries for Logs, Trace Analytics and RUM, causing empty queries or error messages to be returned for these pages. Delays for monitor evaluations are almost back to normal.
Feb 23, 04:59 EST
Update - We are continuing to work on a fix for this issue.
Feb 23, 04:45 EST
Update - Error rates for affected monitors are returning to normal.

Error rates and latencies for web app queries are elevated for Logs, Trace Analytics, and RUM.
Feb 23, 04:33 EST
Identified - The issue has been identified and a fix is being implemented.
Feb 23, 04:09 EST
Investigating - We’re investigating processing delays for Logs, Trace Analytics, and RUM monitors. As a result, alerts for these monitor types might be delayed.

Dashboards, Logs Search, and other views of this data are fully operational. Other monitor types such as metrics and event alerts are unaffected.
Feb 23, 03:29 EST
Feb 22, 2021
Resolved - This incident has been resolved.
Feb 22, 17:21 EST
Monitoring - Azure and GCP metrics have caught up and alerts on these metrics have been enabled.

Additionally, notification latency is back to normal.
Feb 22, 17:04 EST
Update - AWS metrics have caught up and alerts on these metrics have been enabled.
Feb 22, 16:55 EST
Identified - The issue has been identified and a fix is being implemented.
Feb 22, 16:44 EST
Update - We are continuing to investigate this issue.
Feb 22, 16:42 EST
Investigating - We’re actively investigating increased intake latencies for AWS, GCP, and Azure metrics. As an effect, there might be delays in graphs displaying these metrics.

We’re also investigating increased latencies with all notifications.

To avoid spurious alerts we’ve temporarily disabled alerts on these metrics.
Feb 22, 16:34 EST
Feb 21, 2021

No incidents reported.

Feb 20, 2021

No incidents reported.

Feb 19, 2021

No incidents reported.