Datadog
All Systems Operational

About This Site

The status of Datadog.
If you would like to see the status of third party integrations you might have enabled with Datadog check out https://datadogintegrations.statuspage.io
If you are a customer running in our EU region check out https://status.datadoghq.eu

Alerting Engine Operational
API Operational
API Crawlers Operational
APM ? Operational
Corporate Site (www.datadoghq.com) ? Operational
Daily and Weekly Reports Operational
Event Pipeline Operational
Historical Data Operational
Logs Operational
Metrics Pipeline Operational
Package Repositories Operational
Processes Operational
Synthetics Operational
Web Application Operational
Operational
Degraded Performance
Partial Outage
Major Outage
Maintenance
Past Incidents
Sep 26, 2020

No incidents reported today.

Sep 25, 2020

No incidents reported.

Sep 24, 2020
Resolved - All live data including APM metrics are now current, as well as corresponding APM alerts. Note that a subset of historical APM metric data may still show gaps and will be recalculated, along with SLOs over the next 24h. We apologize again for the inconvenience this outage has caused.
Sep 24, 21:40 EDT
Update - Events data is now current. We are continuing to backfill delayed data for APM metrics.
Sep 24, 20:55 EDT
Update - Processes and NPM data is now current. We are currently processing remaining data backlogs and are continuing to backfill delayed data for events and APM metrics.
Sep 24, 19:49 EDT
Monitoring - We are currently processing remaining data backlogs. We’re now current with Metric data and alerts, and are working on backfilling delayed data for events, APM metrics, processes and NPM.
Sep 24, 18:31 EDT
Update - We are making further progress in the recovery of customer-facing systems. The web application and APIs are operational, so are logs and corresponding alerts, as well as live APM traces. A subset of metric data is still delayed and being caught-up. We are still however working on processing backloged APM metrics and other types of alerts.
Sep 24, 17:19 EDT
Update - We are making further progress in the recovery of customer-facing systems. The web application and APIs are operational, so are logs and corresponding alerts, as well as live APM traces. A subset of metric data is still delayed and being caught-up. We are still however working on processing backloged APM metrics and other types of alerts.
Sep 24, 17:17 EDT
Update - We are making further progress in the recovery of customer-facing systems. The web application and APIs are operational, so are logs and corresponding alerts, as well as live APM traces. A subset of metric data is still delayed and being caught-up. We are still however working on processing backloged APM metrics and other types of alerts.
Sep 24, 17:03 EDT
Update - We are making further progress in the recovery of customer-facing systems. The web application and APIs are operational, so are logs and corresponding alerts, as well as live APM traces. A subset of metric data is still delayed and being caught-up. We are still however working on processing backloged APM metrics and other types of alerts.
Sep 24, 16:59 EDT
Update - We are making progress in the recovery of customer-facing systems. Web application error rate is down, metrics data is available, although we are still catching-up on some of the delayed data. Logs data is available and timely. We are still working on re-enabling all functionality and catching-up our alerting systems.
Sep 24, 16:28 EDT
Update - We are making progress in the recovery of customer-facing systems. Web application error rate is down, metrics data is available, although we are still catching-up on some of the delayed data. Logs data is available and timely. We are still working on re-enabling all functionality and catching-up our alerting systems.
Sep 24, 16:26 EDT
Update - We are still working to resolve this outage. We are working to divert traffic away from the affected components and restoring our customer-facing services. Our mitigations are showing progress, but we are still observing high error rates in our web application and API, and delays in metrics processing and alerting.
Sep 24, 15:36 EDT
Update - We are currently experiencing a widespread outage in our US-1 Data center, and all hands are on deck to resolve it - we are truly sorry for the inconvenience and are working towards a timely resolution. The infrastructure that allows the configuration and resolution of our services is currently severely degraded, causing a number of customer-facing services to be disrupted. This results in high error rates in our web application and API, delays in metrics processing and disrupts alerting.
Sep 24, 14:32 EDT
Update - We are continuing to actively work to mitigate the internal infrastructure connectivity issue impacting multiple systems.
Sep 24, 14:19 EDT
Update - We are continuing to actively work to mitigate the internal infrastructure connectivity issue impacting multiple systems.
Sep 24, 13:16 EDT
Identified - We are actively working on an issue that affects internal infrastructure connectivity and is impacting multiple systems.
Sep 24, 12:35 EDT
Update - We are continuing to investigate this issue.
Sep 24, 12:31 EDT
Update - We are continuing to investigate the elevated error rate on the web application.
Sep 24, 12:19 EDT
Update - We are continuing to investigate the elevated error rate on the web application.
Sep 24, 11:42 EDT
Update - We are continuing to investigate this issue.
Sep 24, 11:07 EDT
Update - We are continuing to investigate the elevated error rate on the web application.
Sep 24, 11:06 EDT
Investigating - We are seeing an elevated error rate on the web application. We are currently investigating the issue. It's important to note that monitoring data is properly processed and that no data is lost.
Sep 24, 10:27 EDT
Resolved - This incident has been resolved.
Sep 24, 21:38 EDT
Investigating - We are actively investigating reports of issues logging in using Google Auth.
Sep 24, 21:25 EDT
Resolved - Combined into single incident
Sep 24, 12:37 EDT
Update - We are continuing to investigate this issue.
Sep 24, 11:47 EDT
Investigating - We're investigating an issue with delayed metric monitors since 13:40 UTC.
Sep 24, 10:00 EDT
Resolved - Combined into single incident.
Sep 24, 12:36 EDT
Update - We are continuing to investigate this issue.
Sep 24, 11:48 EDT
Investigating - We are currently investigating delays in processing metric data.
Sep 24, 11:22 EDT
Resolved - This incident has been resolved.
Sep 24, 10:16 EDT
Investigating - We are seeing an elevated error rate on the web application. We are currently investigating the issue. It's important to note that monitoring data is properly processed and that no data is lost.
Sep 24, 10:08 EDT
Sep 23, 2020

No incidents reported.

Sep 22, 2020

No incidents reported.

Sep 21, 2020

No incidents reported.

Sep 20, 2020

No incidents reported.

Sep 19, 2020

No incidents reported.

Sep 18, 2020

No incidents reported.

Sep 17, 2020

No incidents reported.

Sep 16, 2020

No incidents reported.

Sep 15, 2020

No incidents reported.

Sep 14, 2020

No incidents reported.

Sep 13, 2020

No incidents reported.

Sep 12, 2020

No incidents reported.