Try Bifrost Enterprise free for 14 days.
Request access
Replicate
[ LIVE STATUS ]

Is Replicate Down?

Replicate model hosting, inference API, and machine learning platform services.

Official status data last reflected here:

All Systems Operational
Live · updated just now

[ STATUS AT A GLANCE ]

Operational
Current Status
All Systems Operational
13
Components
Service areas tracked on this page
5
90d Incidents
Incidents reported in the last 90 days

System Components

Current status of individual Replicate services

BillingAll Systems Operational
system-wide incident history
90 days agoToday
Support TicketsAll Systems Operational
system-wide incident history
90 days agoToday
Streaming APIAll Systems Operational
system-wide incident history
90 days agoToday
HTTP APIAll Systems Operational
system-wide incident history
90 days agoToday
CPU HardwareAll Systems Operational
system-wide incident history
90 days agoToday
Replicate Registry (r8.im)All Systems Operational
system-wide incident history
90 days agoToday
A100 HardwareAll Systems Operational
system-wide incident history
90 days agoToday
PlaygroundAll Systems Operational
system-wide incident history
90 days agoToday
Home PageAll Systems Operational
system-wide incident history
90 days agoToday
L40S HardwareAll Systems Operational
system-wide incident history
90 days agoToday
H100 HardwareAll Systems Operational
system-wide incident history
90 days agoToday
T4 HardwareAll Systems Operational
system-wide incident history
90 days agoToday
Official ModelsAll Systems Operational
system-wide incident history
90 days agoToday
[ AUTOMATIC FAILOVER ]

Replicate down? Route around it.

When Replicate has issues, Bifrost automatically routes your requests to a healthy alternative provider. Zero code changes. 99.999% effective uptime.

About Replicate

What Replicate does, where the data on this page comes from, and recent reliability

[ ABOUT REPLICATE ]

About Replicate

Replicate provides Replicate API, Hosted open models, and Inference jobs. Replicate is used to access a wide range of models, so availability issues can impact multiple AI features at once across image, audio, and text workflows.

This page pulls data from Replicate's official status page to show current service health, any active incidents, and a history of recent issues, all in one view.

Replicate APIHosted open modelsInference jobs

[ DATA SOURCES ]

Full incident history available

Replicate publishes detailed component status, a full incident archive, and scheduled maintenance data through their official status page.

  • Data pulled from Replicate's official status page (www.replicatestatus.com)
  • Refreshed every 60 seconds
  • Covers Replicate API, Hosted open models, and Inference jobs
  • Includes full incident archive and scheduled maintenance history

[ RELIABILITY ]

Recent reliability

  • 5 incidents reported over the last 90 days.
  • Last reported incident was 18 days ago.
  • All 13 monitored components are currently operational.

[ COMMON USE CASES ]

How teams use Replicate

Replicate is used to access a wide range of models, so availability issues can impact multiple AI features at once across image, audio, and text workflows.

Hosted model APIs
Batch generation
Product experimentation

Incidents & Maintenance

Active incidents, scheduled maintenance, and incident history for Replicate

Past Incidents

Degraded A100 hardware

Apr 19, 2026 · Resolved Apr 19, 2026

Resolvedminor
ResolvedApr 19, 2026, 3:02 PM UTC

MonitoringApr 19, 2026, 2:36 PM UTC

A100 capacity unavailable during storage maintenance

Apr 9, 2026 · Resolved Apr 9, 2026

Resolvedmajor
ResolvedApr 9, 2026, 5:30 PM UTC

InvestigatingApr 9, 2026, 4:43 PM UTC

Downstream errors for Black Forest Labs models

Mar 23, 2026 · Resolved Mar 24, 2026

Resolvedminor
ResolvedMar 24, 2026, 1:34 AM UTC

IdentifiedMar 23, 2026, 3:57 PM UTC

Degraded performance on Flux Schnell

Mar 10, 2026 · Resolved Mar 10, 2026

Resolvednone
ResolvedMar 10, 2026, 6:17 PM UTC

MonitoringMar 10, 2026, 1:11 PM UTC

InvestigatingMar 10, 2026, 12:56 PM UTC

Model Predictions Stuck at "Starting"

Feb 20, 2026 · Resolved Feb 20, 2026

Resolvedmajor
ResolvedFeb 20, 2026, 1:12 PM UTC

MonitoringFeb 20, 2026, 12:59 PM UTC

InvestigatingFeb 20, 2026, 12:13 PM UTC

Increased setup failures for T4 models

Jan 26, 2026 · Resolved Jan 26, 2026

Resolvednone
ResolvedJan 26, 2026, 3:16 PM UTC

MonitoringJan 26, 2026, 10:10 AM UTC

InvestigatingJan 26, 2026, 8:53 AM UTC

InvestigatingJan 26, 2026, 7:33 AM UTC

Predictions and training unavailable for multiple models

Jan 20, 2026 · Resolved Jan 21, 2026

Resolvedminor
ResolvedJan 21, 2026, 12:22 AM UTC

InvestigatingJan 20, 2026, 10:53 PM UTC

Flux Schnell unavailable

Jan 20, 2026 · Resolved Jan 21, 2026

Resolvedminor
ResolvedJan 21, 2026, 12:07 AM UTC

MonitoringJan 20, 2026, 9:39 PM UTC

InvestigatingJan 20, 2026, 8:58 PM UTC

Prediction Errors

Jan 15, 2026 · Resolved Jan 15, 2026

Resolvedmajor
ResolvedJan 15, 2026, 10:34 AM UTC

InvestigatingJan 15, 2026, 10:33 AM UTC

InvestigatingJan 15, 2026, 9:32 AM UTC

InvestigatingJan 15, 2026, 8:34 AM UTC

InvestigatingJan 15, 2026, 7:38 AM UTC

High demand for H100 hardware type

Dec 18, 2025 · Resolved Dec 20, 2025

Resolvedminor
ResolvedDec 20, 2025, 5:10 AM UTC

MonitoringDec 18, 2025, 7:06 PM UTC

Limited availability of L40S hardware

Dec 11, 2025 · Resolved Dec 11, 2025

Resolvedminor
ResolvedDec 11, 2025, 9:49 PM UTC

MonitoringDec 11, 2025, 8:38 PM UTC

Global network outage

Nov 18, 2025 · Resolved Nov 18, 2025

Resolvedcritical
ResolvedNov 18, 2025, 6:42 PM UTC

MonitoringNov 18, 2025, 5:35 PM UTC

MonitoringNov 18, 2025, 2:48 PM UTC

InvestigatingNov 18, 2025, 2:26 PM UTC

InvestigatingNov 18, 2025, 12:48 PM UTC

+ 3 more updates

sora-2-pro currently unavailable

Nov 13, 2025 · Resolved Nov 17, 2025

Resolvednone
ResolvedNov 17, 2025, 2:03 PM UTC

MonitoringNov 13, 2025, 4:19 AM UTC

InvestigatingNov 13, 2025, 3:12 AM UTC

Downstream Service Disruption

Oct 29, 2025 · Resolved Oct 30, 2025

Resolvedmajor
ResolvedOct 30, 2025, 4:04 PM UTC

MonitoringOct 29, 2025, 7:04 PM UTC

Luma models not running

Oct 22, 2025 · Resolved Oct 22, 2025

Resolvedmajor
ResolvedOct 22, 2025, 4:05 PM UTC

MonitoringOct 22, 2025, 2:05 PM UTC

Intermittent issues with `cog push` with large images

Oct 21, 2025 · Resolved Oct 21, 2025

Resolvedminor
ResolvedOct 21, 2025, 1:18 PM UTC

MonitoringOct 21, 2025, 12:02 PM UTC

InvestigatingOct 21, 2025, 11:07 AM UTC

Replicate Platform Outage

Oct 20, 2025 · Resolved Oct 21, 2025

Resolvedcritical
ResolvedOct 21, 2025, 2:46 AM UTC

InvestigatingOct 20, 2025, 9:14 PM UTC

InvestigatingOct 20, 2025, 7:38 PM UTC

InvestigatingOct 20, 2025, 7:28 PM UTC

Widespread service degradation

Oct 20, 2025 · Resolved Oct 21, 2025

Resolvedminor
ResolvedOct 21, 2025, 2:44 AM UTC

MonitoringOct 21, 2025, 1:19 AM UTC

MonitoringOct 20, 2025, 2:37 PM UTC

Heygen models outage

Sep 30, 2025 · Resolved Oct 2, 2025

Resolvedmajor
ResolvedOct 2, 2025, 4:55 PM UTC

IdentifiedSep 30, 2025, 5:37 PM UTC

Google Models are down

Sep 29, 2025 · Resolved Sep 29, 2025

Resolvedmajor
ResolvedSep 29, 2025, 8:48 PM UTC

InvestigatingSep 29, 2025, 6:26 PM UTC

Frequently Asked Questions

Is Replicate down right now?

Check the status indicator at the top of this page. It pulls directly from Replicate's official status page. If Replicate is experiencing any issues, you'll see it reflected here. This real-time monitoring helps teams quickly identify whether performance problems are caused by Replicate infrastructure or their own systems.

What does this Replicate status page track?

This page tracks Replicate API, Hosted open models, and Inference jobs using data from Replicate's official status page. You can see current component health, active incidents, and a history of past issues. This visibility is crucial for teams building resilient AI applications that need to route around provider outages.

How often is Replicate status updated here?

We check Replicate's status page every 60 seconds to ensure you get near real-time status updates. How quickly issues show up here depends on how fast Replicate updates their own official status. For production systems that need instant failover, Bifrost can automatically detect and route around degraded providers.

Why monitor Replicate status?

Replicate is used to access a wide range of models, so availability issues can impact multiple AI features at once across image, audio, and text workflows. Real-time status monitoring enables proactive incident response and helps teams decide when to route traffic to alternative providers for maximum uptime.

What should I do when Replicate goes down?

When Replicate experiences an outage, the best practice is automatic failover to alternative AI providers. Bifrost is an open-source AI gateway that automatically detects Replicate degradation and routes LLM traffic to healthy alternatives like Stability AI, Hugging Face, keeping your application running with zero manual intervention. This intelligent routing ensures your users never experience downtime from a single provider's issues.

How can I prevent Replicate downtime from affecting my application?

Production AI applications should never depend on a single provider. Bifrost AI Gateway provides automatic multi-provider failover, intelligent load balancing, and health-based routing. When Replicate degrades, Bifrost instantly routes requests to operational alternatives while maintaining API compatibility. This architecture approach is used by teams running mission-critical AI features.

What are common causes of Replicate outages and degraded performance?

Common causes of Replicate issues include infrastructure scaling challenges, regional cloud provider problems, API gateway overload, and deployment errors. Monitor this page to stay informed, and consider implementing automatic failover with Bifrost to maintain uptime during Replicate incidents.

Alternative Providers for Automatic Failover

When Replicate experiences issues, Bifrost AI Gateway can automatically route your LLM traffic to these alternatives

💡 Build resilient AI apps: Configure Bifrost to automatically detect Replicate outages and route requests to healthy alternatives. This multi-provider approach ensures your application maintains uptime even when individual providers experience issues. Learn more about Bifrost AI Gateway