Microsoft: Azure Latency Surge as Red Sea Cables Cut, Traffic Rerouted

On September 6, 2025, Microsoft issued a service health advisory warning Azure customers that multiple undersea fiber-optic cable cuts in the Red Sea are causing increased latency and intermittent connectivity issues. The disruptions stem from forced traffic reroutes onto longer alternate paths, as critical submarine cables serving the Europe–Asia corridor remain severed. This incident exposes the physical fragility underpinning cloud infrastructure and has reignited discussions about network resilience in geopolitically tense chokepoints.

Why the Red Sea Matters for Cloud Traffic

The Red Sea is a strategic transit point for submarine communications, funneling data between Europe, the Middle East, and Asia. Several major fiber systems—including AAE-1, SEA-ME-WE 4, and IMEWE—carry a significant share of intercontinental cloud traffic. When these cables are damaged, traffic must circumnavigate the outage by traversing longer routes, often adding thousands of kilometers to the propagation path. The result is elevated round-trip time (RTT), increased jitter, and degraded throughput for latency-sensitive applications.

Previous incidents in 2024 demonstrated how quickly damage in this corridor can cause regional performance drops. Monitoring firms such as NetBlocks reported measurable latency increases of 50-150% on some Europe–Asia routes during earlier outages. Microsoft’s advisory noted that Azure users “may experience increased latency” and that engineers are actively rebalancing and optimizing routing to limit impact. The warning specifically flagged workloads that traverse the Middle East, though any service with endpoints across affected regions could feel the pinch.

What the Azure Advisory Means

Microsoft’s health update breaks down into several technical implications:
- Routing detours increase latency: Traffic once flowing directly through Red Sea cables is now being shifted to alternative subsea or terrestrial paths, potentially routing around the Cape of Good Hope or through trans-Pacific cables. The extra distance and additional network hops unavoidably add latency and packet loss. For example, a direct London–Mumbai path that normally takes ~100ms might now exceed 200ms.
- Performance is regional and uneven: Not all Azure regions or customers are equally hit. The impact concentrates on traffic originating or terminating in Asia, the Middle East, and Europe. Control-plane operations like management APIs may remain responsive if they use separate regional endpoints, but data-plane workloads—database replication, backups, application traffic—are especially sensitive.
- Cascading effects possible: Past Azure incidents have shown that network stress can trigger wider service degradation, affecting storage partitions, private endpoints, and identity services in complex ways because of cloud-internal dependencies. For instance, synchronous storage replication that depends on low-latency links may timeout, causing partitions to become unavailable.

Why Submarine Cable Repairs Take So Long

Undersea cable repair is a slow and specialized operation. Only a handful of cable ships exist globally, and they must reach the fault area safely—a challenge when regional maritime security is unstable. The Red Sea has seen heightened tensions, including attacks on shipping and the presence of unmanned vessels, which can delay or block repair crews. Permitting, weather, and ship availability can stretch repair timelines from days to months. Industry reports suggest that even in favorable conditions, a single cable cut can take 2-4 weeks to fix.

Alternate routing, while essential, is imperfect. It concentrates traffic onto other cables, risking congestion. Satellite and microwave backhaul offer stopgap capacity but at higher cost and latency, unsuitable for real-time applications like videoconferencing or financial trading. Some carriers have begun leasing additional capacity on alternate routes, but this is a temporary and expensive measure.

Who Is Most at Risk?

The incident’s blast radius falls hardest on:
- Synchronous replication and low-latency systems: Databases requiring sub-10ms replication or distributed consensus protocols (e.g., leader elections) can time out, causing outages or degraded throughput. Systems using Azure SQL Database with geo-replication or Cosmos DB multi-region writes are vulnerable.
- Real-time user experiences: Interactive web apps, VoIP, remote desktop, and gaming see higher lag and jitter, directly affecting user experience. A European user connecting to an Asia-based game server may experience unplayable ping spikes.
- Financial services and market-sensitive traffic: Trading platforms that depend on deterministic low-latency paths may face measurable degradation, potentially impacting transaction outcomes. Firms without pre-arranged alternative routes via private fiber may see spreads widen.
- Organizations without multi-region disaster recovery: Enterprises that rely on a single Azure region or have untested failover plans face the highest operational risk. Even when Microsoft reroutes internally, customer readiness is often the deciding factor in recovery speed.

Recommended Actions for Azure Customers

Microsoft recommends several immediate steps for affected enterprises:
1. Check Azure Service Health for your subscriptions and follow tailored guidance. Personalised alerts and impact notices are available in the portal.
2. Identify any applications using synchronous cross-region communication that may transit the Red Sea corridor. Tools like Azure Network Watcher or third-party observability platforms can help map traffic paths.
3. Execute tested failover procedures for production workloads that can tolerate short downtimes. For example, fail over from West Europe to North Europe if latency is impacting critical services.
4. For latency-sensitive services that cannot fail over, consider shifting traffic to CDN edge nodes or alternative transit providers if available. Azure Front Door can route requests to the nearest PoP.
5. Ramp up monitoring: set alerts for RTT thresholds, packet loss, and error rates that correlate with routing changes. Azure Monitor and third-party services can provide visibility.
6. If no disaster recovery playbook exists, prioritize critical services, contact Microsoft support, and document impacts for post-incident review and possible service credit claims. Microsoft’s SLA covers certain latency-related issues, and credits may apply.

Microsoft’s Response: Strengths and Limits

Strengths
Microsoft moved quickly to alert customers and committed to daily updates—a critical step for transparency. Its massive global backbone and extensive peering relationships allow dynamic rerouting, reducing the risk of total connectivity loss. Historical experience in handling similar outages gives Azure engineers playbooks for traffic rebalancing and partition recovery. The company’s investment in hollow-core fiber experiments and expanded backbone capacity may mitigate some future risks, but those are long-term plays.

Limits
The physical repair process remains out of Microsoft’s direct control; cable ships and safe access may delay full restoration for weeks. Complex cloud dependencies mean a networking issue can manifest as storage or identity failures, complicating diagnostics and communication. Additionally, some customers may experience degradation without clear localized messaging, a persistent challenge during large-scale cloud incidents. Past Azure status dashboard inaccuracies have drawn criticism, and this incident may test user trust again.

Geopolitical and Industry Context

The Red Sea has become a recurring point of fragility. Regional conflicts, vessel sinkings, and political tensions have previously delayed cable repairs and cast doubt on the security of the corridor. In 2024, attacks by Houthi rebels reportedly caused damage to multiple cables, though attribution remains disputed. Industry analysts have long warned that the submarine cable ecosystem lacks sufficient geographic redundancy at critical chokepoints. The current cuts—whether accidental or intentional—demonstrate how simultaneous damage to a few cables can disrupt traffic across continents.

Governments and industry bodies are increasingly calling for international cooperation on cable protection and repair prioritization. The International Telecommunication Union (ITU) has urged member states to expedite permit processes for repair ships. Yet, geopolitical realities often slow progress.

Long-Term Implications for IT Architecture

This incident is a stark reminder that cloud resilience cannot be reduced to software-defined abstractions. Physical infrastructure—subsea cables, terrestrial backbones, and the peering points between them—is still the foundation. IT architects must:
- Design for failure: multi-region architectures, asynchronous replication, and cross-cloud redundancy are no longer optional for critical workloads. Azure’s availability zones and paired regions offer some protection, but active-active designs may be necessary for ultra-critical apps.
- Know your traffic paths: real-time network observability and dependency mapping help identify single points of failure. Tools like Azure Monitor, ThousandEyes, or Kentik can reveal unexpected route obligations.
- Invest in private connectivity where necessary: private interconnects such as Azure ExpressRoute can offer more predictable routing than public internet paths, though they still depend on physical cable routes.
- Treat satellite and microwave as temporary bridges, not permanent fixes; they are expensive and suboptimal for real-time applications.
- Engage cloud providers proactively during incidents to understand mitigation strategies and retain contractual recourse.

What Remains Unclear

Repair timelines hinge on ship deployment and on-site safety, with estimates ranging from weeks to months. Cable operators have not issued firm completion dates. Attribution of the cuts is also unresolved, and geopolitical dynamics may continue to influence repair access. Azure engineers continue to rebalance traffic, but the situation remains fluid. Microsoft’s daily updates will be the best source of timely information.

Conclusion

The September 6 Azure advisory is a powerful wake-up call: the physical layer of the internet remains deeply intertwined with cloud reliability. Microsoft’s rapid response and traffic engineering mitigations will limit the worst effects, but elevated latency and uneven performance are likely to persist until repairs are completed. For enterprises running workloads across the affected corridors, immediate action—checking service health, validating dependencies, and executing disaster recovery plans—is essential. In a world where digital transformation races ahead, the resilience of undersea cables may be the next frontier of cloud architecture.