
"We experienced an outage at Coinbase last night, which is never acceptable, Armstrong wrote on May 8. He added that most Coinbase systems were designed to withstand downtime in one AWS Availability Zone, but the centralized exchange did not respond that way during the outage. It is possible to make exchanges resistant to AZ failures, but this can introduce latency delays that are not desirable along with breaking customer co-location, Armstrong stated, adding: Given this incident, we'll revisit these tradeoffs to ensure we're giving you the best possible venue to trade."
"Coinbase is reviewing its exchange infrastructure after an AWS data center cooling failure knocked several trading services offline, blocked some account access, and delayed customer balance displays. CEO Brian Armstrong called the outage unacceptable and said Coinbase will revisit tradeoffs around speed, co-location, and faster recovery during infrastructure failures. Coinbase plans to revisit resilience tradeoffs to reduce future outage duration and customer impact."
"Coinbase (Nasdaq: COIN) has explained how an AWS data center cooling failure triggered a service outage that disrupted trading, exchange access, and customer account data across the platform. Coinbase CEO Brian Armstrong addressed the incident on X, while engineering lead Rob Witoff detailed the recovery process and customer impact. Trading, account access, and customer account information were disrupted across several Coinbase exchange services."
"At a minimum, the duration of an outage should be able to be reduced considerably when an AZ move is needed. Armstrong noted that Coinbase will review how it balances exchange speed, customer co-location, and recovery time a"
An AWS data center cooling failure triggered an outage that disrupted multiple Coinbase exchange services. Trading activity was interrupted, some account access was blocked, and customer balance displays were delayed. Coinbase stated that most systems were designed to tolerate downtime within a single AWS Availability Zone, but the centralized exchange did not behave that way during the incident. Coinbase leadership called the outage unacceptable and said resilience approaches that prevent Availability Zone failures can introduce latency and affect customer co-location. Coinbase plans to revisit tradeoffs among speed, co-location, and recovery time, with a goal of significantly reducing outage duration when an Availability Zone move is required.
#cloud-infrastructure #aws-availability-zones #service-resilience #trading-systems #incident-recovery
Read at news.bitcoin.com
Unable to calculate read time
Collection
[
|
...
]