Close Menu
geekfence.comgeekfence.com
    What's Hot

    Posit AI Blog: Discrete Fourier Transform

    June 9, 2026

    Three Ways Big Data Has Changed the World of SEO

    June 9, 2026

    Beware of the genAI token trap

    June 9, 2026
    Facebook X (Twitter) Instagram
    • About Us
    • Contact Us
    Facebook Instagram
    geekfence.comgeekfence.com
    • Home
    • UK Tech News
    • AI
    • Big Data
    • Cyber Security
      • Cloud Computing
      • iOS Development
    • IoT
    • Mobile
    • Software
      • Software Development
      • Software Engineering
    • Technology
      • Green Technology
      • Nanotechnology
    • Telecom
    geekfence.comgeekfence.com
    Home»Technology»On the AWS Outage – O’Reilly
    Technology

    On the AWS Outage – O’Reilly

    AdminBy AdminNovember 4, 2025No Comments3 Mins Read3 Views
    Facebook Twitter Pinterest LinkedIn Telegram Tumblr Email
    On the AWS Outage – O’Reilly
    Share
    Facebook Twitter LinkedIn Pinterest Email



    Everybody notices when something big fails—like AWS’s US-EAST-1 region. And fail it did. All sorts of services and sites became inaccessible, and we all knew it was Amazon’s fault. A week later, when I run into a site that’s down, I still say, “Must be some hangover from the AWS outage. Some cache that didn’t get refreshed.” Amazon gets blamed—maybe even rightly—even when it’s not their fault.

    I’m not writing about fault, though, and I’m also not writing a technical analysis of what happened. There are good places for that online, including AWS’s own summary. What I am writing about is a reaction to the outage that I’ve seen all too often: “This proves we can’t trust AWS. We need to build our own infrastructure.”

    Building your own infrastructure is fine. But I’m also reminded of the wisest comment I heard after the 2012 US-EAST outage. I asked JD Long about his reaction to the outage. He said, “I’m really glad it wasn’t my guys trying to fix the problem.”1 JD wasn’t disparaging his team; he was saying that Amazon has a lot of expertise in running, maintaining, and troubleshooting really big systems that can fail suddenly in unpredictable ways—when just the right conditions happen to tickle a bug that had been latent in the system for years. That expertise is hard to find and expensive when you find it. And no matter how expert “your guys” are, all complex systems fail. After last month’s AWS failure, Microsoft’s Azure obligingly failed about 10 days later.

    I’m not really an Amazon fan or, more specifically, an AWS fan. But outages like this should force us to remember what they do right. AWS outages also warn us that we need to learn how to “craft ways of undoing this concentration and creating real choice,” as Signal CEO Meredith Whittaker points out. But Meredith understands how difficult it will be to build this infrastructure and that, for the present, there’s no viable alternative to AWS or one of the other hyperscalers.

    Operating and troubleshooting large systems is difficult and requires very specialized skills. If you decide to build your own infrastructure, you will need those skills. And you may end up wishing that it isn’t your guys trying to fix the problem.


    Footnote

    1. In 2012, I happened to be flying out of DC just as the storm that took US-EAST down was rolling in. My flight made it out, but it was dramatic.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

    Related Posts

    The AI Agents Stack (2026 Edition) – O’Reilly

    June 8, 2026

    50 Years of The Institute

    June 7, 2026

    The Best 3-in-1 Apple Charging Stations After Testing Top Models

    June 6, 2026

    AI slop is making its way into children’s books. Here’s how to avoid it

    June 5, 2026

    Seminole gaming overhaul permit ruling upheld in Florida

    June 4, 2026

    The Download: Trump’s new AI order, and smart glasses for warfare

    June 3, 2026
    Top Posts

    Understanding U-Net Architecture in Deep Learning

    November 25, 202548 Views

    Hard-braking events as indicators of road segment crash risk

    January 14, 202630 Views

    Redefining AI efficiency with extreme compression

    March 25, 202627 Views
    Don't Miss

    Posit AI Blog: Discrete Fourier Transform

    June 9, 2026

    Note: This post is an excerpt from the forthcoming book, Deep Learning and Scientific Computing…

    Three Ways Big Data Has Changed the World of SEO

    June 9, 2026

    Beware of the genAI token trap

    June 9, 2026

    Got a LinkedIn message from a recruiter? It might be Chinese intelligence, warn FBI and MI5

    June 9, 2026
    Stay In Touch
    • Facebook
    • Instagram
    About Us

    At GeekFence, we are a team of tech-enthusiasts, industry watchers and content creators who believe that technology isn’t just about gadgets—it’s about how innovation transforms our lives, work and society. We’ve come together to build a place where readers, thinkers and industry insiders can converge to explore what’s next in tech.

    Our Picks

    Posit AI Blog: Discrete Fourier Transform

    June 9, 2026

    Three Ways Big Data Has Changed the World of SEO

    June 9, 2026

    Subscribe to Updates

    Please enable JavaScript in your browser to complete this form.
    Loading
    • About Us
    • Contact Us
    • Disclaimer
    • Privacy Policy
    • Terms and Conditions
    © 2026 Geekfence.All Rigt Reserved.

    Type above and press Enter to search. Press Esc to cancel.