Learning the right lessons from the Amazon cloud outage

Updated: May 02, 2011

We're now experiencing a backlash from this crash. People are reconsidering the wisdom of moving to the cloud, and to public clouds in particular. Perhaps the large infrastructure vendors who warned their customers about the security and reliability issues with public clouds, in order to sell more gear for building private clouds, were right after all?

Not so fast. If we place the Amazon crash into its proper context, we are in a better position to learn the right lessons from this crisis, rather than reacting out of fear to an event taken out of that context. Here, then, are some essential lessons we should take away from the crash:

  • There is no such thing as 100 percent reliability. In fact, there's nothing 100 percent about any of IT: no code is 100 percent bug-free, no system is 100 percent crashproof, and no security is 100 percent impenetrable. Just because Amazon came up snake eyes on this throw of the dice doesn't mean that public clouds are any less reliable than they were before the crisis. Whether you are investing in the stock market or building a high-availability IT infrastructure, the best way to lower risk is to diversify. You got eggs? The more baskets the better.
  • This particular crisis is unlikely to happen ever again. We can safely assume that Amazon has some wicked smart cloud experts, and that they had already built a cloud architecture that could withstand most challenges. Suffice it to say, therefore, that the latest crisis had an unusual and complex set of causes. It also goes without saying that those experts are working feverishly to root out those causes, so that this particular set of circumstances won't happen again.

  • Unknown unknowns are by definition unpredictable. Even though the particular sequence of events that led to the current crisis is unlikely to happen again, other entirely unpredictable issues are relatively likely to arise in the future. But such issues could affect private, hybrid, or community clouds just as much as they might hit the public cloud again. In other words, bailing on public clouds to take refuge in the supposedly safer private cloud arena is an exercise in futility.
  • The most important lesson for Amazon to learn is more about visibility than reliability. The weakest part of Amazon's cloud offerings is the lack of visibility it gives its customers. This "never mind the man behind the curtain" attitude is part of how Amazon supports the cloud abstraction I discussed in the previous ZapFlash. But now it's working against Amazon and its customers. For Amazon to build on its success, it must open the kimono a bit and provide its customers a level of management visibility into its internal infrastructure that it has so far been uncomfortable delivering.
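
The diversification point in the first lesson can be made concrete with some back-of-the-envelope arithmetic. The sketch below is illustrative only: the 99.9 percent availability figure is an assumed number, not one published by Amazon, the function names are hypothetical, and the calculation assumes that deployments fail independently of one another, which the unknown-unknowns lesson suggests will not always hold.

```python
HOURS_PER_YEAR = 24 * 365


def combined_availability(availabilities):
    """Probability that at least one deployment is up.

    Assumes failures are statistically independent, which is a strong
    assumption: correlated failures make the real figure worse.
    """
    p_all_down = 1.0
    for a in availabilities:
        p_all_down *= (1.0 - a)
    return 1.0 - p_all_down


def expected_downtime_hours(availability):
    """Expected hours per year during which nothing is available."""
    return (1.0 - availability) * HOURS_PER_YEAR


if __name__ == "__main__":
    assumed = 0.999  # illustrative per-deployment availability, not a vendor figure
    for baskets in (1, 2, 3):
        total = combined_availability([assumed] * baskets)
        hours = expected_downtime_hours(total)
        print(f"{baskets} basket(s): availability {total:.9f}, "
              f"about {hours:.4f} hours of downtime per year")
```

Under these assumptions each extra basket multiplies the chance of a total outage by the failure probability of the added deployment, so the risk falls quickly but never reaches zero. That is consistent with the article's claim that nothing in IT is 100 percent.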
