Essential Guide


Amazon S3 outage: A guide to getting over cloud failures


If there's one thing that the Amazon S3 outage of February 2017 taught the builders of Java cloud applications, it's that cloud computing platforms are not immune to catastrophic failures. And while some developers of Java cloud apps can get away with simply pointing the finger of blame for a four-hour outage at Amazon, others cannot be quite so glib and are now tackling the question of how to build more failover and resiliency into their cloud-hosted applications.

In the first section of this essential guide, we look at the realities behind the Amazon Simple Storage Service outage: what actually caused Amazon's US-East-1 region to falter, and the impact that failure has had on the confidence users have in their cloud computing vendors of choice.

The next section looks at some of the key concepts surrounding application failure, failover and availability, creating a baseline of terms that are important when software architects and Java cloud app developers are thinking about how to deal with unexpected cloud infrastructure failures.

And in the final section of this guide, we look at how to apply these concepts to the development of Java cloud apps and the provisioning of resources in single and multicloud environments.

If you are developing modern, cloud-native applications that are hosted on SaaS, platform-as-a-service or infrastructure-as-a-service systems provided by vendors such as Google, Amazon, Oracle or Microsoft, you must be aware that cloud outages are always a possibility. This essential guide will keep you abreast of the dangers of moving your applications to the cloud, while providing insight on how to mitigate the impact of a cloud vendor failure, such as the Amazon S3 outage of February 2017.

1. What happened?

Understanding the Amazon S3 outage

It was a surprise to everyone when the Amazon S3 outage took place on the East Coast in February 2017. In this section, we take a look at why the Amazon cloud failed and how that failure impacted the industry's confidence in cloud computing platforms.


Why the Amazon S3 outage is a big deal for cloud computing

The Amazon S3 outage has turned into the Fukushima moment of cloud computing, as users re-evaluate the cloud's long-term viability.


The Amazon S3 outage was more than user input

Maybe there was an input error during some daytime troubleshooting, but input error was not what caused the Amazon S3 outage.

2. Understanding software apps

Concepts in Java cloud app availability

When thinking about the availability and the resiliency of Java cloud apps, there are some key concepts to consider. In this section, we take a look at ideas such as graceful degradation and a multicloud strategy and look at how experienced software architects bake these concepts into their software applications.


What is a multicloud strategy?

A multicloud strategy refers to the use of two or more cloud computing services.
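One way a Java application might make use of two cloud services is to read from them in priority order, moving on to the next when one fails. The following is only a minimal sketch of that idea: `BlobStore` and `MultiCloudReader` are hypothetical names invented for illustration, not part of any vendor SDK, and a real implementation would wrap each provider's own client behind the interface.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Optional;

/** Hypothetical abstraction over one provider's object storage service. */
interface BlobStore {
    String name();
    byte[] get(String key) throws Exception;
}

/** Tries each configured provider in order and returns the first successful read. */
final class MultiCloudReader {
    private final List<BlobStore> stores;

    MultiCloudReader(List<BlobStore> stores) {
        // Defensive copy so later changes to the caller's list do not affect us.
        this.stores = new ArrayList<>(stores);
    }

    Optional<byte[]> read(String key) {
        for (BlobStore store : stores) {
            try {
                return Optional.of(store.get(key));
            } catch (Exception e) {
                // This provider is unavailable (or returned nothing); try the next one.
                System.err.println(store.name() + " failed: " + e.getMessage());
            }
        }
        // Every provider failed; the caller decides how to degrade.
        return Optional.empty();
    }
}
```

The sketch assumes the same objects have already been replicated to every provider; keeping those copies in sync is a separate problem that it does not address.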


What is graceful degradation?

Graceful degradation is the ability of a computer, machine, electronic system or network to maintain limited functionality even when a large portion of it has been destroyed or rendered inoperative. The purpose of graceful degradation is to prevent catastrophic failure...
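In application code, the definition above often translates into serving reduced-quality data instead of an error when a dependency is down. The sketch below illustrates one possible approach under stated assumptions: `DegradingLoader` is a hypothetical class, the `remote` function stands in for any call to a cloud service, and the fallback policy (last good value, then a fixed default) is just one of many reasonable choices.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Function;

/** Wraps a remote lookup and falls back to the last good value, then to a default. */
final class DegradingLoader {
    /** A value plus a flag telling callers whether they received degraded data. */
    static final class Result {
        final String value;
        final boolean degraded;

        Result(String value, boolean degraded) {
            this.value = value;
            this.degraded = degraded;
        }
    }

    private final Function<String, String> remote; // may throw during an outage
    private final String defaultValue;
    private final Map<String, String> lastGood = new ConcurrentHashMap<>();

    DegradingLoader(Function<String, String> remote, String defaultValue) {
        this.remote = remote;
        this.defaultValue = defaultValue;
    }

    Result load(String key) {
        try {
            String fresh = remote.apply(key);
            lastGood.put(key, fresh); // remember the value for future outages
            return new Result(fresh, false);
        } catch (RuntimeException e) {
            // Dependency is down: serve stale data if we have it, else the default.
            String stale = lastGood.getOrDefault(key, defaultValue);
            return new Result(stale, true);
        }
    }
}
```

Returning the `degraded` flag rather than hiding the fallback lets the caller decide how to present limited functionality, for example by showing a notice that the content may be out of date.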


Addressing the cloud-based analysis paralysis

For many applications, an Amazon or Azure cloud might be overkill. In such cases, a managed services approach just might be the better option, and here's why.


Netflix employs multicloud deployment with Spinnaker

Learn how Netflix leveraged automation and Spinnaker to perform 4,000 multicloud deployments per day.

3. How to move forward

Keeping your apps up when Amazon is down

So, given an understanding of concepts such as graceful degradation and multicloud strategies, it's time to see how organizations have applied those concepts to their software development process and built resiliency into each part of the Java cloud app's application lifecycle management (ALM) process. Here we look at how to build Java cloud apps that will keep running, even if your cloud computing provider's engine has run out of steam.


Amazon S3 apps don't have to go down if Amazon does

Just because Amazon is down doesn't mean your S3 apps have to be. Here are some tips and strategies for protecting yourself against the next Amazon S3 outage.


What to know about applying ALM to the cloud app-dev process

What's hampering your cloud ALM efforts? Find out how three ALM myths can affect cloud app development and management.


How to keep your Java cloud apps running in an S3 outage

Many people had to learn the hard way that an Amazon S3 outage is possible. Here we look at how to keep your Java cloud apps running, even if the Amazon cloud isn't.
