August 16, 2011 – After a power failure caused disruptions and “stuck” work loads with its cloud service, Amazon states it will add redundancy, more isolation to electrical controllers and added recovery capabilities at its data centers and among vendors.

Amazon published a postmortem on the details and architecture behind its second recent incident with its Amazon Web Services, which knocked out power and made some workloads inaccessible for hours and without recovery snapshots for days. On Aug. 7, an unknown power surge – originally but erroneously attributed to a lightning strike that downed a transformer – blitzed EU West, Amazon’s data center in Dublin, Ireland. Backup power did not kick in like expected, causing stuck data loads and disconnects with cloud services such as its Elastic Compute Cloud or Elastic Block Store, according to the postmortem.

Register or login for access to this item and much more

All Information Management content is archived after seven days.

Community members receive:
  • All recent and archived articles
  • Conference offers and updates
  • A full menu of enewsletter options
  • Web seminars, white papers, ebooks

Don't have an account? Register for Free Unlimited Access