Not only do they deploy every 11.6s. Since the changed their processes back in 2006 they have 75% fewer outages, 90% fewer outage minutes and - in case something happens (which only happens 0.001% of the deployments) they can do an automatic rollback. They key to this is smaller changes on a more frequent basis.
However – even small changes can have severe impacts. Examples?
- Memory Leaks in Production: Introduced by a not well tested remote logging framework downloaded on GitHub
- Performance Impact of Exceptions in Ops: Ops and Dev did not follow the same deployment steps (lack of automation scripts) resulting in thousands of exceptions and maxes out CPU on all app servers