Rating is available when the video has been rented.
This feature is not available right now. Please try again later.
Published on Feb 28, 2013
http://strataconf.com/ There's been a huge amount of progress in recent years in developing distributed systems that are resilient to all sorts of faults. However, there's one critical category of errors that has largely been ignored: human error. The scope and potential impact of human error is massive: deployed bugs, accidentally deleting data, accidentally DDOS'ng important internal services, and so on. Designing for human fault-tolerance leads to important conclusions on the fundamental ways data systems should be architected.