A Practical Guide to Chaos Engineering
Modern systems built on cloud technologies and microservices architecture have a lot of dependencies on the internet, infrastructure, and services that you do not have control over. We cannot control or avoid failures in distributed systems, but we can control the impact radius of the failure and optimize the time to recover and restore the systems. This can be achieved only by exercising as many failures as we can in the test lab, thus achieving confidence in the system’s resilience., says Jitendra Nath Lella, Senior Architect, Delivery, Cigniti Technologies.