{"id":14913,"date":"2020-09-21T20:15:36","date_gmt":"2020-09-21T14:45:36","guid":{"rendered":"https:\/\/cigniti.com\/blog\/?p=14913"},"modified":"2022-07-29T11:29:23","modified_gmt":"2022-07-29T05:59:23","slug":"chaos-testing-software-resilience-testing","status":"publish","type":"post","link":"https:\/\/www.cigniti.com\/blog\/chaos-testing-software-resilience-testing\/","title":{"rendered":"How Chaos Testing helps build and deliver resilient software"},"content":{"rendered":"

In\u00a0<\/span>the real world<\/span>, things are rarely perfect.<\/span>\u00a0<\/span><\/p>\n

Software fails. Outages happen. Cyberattacks occur.<\/span>\u00a0<\/span><\/p>\n

Consequently, customers suffer from non-availability of the needed application and long downtime. Businesses suffer from lost revenues and bruised reputation.<\/span>\u00a0<\/span><\/p>\n

Is it possible to develop and deploy a software that never faces any issues? Probably not. But it is indeed possible to minimize the impact of those issues by designing resiliency in the software with the help of chaos testing or chaos engineering.\u00a0<\/span>\u00a0<\/span><\/p>\n

\u201cPrinciples of Chaos Engineering\u201d defines the practice as the \u201cd<\/span>iscipline of experimenting on a distributed system in order to build confidence in the system\u2019s capacity to withstand turbulent conditions in production.<\/span>\u201d<\/span>\u00a0<\/span><\/p>\n

This involves intentional introduction of<\/span>\u00a0failure into a software system to measure the system\u2019s ability to tackle with it and evaluate the impact of the failure on the system\u2019s availability and durability.<\/span>\u00a0<\/span><\/p>\n

The concept of Chaos Engineering was introduced and developed by Netflix to test the resilience of their IT infrastructure for ensuring a seamless experience for their customers. They called it \u2018Chaos Monkey\u2019.<\/span>\u00a0<\/span><\/p>\n

Antonio Garc\u00eda Mart\u00ednez<\/span>, the author of the book \u2018Chaos Monkeys\u2019 explains the concept as \u2013<\/span>\u00a0<\/span><\/p>\n

“Imagine a monkey entering a ‘data center’, these ‘farms’ of servers that host all the critical functions of our online activities. The monkey randomly rips cables, destroys devices and returns everything that passes by the hand [i.e. flings excrement]. The challenge for IT managers is to design the information system they are responsible for so that it can work despite these monkeys, which no one ever knows when they arrive and what they will destroy.”<\/span>\u00a0<\/span><\/p>\n

The concept might seem counterintuitive, but is a brilliant tool to prepare your software against any potential issues. Instead of waiting for an error to occur for implementing the fix, chaos engineering or chaos testing takes a proactive approach.<\/span>\u00a0<\/span><\/p>\n

By undertaking the practice of chaos testing, organizations have more control over the introduced error. With this benefit, they can quickly identify hidden defects, vulnerabilities, and other issues, which may\u00a0<\/span>never have presented themselves during the traditional testing processes. Having such critical errors lying around may prove fatal in the long run.<\/span>\u00a0<\/span><\/p>\n

The impact of software failures<\/span><\/b>\u00a0<\/span><\/h4>\n

History has been a witness to numerous incidents where the tiniest of bug in a software led to catastrophic outcomes.\u00a0<\/span>\u00a0<\/span><\/p>\n

Recently, London\u2019s Heathrow Airport was hit by technical issues, impacting more than hundreds of flights. And, the two Boeing 737 Max incidents are unforgettable.<\/span>\u00a0<\/span><\/p>\n

A few weeks ago, the video conferencing application Zoom went down for hours, which disrupted meetings and classes\u00a0<\/span>around the United States.<\/span>\u00a0<\/span><\/p>\n

A recent cyberattack on\u00a0<\/span>Brno University Hospital in the Czech Republic\u00a0<\/span>caused<\/span>\u00a0an immediate computer shutdown in the midst of the coronavirus outbreak.<\/span>\u00a0<\/span><\/p>\n

These are just a few examples from an ocean of incidents that happen every year due to software glitches and unresolved defects.<\/span>\u00a0<\/span><\/p>\n

These failures cause an organization to be the talk of the town for all the wrong reasons. Not only does their business and revenue get impacted, but their hard-earned reputation also has to face a heavy blow.<\/span>\u00a0<\/span><\/p>\n

Mean Time To Recovery (MTTR) vs. Mean Time To Failure (MTTF)<\/span><\/b>\u00a0<\/span><\/h4>\n

Traditionally, organizations take pride in the longest time that they have had without any outage or technical issues. This means that they rely on the Mean Time To Failure (MTTF), which is the average time for which a system operates before failing.<\/span>\u00a0<\/span><\/p>\n

But, the irony is that the customer won\u2019t remember all the time a software system worked perfectly, but will never forget the ONE time that it didn\u2019t.<\/span>\u00a0<\/span><\/p>\n

Therefore, it is high time that organizations shift their focus from the Mean Time To Failure to Mean Time to Recovery.<\/span>\u00a0<\/span><\/p>\n

The Mean Time To Recovery is the average time that a software system takes to recover from a failure. Now, the objective for the global organizations should be to minimize the Mean Time To Recovery to\u00a0<\/span>such an extent that the customers do not even notice when an issue occurs. And this can be made possible with the help of Chaos testing.<\/span>\u00a0<\/span><\/p>\n

Chaos testing and DevOps<\/span><\/b>\u00a0<\/span><\/h4>\n

Let\u2019s get one thing clear \u2013 to systemically inject an error into a software system in no way means to cause any impact on the end customers. Whatever experimentation with the deliberate introduction of chaos\u00a0<\/span>happens<\/span>\u00a0must remain behind the curtains of a well-functioning application for the end users.<\/span>\u00a0<\/span><\/p>\n

This cannot be feasible to do with the legacy software development and testing methodologies. Chaos engineering complements well only in a DevOps setup where automation is integral to the entire development and software testing process, a continuous monitoring and feedback loop is established, and there is scope for continuous improvement.<\/span>\u00a0<\/span><\/p>\n

When a fault is injected into a system, several bugs are uncovered and vulnerabilities identified. With the help of a DevOps practice, these defects can be resolved in real time and automation can be deployed for future occurrences.<\/span>\u00a0<\/span><\/p>\n

Chaos testing, combined with DevOps, is the ultimate way of developing and delivering highly fault-tolerant and resilient software.<\/span>\u00a0<\/span><\/p>\n

How can we help<\/span><\/b>\u00a0<\/span><\/h4>\n

Achieving fast and continuous development & deployment of business-critical cloud-based applications across diverse platforms requires seamless collaboration among development,\u00a0<\/span>test automation<\/span><\/a>, & operations teams.<\/span>\u00a0<\/span><\/p>\n

At Cigniti, we standardize efforts<\/span>, ascertain software resilience,<\/span>\u00a0and ensure accelerated time-to-market with\u00a0<\/span>DevOps testing<\/span><\/a>\u00a0solutions. We focus on delivering improved deployment quality with greater operational efficiency. Our DevOps testing specialists with their deep experience in Continuous Integration (CI) testing & Continuous Deployment (CD) help configure & execute popular CI\/CD tools supporting your DevOps transformation & application testing<\/a> efforts.<\/span>\u00a0<\/span><\/p>\n

Schedule a discussion<\/span><\/a>\u00a0with us today.<\/span>\u00a0<\/span><\/p>\n","protected":false},"excerpt":{"rendered":"

In\u00a0the real world, things are rarely perfect.\u00a0 Software fails. Outages happen. Cyberattacks occur.\u00a0 Consequently, customers suffer from non-availability of the needed application and long downtime. Businesses suffer from lost revenues and bruised reputation.\u00a0 Is it possible to develop and deploy a software that never faces any issues? Probably not. But it is indeed possible to […]<\/p>\n","protected":false},"author":20,"featured_media":14914,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[735],"tags":[3265,3268,3269,3264,3267,3270,3266,319,2050,170,432,214],"ppma_author":[3727],"class_list":["post-14913","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-devops-testing","tag-chaos-engineering","tag-chaos-engineering-in-devops","tag-chaos-monkey-testing","tag-chaos-testing","tag-chaos-testing-microservices","tag-chaos-testing-scenerios","tag-devops-enviornment","tag-devops-methodology","tag-devops-qa","tag-devops-testing","tag-performance-engineering","tag-performance-testing"],"authors":[{"term_id":3727,"user_id":20,"is_guest":0,"slug":"cigniti","display_name":"Cigniti Technologies","avatar_url":{"url":"https:\/\/www.cigniti.com\/blog\/wp-content\/uploads\/120X120-1.png","url2x":"https:\/\/www.cigniti.com\/blog\/wp-content\/uploads\/120X120-1.png"},"user_url":"https:\/\/www.cigniti.com\/","last_name":"Technologies","first_name":"Cigniti","job_title":"","description":"Cigniti is the world\u2019s leading AI & IP-led Digital Assurance and Digital Engineering services company with offices in India, the USA, Canada, the UK, the UAE, Australia, South Africa, the Czech Republic, and Singapore. We help companies accelerate their digital transformation journey across various stages of digital adoption and help them achieve market leadership."}],"_links":{"self":[{"href":"https:\/\/www.cigniti.com\/blog\/wp-json\/wp\/v2\/posts\/14913"}],"collection":[{"href":"https:\/\/www.cigniti.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.cigniti.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.cigniti.com\/blog\/wp-json\/wp\/v2\/users\/20"}],"replies":[{"embeddable":true,"href":"https:\/\/www.cigniti.com\/blog\/wp-json\/wp\/v2\/comments?post=14913"}],"version-history":[{"count":0,"href":"https:\/\/www.cigniti.com\/blog\/wp-json\/wp\/v2\/posts\/14913\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.cigniti.com\/blog\/wp-json\/wp\/v2\/media\/14914"}],"wp:attachment":[{"href":"https:\/\/www.cigniti.com\/blog\/wp-json\/wp\/v2\/media?parent=14913"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.cigniti.com\/blog\/wp-json\/wp\/v2\/categories?post=14913"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.cigniti.com\/blog\/wp-json\/wp\/v2\/tags?post=14913"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.cigniti.com\/blog\/wp-json\/wp\/v2\/ppma_author?post=14913"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}