Continuous Resilience Testing in AWS Environments with Advanced Fault Injection Techniques
Keywords:
Resilience testing, Fault injection, AWS FIS,, Cloud reliability, System robustness, Automated recovery, Chaos engineeringAbstract
Advanced fault injection approaches for continuous resilience testing have significantly enhanced the robustness and dependability of cloud-based systems. The implementation of these strategies in Amazon Web Services (AWS) environments is the main emphasis of this study, which makes use of AWS CloudWatch, AWS X-Ray, AWS Step Functions, AWS Lambda, and AWS Fault Injection Simulator (FIS). The system has become significantly better at handling and recovering from numerous failure scenarios, such as network slowness, CPU and memory load, API errors, and instance terminations. Key findings indicate that the system successfully handled the increased load without crashing, stabilized resource use after earlier rises, and remained to function despite API errors. It also maintained acceptable performance levels with only a 10% increase in latency during simulated delays. service availability was maintained by auto-scaling methods that quickly replaced terminated instances. Maintaining systems' robustness and reliability under pressure requires proactive fault injection and real-time monitoring. The aforementioned approach not only detects and addresses such vulnerabilities but also guarantees the continued stability and dependability of systems, enabling them to withstand unforeseen malfunctions and sustain service availability in frequently changing cloud environments.
Downloads
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.