Sourcify
Effortlessly find and manage open-source dependencies for your projects.

Cloud-Native Chaos Engineering for Resilient Kubernetes Environments.

LitmusChaos is a CNCF Graduated project providing an end-to-end framework for cloud-native chaos engineering. Its technical architecture is built on a Kubernetes-native design, utilizing Custom Resource Definitions (CRDs) to manage chaos experiments as declarative code. By 2026, LitmusChaos has solidified its position as the industry standard for platform teams transitioning from reactive monitoring to proactive resilience. It enables SREs to orchestrate complex failure scenarios—ranging from pod kills and network latency to cloud-provider API failures—integrated directly into CI/CD pipelines. The platform features ChaosCenter, a unified control plane for multi-tenant experiment management, and ChaosHub, a public repository of pre-built experiments. Its architecture supports GitOps workflows, allowing teams to version control their resilience tests alongside application code. The 2026 market landscape sees LitmusChaos as the primary open-source alternative to proprietary solutions like Gremlin, favored for its deep integration with the Prometheus/Grafana stack and its ability to run entirely within air-gapped or highly regulated environments.
LitmusChaos is a CNCF Graduated project providing an end-to-end framework for cloud-native chaos engineering.
Explore all tools that specialize in fault injection. This domain focus ensures LitmusChaos delivers optimized results for this specific requirement.
Declarative checks that run before, during, and after chaos injection to validate steady-state via HTTP, K8s, or Prometheus queries.
A centralized repository of reusable chaos experiments maintained by the community and vendors.
ChaosCenter allows multiple teams to share a single installation with isolated projects and permissions.
Native integration with Git repositories to trigger experiments based on code commits or deployment events.
The ability to run multiple concurrent faults (e.g., CPU hog + Network Latency) to simulate complex cascading failures.
Triggering experiments based on specific Kubernetes events or Prometheus alerts.
A proprietary calculation metric based on the success rate of probes during a chaos run.
Install the LitmusChaos helm chart into a dedicated namespace.
Access the ChaosCenter dashboard via Port-forward or Ingress.
Register the target Kubernetes cluster as a 'Chaos Delegate'.
Configure Chaos Probes (HTTP, CMD, K8s, or Prometheus) to define steady-state.
Browse ChaosHub to select a pre-defined experiment (e.g., Pod Delete).
Define the chaos workflow using the visual experiment builder.
Configure Blast Radius and Fault Injection parameters in YAML.
Schedule the experiment or trigger it via a CI/CD pipeline hook.
Monitor real-time execution logs and probe success/failure rates.
Analyze the Resilience Score and export findings for post-mortem analysis.
All Set
Ready to go
Verified feedback from other users.
"Highly praised for its Kubernetes-native approach and extensive experiment library, though some users find the initial learning curve for custom experiments steep."
Post questions, share tips, and help other users.
Effortlessly find and manage open-source dependencies for your projects.

End-to-end typesafe APIs made easy.

Page speed monitoring with Lighthouse, focusing on user experience metrics and data visualization.

Topcoder is a pioneer in crowdsourcing, connecting businesses with a global talent network to solve technical challenges.

Explore millions of Discord Bots and Discord Apps.

Build internal tools 10x faster with an open-source low-code platform.

Open-source RAG evaluation tool for assessing accuracy, context quality, and latency of RAG systems.

AI-powered synthetic data generation for software and AI development, ensuring compliance and accelerating engineering velocity.