Subscribe to receive notifications of new posts:

Speed & Reliability

Improving platform resilience at Cloudflare through automation

2024-10-09

We realized that we need a way to automatically heal our platform from an operations perspective, and designed and built a workflow orchestration platform to provide these self-healing capabilities across our global network. We explore how this has helped us to reduce the impact on our customers due to operational issues, and the rich variety of similar problems it has empowered us to solve....

Continue reading »
Improving platform resilience at Cloudflare through automation

MORE POSTS