A production-grade, asynchronous Incident Management System designed to monitor a complex distributed stack (APIs, Caches, Databases, Queues, RDBMs, NoSQL) and manage failure mediation workflows with ...