#reliability-engineering

[ follow ]
Artificial intelligence
fromInfoQ
2 weeks ago

QConAI NY 2025 - Designing AI Platforms for Reliability: Tools for Certainty, Agents for Discovery

Reliable agentic AI combines probabilistic model components with deterministic boundaries and integrates models as layers over operational systems rather than replacements.
Software development
fromInfoQ
3 weeks ago

AWS Debuts "DevOps Agent" to Automate Incident Response and Improve System Reliability

AWS DevOps Agent is an autonomous, always-on on-call engineer that integrates with observability, deployment, and ticketing tools to automate incident response and improve reliability.
Software development
fromInfoQ
3 months ago

From Grassroots to Enterprise: Vanguard's Journey in SRE Transformation

Vanguard built an enterprise SRE program from minimal resources into an organization-wide job family, emphasizing performance, resilience, coaching, and technical solutions.
[ Load more ]