10,000+ hand-picked resources
Navigate the Cloud Universe
StackLens is your intelligent companion for DevOps, SecOps, ML, and AI Engineering. Search hand-picked resources across the technical ecosystem.
SRE Resources
blog
Engineering Design Document: Reusable Observability Platform V2
A production-focused redesign of a Stage 6 LGTM observability platform, moving from a...
SRE
blog
Railway vs AWS: When Leaving Railway Means Owning Reliability
TL;DR I no longer recommend Railway as the default for serious production workloads after...
SRE
blog
Surviving the region you run in: failover on Aurora DSQL, and what the demo proves
How Quorum's failover layer works on Amazon Aurora DSQL multi-region clusters, an honest account of what the chaos demo simulates and what it does not, and where the survival story currently ends.
SRE
blog
I'm building a read-only context engine for Kubernetes and AI agents
kctx turns Kubernetes API state into compact operational context for humans, scripts, and AI SRE workflows.
SRE