Navigate the Cloud Universe
StackLens is your intelligent companion for DevOps, SecOps, ML, and AI Engineering. Search hand-picked resources across the technical ecosystem.
DevOps Resources
trimstray/the-book-of-secret-knowledge
A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more.
hashicorp/terraform
Terraform enables you to safely and predictably create, change, and improve infrastructure. It is a source-available tool that codifies APIs into declarative configuration files that can be shared amongst team members, treated as code, edited, reviewed, and versioned.
musistudio/claude-code-router
Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.
MichaelCade/90DaysOfDevOps
This repository started out as a learning in public project for myself and has now become a structured learning map for many in the community. We have 3 years under our belt covering all things DevOps, including Principles, Processes, Tooling and Use Cases surrounding this vast topic.
The Linux Command Line
A complete introduction to the Linux shell and command line.
Docker Curriculum
A comprehensive tutorial on getting started with Docker and containers.
Terraform 1.8 Adds Provider-Defined Functions
HashiCorp announces Terraform 1.8 with a major new feature: functions defined by providers.
Docker Desktop 4.30 includes Docker Build Cloud integration
The latest version of Docker Desktop makes it easier to use remote build capabilities for faster container image creation.
Terraform Best Practices
Guide on how to structure your Terraform projects for production.
AI Isn't Something to Trust — It's Something to Design (Series Final)
Series Final. The four mechanisms covered across this series — knowledge graph, Auto Review, Self-Healing, Recurrence Prevention — plus the non-engineer-PR application that sits on top of them, all hang off a single conviction: AI isn't something to trust; it's something to design. The 'I don't trust AI to fill in the blanks for me' framing this lives inside isn't doubt about generation quality, but the clear-eyed acceptance that AI has no idea what context wasn't handed to it, and that 'ideal behavior with no spec given' is a fantasy. The starting point goes back to 2025, when I was trying to figure out how to make AI actually understand a large codebase — and ran into walls on both context window scaling (lost in the middle, attention dilution) and learning-based approaches (machine unlearning, destructive interference). GraphRAG + MCP became the way out: hand AI only the facts it needs, when it needs them, so it doesn't have to infer. From code-graph (which I burned two months on and threw away) to the current product-graph (cpg). This piece is the philosophy and the trial-and-error behind the whole series: harnesses confine where hallucinations are allowed to happen, design is translating principles into your own use cases, and Coverage 90% as a solo target breaks the implementation.
You Fixed the Rate Limits. Now Your Agent Fails Quietly.
Every capacity fix - retries, fallbacks, caching - buys availability by acting on output it didn't freshly earn. Why uptime and correct uptime are different SLOs, and how to engineer the second one.
The Real AI Coding Breakthrough Is Not More Context. It Is Better Diagnostics.
When I started building what became Scarab Diagnostic Suite, I was not trying to create a theory of...
The $0 Bug That Cost Us $1,800 in API Calls
Last quarter our OpenAI bill went from $620 to $2,480 in 23 days. No new features shipped. No...
Testing Neovim in a Container with Finch (like Docker)
So developers like CI... for everything! We do this because we like things to be automated....
How I Recovered 35GB on a Production Server by Moving Docker Builds Off It
And why your server should never be your build machine It started with a simple task, deploy a new...
How I Recovered 35GB on a Production Server by Moving Docker Builds Off It
How I Recovered 35GB on a Production Server by Moving Docker Builds Off It ...
Fixing GHCR “Unauthorized” + Docker “Cannot perform interactive login from non-TTY” in GitHub Actions + SSH Deployments
Fixing GHCR “Unauthorized” + Docker “Cannot perform interactive login from non-TTY” in GitHub...
Claude Code TDD: Force Red-Green-Refactor with Hooks & CLAUDE.md (2026)
The problem with AI-assisted TDD isn't that Claude can't write tests — it's that without constraints,...
Your AI Provider Is a Single Point of Failure
Last Friday, the U.S. Commerce Department sent a letter to Anthropic. By that evening, Fable 5 and...
Your Terraform state file is a plaintext secrets store. Mine was too.
Here's a fact that's easy to forget until it bites you: a Terraform state file stores resource...
GitLab Orbit maps your whole SDLC. It still can't tell you what an infrastructure change will break.
GitLab Orbit is an excellent symbol-and-SDLC graph. It is also the clearest illustration yet of the...
Agentic loops don't fix lying agents
Published June 15, 2026 by gyorgy The current discourse says you should stop prompting coding agents...
Deploying Gemma 4 26B on Proxmox: IaC Setup with Terraform, Ansible & AMD iGPU
A complete guide to automating a local AI stack on Proxmox LXC using Terraform and Ansible, including Open-WebUI and AMD Radeon Vega iGPU workarounds.
Run Real Docker on Android — No Root, No Tricks, Just QEMU
Turn your old Android phone into a self-contained Docker host using Termux + QEMU. Battle-tested on a Samsung Galaxy Note 10+ with full reboot survival.
Automating Proxmox Virtual Machine Deployment Using Terraform and Cloud-Init Templates
The Problem: I had been manually creating Ubuntu Virtual Machines within my Proxmox Home Lab....