Navigate the Cloud Universe
StackLens is your intelligent companion for DevOps, SecOps, ML, and AI Engineering. Search hand-picked resources across the technical ecosystem.
Grafana – The Open Observability Platform
Grafana is a leading open-source platform for monitoring and observability. Query, visualize, alert on, and understand your metrics, logs, and traces with dashboards.
Prometheus – Monitoring System & Time Series DB
Prometheus is an open-source systems monitoring and alerting toolkit. It collects and stores metrics as time series data, with powerful queries via PromQL.
OpenTelemetry – Vendor-Neutral Observability Framework
OpenTelemetry provides a collection of APIs, SDKs, and tools to instrument, generate, collect, and export telemetry data (metrics, logs, and traces).
The Roadmap to DevOps in 2024
A comprehensive guide on how to become a DevOps engineer in 2024 with a clear learning path.
Elastic Stack (ELK) – Search, Observe, Protect
The Elastic Stack (Elasticsearch, Logstash, and Kibana) is a widely used log management platform. Collect, parse, and visualize many types of data.
AWS CloudWatch – Official Documentation
Amazon CloudWatch is a monitoring and management service that provides data and actionable insights for AWS resources, applications, and services. Set alarms, log metrics, and more.
Datadog – Cloud Monitoring as a Service
Datadog is a monitoring and security platform for cloud applications. It brings together end-to-end traces, metrics, and logs, making your stack fully observable.
Grafana + Prometheus – Full Stack Monitoring Tutorial
Complete guide to setting up a production-grade monitoring stack with Prometheus for metrics collection and Grafana for visualization and alerting.
Azure Monitor – Full Observability for Azure
Azure Monitor collects, analyzes, and acts on telemetry from your Azure and on-premises environments. Includes Application Insights, Log Analytics, and more.
New Relic – Full-Stack Observability Platform
New Relic provides full-stack observability for your entire software stack. Monitor APM, infrastructure, logs, browser, mobile, and synthetics from one platform.
The Linux Command Line
A complete introduction to the Linux shell and command line.
Loki – Like Prometheus but for Logs
Grafana Loki is a horizontally scalable, highly available, multi-tenant log aggregation system inspired by Prometheus. Designed to be cost-effective and easy to operate.
Google Cloud Monitoring (formerly Stackdriver)
Google Cloud's operations suite provides monitoring, logging, and diagnostics for applications running on Google Cloud and beyond.
Zabbix – Enterprise-Class Monitoring
Zabbix is a mature, enterprise-level platform designed to monitor networks, servers, cloud, applications, and services. Fully open source with no limits on hosts.
Jaeger – End-to-End Distributed Tracing
Jaeger is an open-source, end-to-end distributed tracing system used for monitoring microservices-based distributed systems. It is a CNCF graduated project.
Docker Curriculum
A comprehensive tutorial on getting started with Docker and containers.
Nagios – IT Infrastructure Monitoring
Nagios is one of the most widely used open-source monitoring solutions. Monitor hosts, services, and network devices with powerful alerting and notification capabilities.
Kubernetes Documentation
Official documentation for the Kubernetes container orchestration system.
VictoriaMetrics – Fast & Scalable Monitoring
VictoriaMetrics is a fast, cost-saving, and scalable monitoring solution and time series database. It is positioned as a drop-in replacement for Prometheus with a focus on performance.
Cloud Native Computing Foundation (CNCF) Landscape
The complete map of the cloud native ecosystem.
Terraform Best Practices
A guide to structuring Terraform projects for production.