Everything I know
Site Reliability Engineering
You can make your default branch
, and assign it a custom domain. Using Vercel.
Ask HN: What is the fastest way to ramp up on DevOps, k8 and GCP? (2021)
DevOps, SRE, and Platform Engineering (2021)
We're Reddit's Infrastructure team, ask us anything! (2018)
- Develop. Preview. Ship. (
- Examples of Now deployments you can use.
I forgot how to manage a server (2019)
- Self-hosted deployment server for your team.
Lobsters: What’s your container-less deployment process? (2019)
A developer goes to a DevOps conference (2019)
Deploy your side-projects at scale for basically nothing - Google Cloud Run (2020)
DevOps Questions & Exercises
Ops Lessons We All Learn The Hard Way (2020)
- Simple, secure devops tooling built to manage today's complex applications wherever you run your software. (
Book Recommendations for the Infrastructure Engineer
Ask HN: How do you make sure your servers are up as a single founder? (2020)
- Allows you and your software development team to implement DevOps automations in minutes rather than days.
Deploys at Slack (2020)
We Need DevOps for ML Data (2020)
- Curated list of awesome pipeline toolkits inspired by Awesome Sysadmin.
- Curated list of awesome open source sysadmin resources.
Using SRE to meet reliability challenges | Google Cloud (2020)
- DevOps as a Service.
- Automates infrastructure super fast at massive scale. It can be used for ad-hoc command execution, service deployment, configuration management and more. (
- Write unit tests in Python to test actual state of your servers configured by management tools like Salt, Ansible, Puppet, Chef and so on.
PagerDuty Incident Response Documentation
Building an online community around learning from incidents (2019)
The Rise of Platform Engineering (2020)
How we monitor our services at SourceHut (2020)
Reference checklist for going to production
- Create a complete cloud architecture on your Amazon Web Services, Google Cloud Platform or Microsoft Azure account. (
- Extensible platform for infrastructure management. (
What is DevOps? (2020)
- Security, Compliance & Performance for your Devops Workflows.
A List of Skills and PracticesWe Use to Train Our DevOps Internally (2020)
- Codified cloud security for DevOps. (
You Reap What You Code (2020)
How we use HashiCorp Nomad (2020)
Ask HN: Has anyone moved from Kubernetes to Nomad? (2020)
- Deploy your apps on any Cloud providers in just a few seconds. (
- Private NPM registry and Maven, RPM, DEB, PyPi and RubyGem Repository.
- Remote Access and Secure Deployments.
- Automatically build and deploy code from your repositories.
Cooking Infrastructure by Chef
- Open source feature toggle service. (
The golden age of configuration languages (2020)
School of SRE
Christine Dodrill: ex-SRE, Lightspeed (2020)
- Detect, track and alert on infrastructure drift. (
- Modern cloud native development environments. (
- DevOps community.
DevOps Maturity Framework
- Packaged Applications for Any Platform - Cloud, Container, Virtual Machine. (
Bitnami Library for Kubernetes
- Project management framework with deep philosophy underneath.
Site Reliability Engineer Interview Preparation Guide
- App automation done right. (
List of Devops Resources
- Git as a single source of truth. Build. Deploy to Kubernetes. Stay in sync. (
Zero-downtime deploys with DigitalOcean, GitHub, and Docker (2021)
Running Nomad for home server (2021)
- Curated Collection on Site Reliability Engineering.
We are far from a better Heroku for production apps in a hyper cloud (2021)
- Open-source, self-hostable Heroku and Netlify alternative. (
- Platform-As-Code. (
- ELT for the DataOps era.
- Collects system metrics from DigitalOcean Droplets.
- Modern Infrastructure as Code. Any cloud, any language. (
- Tiniest PaaS you've ever seen. Piku allows you to do git push deployments to your own servers. (
Awesome Incident Response
- Open source device management. (
- Reliability as Code: SRE automation at the tip of your fingers. (
To PaaS or not (2021)
SRE at Google: Our complete list of CRE life lessons (2021)
Bad Machinery: Managing Interrupts Under Load
Securing DevOps: Security in the Cloud (2018)
- Universal Release Tool (And More).
DevOps Cheat Sheets
- High Performance Software Architecture. (
- Enterprise-grade application building, deploying, monitoring platform.
DevOps Engineering Course for Beginners (2021)
How to improve your website’s uptime (2021)
- Deploy Databases and Services Easily for Development and Testing Pipelines. (
DevOps Engineer Crash Course (2021)
- Modern load testing & smoke testing for SRE and DevOps. (
Top-10 talks of SREcon18 Europe (2018)
The DevOps: A Concise Understanding to the DevOps Philosophy and Science. (Technical Report) (2021)
- Caching service for source code and external dependencies.
- Makes sure you don't accidentally deploy apps with missing or invalid environment variables.
- Fancy self-hosted monitoring tool. (
Ask HN: Solo-preneurs, how do you DevOps to save time? (2021)
How to Use Hydra as your Deployment Source of Truth (2021)
What to Ask in an SRE Technical Interview (2021)
DevOps Newsletters of Note
- Helps you to automate your application deployments using Python DSL. (
- Automated Certificate Management for DevOps. (
Learn-by-Doing Platforms for Dev, DevOps, and SRE Folks (2021)
- Platform for integration and automation across services and tools, taking actions in response to events. (
- Easy-to-use on-call management tool. (
- Service for managing and provisioning Bare Metal servers.
Scaled Agile DevOps Maturity Framework
- Enterprise transformation without the risk of culture change.
- Single-binary server that is all designed in order to make the provisioning of servers, platforms and applications easier.
Equinix Metal Images
- Cloud Incident and Response Simulations.
The Reports of Devops's death are greatly exaggerated (2021)
- Threat Modeling with HCL.
- Uptime monitoring with public status pages.
- Bootstrap HashiCorp Consul, Nomad, or Vault over SSH < 1 minute.
- OpenFaas provider for Nomad.
A Multi Cluster and Multi Orchestrator home lab (2021)
DevOps in academic research (2021)
Hetzner Pulumi Intro (2021)
The Operator Pattern in Nomad (2021)
- Brings all your DevOps data into one practical, personalized, extensible view. Ingest, analyze, and visualize data.
Fastly Resource Provider
OOPS (Learning from the incident you didn't have) writeup template (2021)
Ultimate DevSecOps library
Common Infrastructure Errors I've Made (2021)
Lightweight Experiment & Resource Monitoring
Howie: The Post-Incident Guide
- Dedicated Incident Analysis Platform.
- Opinionated infrastructure to take you from idea to production on day one. (
- Cloud Management and Automation Framework. (
Deployment from Scratch
- Complete guide to web application deployment. (
Awesome Event IDs
- Collection of Event ID resources useful for Digital Forensics and Incident Response.
- “housekeeping for clouds” - find leaky resources, manage quota limits, detect drift and clean up.
- Keep Your Containerized Applications Safe. (
- Declarative checker for website uptime to run continuously for monitoring.
- Incident response framework focused on remote live forensics.
OWASP DevSecOps Guideline
- Can help us to embedding security as a part of the development pipeline.
Site Reliability Engineering