DevOps & SRE notes 2432

DevOps & SRE notes

This report demystifies the frequently misunderstood metric of Mean Time to Resolution (MTTR). It clearly defines the four key "mean time" metrics and outlines their significance within a DevOps framework.
https://www.causely.ai/blog/mttr-meaning

www.causely.ai

The “R” in MTTR: Repair or Recover? What’s the difference?

Finding meaning in a world of acronyms There are so many ways to measure application reliability today, with hundreds of key performance indicators (KPIs) to measure availability, error rates, user experiences, and quality of service (QoS). Yet every organization…

❤1

1.31K viewstutunak, 15:03

DevOps & SRE notes

Anvesh Muppeda provides a hands-on analysis of Kubernetes Endpoints and their modern, more scalable counterpart, EndpointSlices. The content uses practical examples to explain their crucial role in Kubernetes service discovery.
https://medium.com/@muppedaanvesh/a-hands-on-guide-to-kubernetes-endpoints-endpointslices-%EF%B8%8F-1375dfc9075c

Medium

⎈ A Hands-On Guide to Kubernetes Endpoints & EndpointSlices 🛠️

⇢ Understanding Kubernetes Endpoints and Endpoint Slices: A Comprehensive Guide

👍1

1.34K viewstutunak, 08:02

DevOps & SRE notes

Kubernetes performance and scale test orchestration framework written in golang

https://github.com/kube-burner/kube-burner

GitHub

GitHub - kube-burner/kube-burner: Kubernetes performance and scale test orchestration framework written in golang

Kubernetes performance and scale test orchestration framework written in golang - kube-burner/kube-burner

1.36K viewstutunak, 15:01

DevOps & SRE notes

Slim(toolkit): Don't change anything in your container image and minify it by up to 30x (and for compiled languages even more) making it secure too! (free and open source)

https://github.com/slimtoolkit/slim

GitHub

GitHub - slimtoolkit/slim: Slim(toolkit): Don't change anything in your container image and minify it by up to 30x (and for compiled…

Slim(toolkit): Don't change anything in your container image and minify it by up to 30x (and for compiled languages even more) making it secure too! (free and open source) - slimtoolkit/slim

👍1

1.25K viewstutunak, 08:04

DevOps & SRE notes

Gergely Orosz's story offers a look inside the engineering culture at Bluesky, the decentralized social media platform. He explores how a small, autonomous team is working to build a viable alternative to established networks.
https://newsletter.pragmaticengineer.com/p/bluesky-engineering-culture

Pragmaticengineer

Inside Bluesky’s Engineering Culture

A deep dive into how a fully remote, open source-first, tech company is building a challenger social media platform. What can small teams learn from Bluesky about punching above your weight?

👍2

1.37K viewstutunak, 15:01

DevOps & SRE notes

This dispatch explores how eBPF maps can be used to synchronize state across multiple nodes in a Kubernetes cluster. It delves into the technical details of leveraging this capability for advanced networking and observability.
https://medium.com/all-things-ebpf/ebpf-maps-state-synchronization-across-multi-node-kubernetes-cluster-d0c075810fa7

Medium

eBPF Maps State Synchronization across Multi-Node Kubernetes Cluster

eBPF Maps State Synchronization using Go and gRPC

❤2

1.43K viewstutunak, 08:04

DevOps & SRE notes

A terminal OpenTelemetry viewer inspired by otel-desktop-viewer

https://github.com/ymtdzzz/otel-tui

GitHub

GitHub - ymtdzzz/otel-tui: A terminal OpenTelemetry viewer inspired by otel-desktop-viewer

A terminal OpenTelemetry viewer inspired by otel-desktop-viewer - ymtdzzz/otel-tui

1.52K viewstutunak, 15:04

DevOps & SRE notes

Prometheus-based Kubernetes Resource Recommendations

https://github.com/robusta-dev/krr

GitHub

GitHub - robusta-dev/krr: Prometheus-based Kubernetes Resource Recommendations

Prometheus-based Kubernetes Resource Recommendations - robusta-dev/krr

👍1

1.4K viewstutunak, 08:02

DevOps & SRE notes

The Agoda Engineering team details their "private cloud first" approach to load shedding in this feature. They explain how this strategy helps protect their most critical systems by gracefully degrading services during traffic surges.
https://medium.com/agoda-engineering/load-shedding-private-cloud-first-81ddd5ab53ac

Medium

How Agoda Handles Load Shedding in Private Cloud

by Johan Tiesinga

❤2

1.4K viewstutunak, 15:01

DevOps & SRE notes

This item serves as a practical walkthrough for managing AWS EKS Access Entries with both Terraform and OpenTofu. It provides the necessary code and configuration to streamline Kubernetes user access management.
https://dev.to/aws-builders/managing-aws-eks-access-entries-with-terraform-and-opentofu-414

DEV Community

Managing AWS EKS access entries with Terraform and OpenTofu

Setting up EKS authentication and authorization using Infrastructure as Code 📚...

👍2

1.38K viewstutunak, 08:02

DevOps & SRE notes

Simple but still extremely powerful K9S alternative. An interactive explain command. Security scanning based on trivy. Supports multiple envs. Midnight commander like interface. Custom hotkeys

https://github.com/pymag09/kubecui

GitHub

GitHub - pymag09/kubecui: Simple but still extremely powerful K9S alternative. An interactive `explain` command. Security scanning…

Simple but still extremely powerful K9S alternative. An interactive `explain` command. Security scanning based on `trivy`. Supports multiple envs. Midnight commander like interface. Custom hotkeys ...

❤2👍1

1.47K viewstutunak, 15:04

DevOps & SRE notes

An AWS author provides a helpful review of using Terraform to provision and manage AWS IoT Things. The text demonstrates how to automate the lifecycle of IoT devices using infrastructure as code.
https://builder.aws.com/content/2iJVs17s7drNneJiB9GdYrNszWw/using-terraform-to-create-aws-iot-things

Aws

AWS Builder Center

Connect with builders who understand your journey. Share solutions, influence AWS product development, and access useful content that accelerates your growth. Your community starts here.

👍1

1.28K viewstutunak, 08:05

DevOps & SRE notes

The CloudKitchens tech publication examines the powerful self-healing mechanisms built into Kubernetes. It highlights how features like readiness probes and replication controllers enable the creation of resilient, automated systems.
https://techblog.cloudkitchens.com/p/kubernetes-self-healing

Cloudkitchens

From Fragile to Faultless: Kubernetes Self-Healing In Practice

Overcoming imperfections of managed Kubernetes with early self-healing.

❤3👍2

1.28K viewstutunak, 15:01

DevOps & SRE notes

Hwameistor is an HA local storage system for cloud-native stateful workloads.

https://github.com/hwameistor/hwameistor

GitHub

GitHub - hwameistor/hwameistor: Hwameistor is an HA local storage system for cloud-native stateful workloads.

Hwameistor is an HA local storage system for cloud-native stateful workloads. - GitHub - hwameistor/hwameistor: Hwameistor is an HA local storage system for cloud-native stateful workloads.

👍1

1.28K viewstutunak, 08:03

DevOps & SRE notes

This insightful article explores the complexities of managing long-lived connections within Kubernetes, offering solutions for ensuring they remain stable and reliable. The author delves into the technical challenges and provides practical examples to overcome them.
https://learnkube.com/kubernetes-long-lived-connections

LearnKube

Load balancing and scaling long-lived connections in Kubernetes

Kubernetes doesn't load balance long-lived connections, and some Pods might receive more requests than others. If you're using HTTP/2, gRPC, RSockets, AMQP or any other long-lived connection such as a database connection, you might want to consider client…

👍1

1.26K viewstutunak, 15:03

DevOps & SRE notes

Mark Tinderholt's latest blogpost provides a comprehensive walkthrough on setting up a Jekyll blog using Azure Static Web Apps and Azure Storage. This piece is a great resource for anyone looking to establish a low-cost and scalable personal website.
https://www.marktinderholt.com/infrastructure-as-code/terraform/azure/cloud/2024/10/24/setup-blog-jekyll-azure-storage.html

Mark Tinderholt’s Blog.

Automating Jekyll with Terraform: A Serverless Approach to Static Site Hosting on Azure

My plan is to create a private repository on GitHub to be the source of all of the content for my blog posts. This is where I will write and where I will maintain the original copy of every blog post I make. I can then distribute the blog posts to any online…

👍1

1.26K viewstutunak, 08:01

DevOps & SRE notes

Kubernetes controller for GitHub Actions self-hosted runners

https://github.com/actions/actions-runner-controller

GitHub

GitHub - actions/actions-runner-controller: Kubernetes controller for GitHub Actions self-hosted runners

Kubernetes controller for GitHub Actions self-hosted runners - actions/actions-runner-controller

👍2

1.31K viewstutunak, 15:00

DevOps & SRE notes

Library for managing git hooks

https://github.com/greenbone/autohooks

GitHub

GitHub - greenbone/autohooks: Library for managing git hooks

Library for managing git hooks. Contribute to greenbone/autohooks development by creating an account on GitHub.

👍1

1.26K viewstutunak, 08:01

DevOps & SRE notes

This new analysis from Honeycomb argues against the traditional use of incident severities, proposing a shift towards classifying incidents by type instead. It's a thought-provoking read that challenges common practices in incident management.
https://www.honeycomb.io/blog/against-incident-severities-favor-incident-types

Honeycomb

Against Incident Severities and in Favor of Incident Types

In this post, SRE Fred Hebert goes over why we decided to choose incident types instead of incident severities for incident response.

👍1

1.27K viewstutunak, 15:02

DevOps & SRE notes

If you've ever wondered what Helm is and why it's so popular in the Kubernetes ecosystem, this excellent guide has you covered. The text breaks down the fundamentals of the package manager and explains its core concepts in an easy-to-understand way.
https://glasskube.dev/blog/what-is-helm-in-kubernetes/

glasskube.dev

What is Helm in Kubernetes? A complete guide | Glasskube

An in-depth overview of how Helm simplifies Kubernetes application management. Learn about Helm charts, key commands, and best practices in this comprehensive tutorial.

🔥1

1.36K viewstutunak, 08:01

2025/10/17 00:50:23
Back to Top

HTML Embed Code:

<iframe width="100%" src="https://www.bootg.com/buyppe/web?embed=1" title="Telegram Web" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>