This report demystifies the frequently misunderstood metric of Mean Time to Resolution (MTTR). It clearly defines the four key "mean time" metrics and outlines their significance within a DevOps framework.
https://www.causely.ai/blog/mttr-meaning
www.causely.ai
The “R” in MTTR: Repair or Recover? What’s the difference?
Finding meaning in a world of acronyms There are so many ways to measure application reliability today, with hundreds of key performance indicators (KPIs) to measure availability, error rates, user experiences, and quality of service (QoS). Yet every organization…
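As a rough illustration of how a "mean time" metric is computed, the sketch below averages the gap between two timestamps across a set of incidents. The incident records are hypothetical, and measuring from detection to resolution is an assumption: which start and end events bound the "R" depends on the definition a team adopts, which is the ambiguity the linked post addresses.

```python
from datetime import datetime, timedelta

# Hypothetical incident log: (detected_at, resolved_at) pairs.
incidents = [
    (datetime(2024, 1, 1, 10, 0), datetime(2024, 1, 1, 10, 30)),
    (datetime(2024, 1, 5, 14, 0), datetime(2024, 1, 5, 15, 30)),
    (datetime(2024, 1, 9, 8, 0), datetime(2024, 1, 9, 9, 0)),
]


def mean_time_to_resolution(records):
    """Average of (resolved_at - detected_at) across all incidents."""
    if not records:
        raise ValueError("need at least one incident")
    total = sum((end - start for start, end in records), timedelta())
    return total / len(records)


mttr = mean_time_to_resolution(incidents)
print(mttr)
```

Swapping in different timestamp pairs (for example, alert time to first response) would yield a different "mean time" metric from the same function.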
Anvesh Muppeda provides a hands-on analysis of Kubernetes Endpoints and their modern, more scalable counterpart, EndpointSlices. The content uses practical examples to explain their crucial role in Kubernetes service discovery.
https://medium.com/@muppedaanvesh/a-hands-on-guide-to-kubernetes-endpoints-endpointslices-%EF%B8%8F-1375dfc9075c
Medium
⎈ A Hands-On Guide to Kubernetes Endpoints & EndpointSlices 🛠️
⇢ Understanding Kubernetes Endpoints and Endpoint Slices: A Comprehensive Guide
Kubernetes performance and scale test orchestration framework written in golang
https://github.com/kube-burner/kube-burner
Slim(toolkit): Don't change anything in your container image and minify it by up to 30x (and for compiled languages even more) making it secure too! (free and open source)
https://github.com/slimtoolkit/slim
Gergely Orosz's story offers a look inside the engineering culture at Bluesky, the decentralized social media platform. He explores how a small, autonomous team is working to build a viable alternative to established networks.
https://newsletter.pragmaticengineer.com/p/bluesky-engineering-culture
Pragmaticengineer
Inside Bluesky’s Engineering Culture
A deep dive into how a fully remote, open source-first, tech company is building a challenger social media platform. What can small teams learn from Bluesky about punching above your weight?
This dispatch explores how eBPF maps can be used to synchronize state across multiple nodes in a Kubernetes cluster. It delves into the technical details of leveraging this capability for advanced networking and observability.
https://medium.com/all-things-ebpf/ebpf-maps-state-synchronization-across-multi-node-kubernetes-cluster-d0c075810fa7
Medium
eBPF Maps State Synchronization across Multi-Node Kubernetes Cluster
eBPF Maps State Synchronization using Go and gRPC
The Agoda Engineering team details their "private cloud first" approach to load shedding in this feature. They explain how this strategy helps protect their most critical systems by gracefully degrading services during traffic surges.
https://medium.com/agoda-engineering/load-shedding-private-cloud-first-81ddd5ab53ac
Medium
How Agoda Handles Load Shedding in Private Cloud
by Johan Tiesinga
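For readers unfamiliar with load shedding, here is a minimal sketch of the general idea: when demand exceeds capacity, admit the most critical requests and reject the rest. This is not Agoda's implementation; the `Request` type, the priority scheme, and the service names are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Request:
    name: str
    priority: int  # 0 = most critical; larger numbers are shed first


def shed_load(requests, capacity):
    """Admit up to `capacity` requests, preferring lower priority numbers.

    sorted() is stable, so arrival order is preserved within a priority tier.
    """
    ranked = sorted(requests, key=lambda r: r.priority)
    return ranked[:capacity], ranked[capacity:]


# Hypothetical traffic surge: four requests, room for only two.
incoming = [
    Request("recommendations", 2),
    Request("checkout", 0),
    Request("search", 1),
    Request("analytics-ping", 3),
]
admitted, rejected = shed_load(incoming, capacity=2)
print([r.name for r in admitted])
print([r.name for r in rejected])
```

A production system would typically decide per request as it arrives rather than over a batch; the batch form is used here only to keep the example short.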
This item serves as a practical walkthrough for managing AWS EKS Access Entries with both Terraform and OpenTofu. It provides the necessary code and configuration to streamline Kubernetes user access management.
https://dev.to/aws-builders/managing-aws-eks-access-entries-with-terraform-and-opentofu-414
DEV Community
Managing AWS EKS access entries with Terraform and OpenTofu
Setting up EKS authentication and authorization using Infrastructure as Code 📚...
Simple but still extremely powerful K9S alternative. An interactive `explain` command. Security scanning based on `trivy`. Supports multiple envs. Midnight commander like interface. Custom hotkeys.
https://github.com/pymag09/kubecui
An AWS author provides a helpful review of using Terraform to provision and manage AWS IoT Things. The text demonstrates how to automate the lifecycle of IoT devices using infrastructure as code.
https://builder.aws.com/content/2iJVs17s7drNneJiB9GdYrNszWw/using-terraform-to-create-aws-iot-things
AWS Builder Center
The CloudKitchens tech publication examines the powerful self-healing mechanisms built into Kubernetes. It highlights how features like readiness probes and replication controllers enable the creation of resilient, automated systems.
https://techblog.cloudkitchens.com/p/kubernetes-self-healing
Cloudkitchens
From Fragile to Faultless: Kubernetes Self-Healing In Practice
Overcoming imperfections of managed Kubernetes with early self-healing.
Hwameistor is an HA local storage system for cloud-native stateful workloads.
https://github.com/hwameistor/hwameistor
This insightful article explores the complexities of managing long-lived connections within Kubernetes, offering solutions for ensuring they remain stable and reliable. The author delves into the technical challenges and provides practical examples to overcome them.
https://learnkube.com/kubernetes-long-lived-connections
LearnKube
Load balancing and scaling long-lived connections in Kubernetes
Kubernetes doesn't load balance long-lived connections, and some Pods might receive more requests than others. If you're using HTTP/2, gRPC, RSockets, AMQP or any other long-lived connection such as a database connection, you might want to consider client…
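The imbalance described above can be illustrated with a toy simulation. It assumes a backend is chosen only when a connection is opened, so every request sent over a reused connection lands on the same Pod. The Pod names, client counts, and the round-robin picker are hypothetical simplifications, not a model of how kube-proxy actually selects backends.

```python
from collections import Counter
from itertools import cycle

pods = ["pod-a", "pod-b", "pod-c"]  # hypothetical backends behind one Service


def per_request_balancing(n_requests):
    """Short-lived connections: every request picks the next pod in turn."""
    picker = cycle(pods)
    return Counter(next(picker) for _ in range(n_requests))


def per_connection_balancing(n_clients, requests_per_client):
    """Long-lived connections: each client is pinned to one pod at connect time."""
    picker = cycle(pods)
    counts = Counter()
    for _ in range(n_clients):
        pod = next(picker)  # chosen once, when the connection opens
        counts[pod] += requests_per_client
    return counts


short_lived = per_request_balancing(300)
long_lived = per_connection_balancing(n_clients=2, requests_per_client=150)
print(short_lived)
print(long_lived)
```

With two persistent clients and three Pods, one Pod receives no traffic at all, even though the total request count is the same in both runs.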
Mark Tinderholt's latest blog post provides a comprehensive walkthrough on setting up a Jekyll blog with Terraform, using Azure Static Web Apps and Azure Storage. This piece is a great resource for anyone looking to establish a low-cost and scalable personal website.
https://www.marktinderholt.com/infrastructure-as-code/terraform/azure/cloud/2024/10/24/setup-blog-jekyll-azure-storage.html
Mark Tinderholt’s Blog.
Automating Jekyll with Terraform: A Serverless Approach to Static Site Hosting on Azure
My plan is to create a private repository on GitHub to be the source of all of the content for my blog posts. This is where I will write and where I will maintain the original copy of every blog post I make. I can then distribute the blog posts to any online…
Kubernetes controller for GitHub Actions self-hosted runners
https://github.com/actions/actions-runner-controller
This new analysis from Honeycomb argues against the traditional use of incident severities, proposing a shift towards classifying incidents by type instead. It's a thought-provoking read that challenges common practices in incident management.
https://www.honeycomb.io/blog/against-incident-severities-favor-incident-types
Honeycomb
Against Incident Severities and in Favor of Incident Types
In this post, SRE Fred Hebert goes over why we decided to choose incident types instead of incident severities for incident response.
If you've ever wondered what Helm is and why it's so popular in the Kubernetes ecosystem, this excellent guide has you covered. The text breaks down the fundamentals of the package manager and explains its core concepts in an easy-to-understand way.
https://glasskube.dev/blog/what-is-helm-in-kubernetes/
glasskube.dev
What is Helm in Kubernetes? A complete guide | Glasskube
An in-depth overview of how Helm simplifies Kubernetes application management. Learn about Helm charts, key commands, and best practices in this comprehensive tutorial.