My Recommended Kubernetes Resources for Newbies
A curated list of recommended books, tools, tutorials, and videos for beginners starting their journey to learn Kubernetes.
A curated list of recommended books, tools, tutorials, and videos for beginners starting their journey to learn Kubernetes.
A veteran software engineer observes a shift from wholesale cloud migration to a hybrid approach, with companies keeping workloads on-prem unless there's a compelling cloud reason.
A personal account of attending the EMF Camp, a tech and maker festival, including experiences with talks, installations, and volunteering on the event's phone infrastructure.
A technical guide to Kubernetes logging layers, focusing on debug vs. security logs and practical tuning advice without vendor bias.
Analyzes the complex total cost of ownership for deploying generative AI models in production, beyond just raw compute expenses.
A recap of the Nutanix .NEXT 2024 conference, covering key announcements like NXRail, Nutanix Central GA, and Project Beacon, along with technical breakout sessions.
Explains DNS resolution on Linux and Kubernetes, focusing on troubleshooting a common Kubernetes DNS warning message.
Explains how to use AWS EBS-backed ephemeral volumes in Kubernetes for temporary storage, avoiding node disk pressure and multi-attach errors.
Explains how to use LiteLLM to load balance OpenAI API calls across multiple Azure instances to avoid rate limits.
A retrospective on organizing and running ThanosCon, a co-located event at KubeConEU 2024, focused on the Thanos open-source project for metrics.
Explains how Kubernetes exposes metrics for monitoring, covering the Metrics API, Kubelet/cAdvisor, and different metric categories.
A tutorial on using the kr8s Python library to manage and automate Kubernetes resources, covering setup, creation, and listing.
A guide to running Stable Diffusion's AUTOMATIC1111 WebUI on Google Kubernetes Engine (GKE) Autopilot using a Google Deep Learning Container.
A guide applying the 12-factor app methodology principles to writing efficient, secure, and maintainable Dockerfiles for containerized applications.
A guide to deploying and running your own LLM on Google Kubernetes Engine (GKE) Autopilot for control, privacy, and cost management.
How to cordon and drain nodes in GKE Autopilot for node replacement, including a disruptive cluster-wide method.
Guide to upgrading a GKE Autopilot cluster to version 1.28 to enable CUDA 12 support for NVIDIA GPU workloads.
Explains how to enable Google GKE Image Streaming for public DockerHub images using a remote Artifact Registry repository as a mirror.
A quick guide to finding the NVIDIA GPU driver version running on a Google Kubernetes Engine (GKE) cluster using a kubectl command.
A tutorial on setting up and running PyFlink streaming data jobs on a Kubernetes cluster, including prerequisites and deployment steps.