In January 2026, Microsoft and NVIDIA released the second iteration of the NVIDIA Dynamo Planner—a groundbreaking tool for optimizing large language model (LLM) inference on Azure Kubernetes Service (AKS). This collaboration addresses one of the most challenging aspects of production AI: efficiently scaling GPU resources to balance cost, latency, and throughput. This comprehensive guide explores […]
Read more →Tag: Kubernetes
.NET Aspire 10: Cloud-Native Development from Local to Azure
.NET Aspire 10, released alongside .NET 10, represents Microsoft’s answer to the complexity of cloud-native development. It provides an opinionated, orchestrated approach to building distributed applications with built-in service discovery, health checks, telemetry, and deployment automation. This comprehensive guide explores Aspire’s architecture, the developer experience improvements in version 10, and patterns for deploying Aspire applications […]
Read more →Ingress NGINX Deprecation: Complete Migration Guide to Kubernetes Gateway API
In January 2026, the Kubernetes Steering and Security Response Committees issued a critical warning: Ingress NGINX will cease receiving security patches from March 2026. With approximately 50% of cloud-native environments relying on this controller, this deprecation represents one of the most significant infrastructure migrations in Kubernetes history. This comprehensive guide provides a step-by-step migration path […]
Read more →Kubernetes 1.35: In-Place Pod Resource Updates and AI Model Image Volumes
Kubernetes 1.35, released in January 2026 and now supported on Amazon EKS and EKS Distro, marks a significant milestone in container orchestration—particularly for AI/ML workloads. This release introduces In-Place Pod Resource Updates, allowing you to resize CPU and memory without restarting pods, and Image Volumes, a game-changer for delivering large AI models using OCI container […]
Read more →Case Study: Building a Modern FHIR Patient Timeline Explorer with .NET 10 and React 19
Executive Summary This case study explores the development of DooLittle Health Patient Timeline Explorer, a modern healthcare application that demonstrates enterprise-grade architecture patterns for FHIR-compliant patient data visualization. Built as a proof-of-concept, this project showcases best practices in full-stack development, cloud-native deployment, and healthcare interoperability standards. 🏥 HEALTHCARE INTEROPERABILITY SERIES This article is part of […]
Read more →Production Model Deployment Patterns: From REST APIs to Kubernetes Orchestration in Python
After deploying hundreds of ML models to production across startups and enterprises, I’ve learned that model deployment is where most AI projects fail. Not because the models don’t work—but because teams underestimate the engineering complexity of serving predictions reliably at scale. This article shares production-tested deployment patterns from REST APIs to Kubernetes orchestration. 1. The […]
Read more →