When AWS Lambda launched in 2014, it fundamentally changed how we think about infrastructure. No servers to provision, no capacity to plan, no patches to […]
Year: 2025
Tips and Tricks #129: Use Span<T> for Zero-Allocation String Parsing
Eliminate heap allocations when parsing strings by using Span<T>.
Building Real-Time Applications with Google Cloud Firestore: A Document Database Deep Dive
Executive Summary: Google Cloud Firestore provides a fully managed, serverless NoSQL document database designed for mobile, web, and server development. This comprehensive guide explores Firestore’s […]
Tips and Tricks #128: Implement Retry Logic for LLM API Calls
Handle rate limits and transient failures gracefully with exponential backoff.
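A minimal sketch of what exponential backoff can look like, for illustration only and not taken from the linked post. `TransientError`, `call_with_retry`, and all parameter names and defaults are hypothetical; a real client would catch whatever rate-limit or timeout exceptions its own LLM SDK raises.

```python
import random
import time


class TransientError(Exception):
    """Hypothetical stand-in for a rate-limit or transient API failure."""


def call_with_retry(fn, max_attempts=5, base_delay=1.0, max_delay=30.0,
                    sleep=time.sleep):
    """Call fn(), retrying on TransientError with exponential backoff.

    The delay ceiling doubles on each failed attempt (base_delay * 2**attempt),
    is capped at max_delay, and the actual wait is drawn uniformly from
    [0, ceiling] ("full jitter") so many clients do not retry in lockstep.
    """
    for attempt in range(max_attempts):
        try:
            return fn()
        except TransientError:
            # Out of attempts: surface the last error to the caller.
            if attempt == max_attempts - 1:
                raise
            ceiling = min(max_delay, base_delay * 2 ** attempt)
            sleep(random.uniform(0, ceiling))
```

Here `fn` would be a zero-argument callable wrapping the actual API request, for example `call_with_retry(lambda: client_call(prompt))`, where `client_call` is a placeholder for your own function. Passing `sleep` as a parameter is only a convenience so the waits can be observed or skipped in tests.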
Tips and Tricks #127: Cache LLM Responses for Cost Reduction
Implement semantic caching to avoid redundant LLM calls and reduce API costs.
Beyond Chatbots: Building Autonomous AI Agents That Actually Get Things Done
The AI landscape has shifted dramatically. While chatbots dominated the conversation for years, we’re now witnessing the emergence of something far more powerful: autonomous AI […]