AI/ML – Page 4 – C4: Container, Code, Cloud & Context

Building Your First AI Agent with Microsoft Agent Framework (.NET) – Part 2

Posted on October 8, 2025 by Nithin Mohan TK 10 min read

Build a production-ready Customer Support AI agent using C# and .NET 8. Complete tutorial covering project setup, tools, multi-turn conversations, middleware, and error handling.

Read more →

Evaluating Agent Performance: Metrics and Testing Strategies

Posted on October 5, 2025 by Nithin Mohan TK 12 min read

Evaluating agent performance is harder than evaluating models. After developing evaluation frameworks for 10+ agent systems, I’ve learned what metrics matter and how to test effectively. Here’s the complete guide to evaluating agent performance. Figure 1: Agent Evaluation Metrics Framework Why Agent Evaluation is Different Agent evaluation is more complex than model evaluation: Multi-step reasoning: […]

Read more →

Introduction to Microsoft Agent Framework: The Open-Source Engine for Agentic AI Apps (Part 1)

Posted on October 1, 2025 by Nithin Mohan TK 10 min read

Learn about Microsoft Agent Framework (MAF), the unified open-source SDK for building production-ready AI agents. This comprehensive guide covers the architecture, key features, and how MAF combines the best of Semantic Kernel and AutoGen for enterprise agentic AI development.

Read more →

Streaming Responses for LLMs: Implementing Server-Sent Events

Posted on June 15, 2025 by Nithin Mohan TK 10 min read

Streaming LLM responses dramatically improves user experience. After implementing streaming for 20+ LLM applications, I’ve learned what works. Here’s the complete guide to implementing Server-Sent Events for LLM streaming. Figure 1: Streaming Architecture Why Streaming Matters Streaming LLM responses provides significant benefits: Perceived performance: Users see results immediately, not after 10+ seconds Better UX: Progressive […]

Read more →

Quantization Methods for LLMs: GPTQ, AWQ, and BitsAndBytes

Posted on April 8, 2025 by Nithin Mohan TK 5 min read

Last year, I needed to run a 13B parameter model on a 16GB GPU. Full precision required 52GB. After testing GPTQ, AWQ, and BitsAndBytes, I reduced memory to 7GB with minimal accuracy loss. After quantizing 30+ models, I’ve learned which method works best for each scenario. Here’s the complete guide to LLM quantization. Figure 1: […]

Read more →

Advanced LoRA Techniques: Multi-LoRA, LoRA+, and Beyond

Posted on March 15, 2025 by Nithin Mohan TK 6 min read

Last year, I fine-tuned a 7B parameter model with standard LoRA. It worked, but accuracy was 5% lower than full fine-tuning. After experimenting with Multi-LoRA, LoRA+, and advanced techniques, I’ve achieved 98% of full fine-tuning performance with 1% of the parameters. Here’s everything you need to know about advanced LoRA techniques. Figure 1: LoRA Techniques […]

Read more →

Searching in

Category: AI/ML

Building Your First AI Agent with Microsoft Agent Framework (.NET) – Part 2

Evaluating Agent Performance: Metrics and Testing Strategies

Introduction to Microsoft Agent Framework: The Open-Source Engine for Agentic AI Apps (Part 1)

Streaming Responses for LLMs: Implementing Server-Sent Events

Quantization Methods for LLMs: GPTQ, AWQ, and BitsAndBytes

Advanced LoRA Techniques: Multi-LoRA, LoRA+, and Beyond