LLM – Page 17 – C4: Container, Code, Cloud & Context

Claude API Deep Dive: Building with Anthropic’s Models

Posted on November 12, 2024 by Nithin Mohan TK (7 min read)

A comprehensive guide to the Anthropic Claude API covering Claude 3.5 Sonnet, tool use, vision, computer use, and production best practices.

The Complete Guide to RAG Architecture: From Fundamentals to Production

Posted on November 10, 2024 by Nithin Mohan TK (11 min read)

Master Retrieval-Augmented Generation (RAG) with this expert-level guide. Learn about RAG types (Naive, Advanced, Modular, Agentic), chunking strategies, embedding models, vector databases, hybrid retrieval, and production best practices with high-quality architecture diagrams.

Prompt Templates and Versioning: Building Maintainable LLM Applications

Posted on November 6, 2024 by Nithin Mohan TK (13 min read)

Introduction: Production LLM applications need structured prompt management—not ad-hoc string concatenation scattered across code. Prompt templates provide reusable, parameterized prompts with consistent formatting. Versioning enables A/B testing, rollbacks, and tracking which prompts produced which results. This guide covers practical prompt template patterns: template engines and variable substitution, prompt registries, version control strategies, A/B testing frameworks, […]

Deploying LLM Applications on Cloud Run: A Complete Guide

Posted on November 5, 2024 by Nithin Mohan TK (6 min read)

Last year, I deployed our first LLM application to Cloud Run. What should have taken hours took three days. Cold starts killed our latency. Memory limits caused crashes. Timeouts broke long-running requests. After deploying 20+ LLM applications to Cloud Run, I’ve learned what works and what doesn’t. Here’s the complete guide. […]

Enterprise Generative AI: A Solutions Architect’s Framework for Production-Ready Systems

Posted on October 27, 2024 by Nithin Mohan TK (5 min read)

After two decades of building enterprise systems, I’ve witnessed numerous technology waves—from SOA to microservices, from on-premises to cloud-native. But nothing has matched the velocity and transformative potential of generative AI. The challenge isn’t whether to adopt it; it’s how to do so without creating technical debt that will haunt your organization for years. The […]

LLM Evaluation: Metrics, Benchmarks, and A/B Testing

Posted on October 15, 2024 by Nithin Mohan TK (12 min read)

Introduction: Evaluating LLM outputs is challenging because there’s often no single “correct” answer. Traditional metrics like BLEU and ROUGE fall short for open-ended generation. This guide covers modern evaluation approaches: automated metrics for specific tasks, LLM-as-judge for quality assessment, human evaluation frameworks, A/B testing in production, and building comprehensive evaluation pipelines. These techniques help you […]
