Production AI – Page 5 – C4: Container, Code, Cloud & Context

Building LLM Agents with Tools: From Simple Loops to Production Systems

Posted on August 5, 2024 by Nithin Mohan TK 11 min read

Introduction: LLM agents extend language models beyond text generation into autonomous action. By connecting LLMs to tools—web search, code execution, APIs, databases—agents can gather information, perform calculations, and interact with external systems. This guide covers building tool-using agents from scratch: defining tools with schemas, implementing the reasoning loop, handling tool execution, managing conversation state, and […]

Read more →

Document Processing with LLMs: Enterprise Parsing, Chunking, and Extraction (Part 2 of 2)

Posted on July 25, 2024 by Nithin Mohan TK 16 min read

Introduction: Processing documents with LLMs unlocks powerful capabilities: extracting structured data from unstructured text, summarizing lengthy reports, answering questions about document content, and transforming documents between formats. However, effective document processing requires more than just sending text to an LLM—it demands careful parsing, intelligent chunking, and strategic prompting. This guide covers practical document processing patterns: […]

Read more →

LLM Observability: Tracing, Metrics, and Logging for Production AI (Part 1 of 2)

Posted on July 18, 2024 by Nithin Mohan TK 16 min read

Introduction: Observability is essential for production LLM applications—you need visibility into latency, token usage, costs, error rates, and output quality. Unlike traditional applications where you can rely on status codes and response times, LLM applications require tracking prompt versions, model behavior, and semantic quality metrics. This guide covers practical observability: distributed tracing for multi-step LLM […]

Read more →

LLM Evaluation Metrics: Automated Testing, LLM-as-Judge, and Human Assessment for Production AI

Posted on July 17, 2024 by Nithin Mohan TK 13 min read

Introduction: Evaluating LLM outputs is fundamentally different from traditional ML evaluation. There’s no single ground truth for creative tasks, quality is subjective, and outputs vary with each generation. Yet rigorous evaluation is essential for production systems—you need to know if your prompts are working, if model changes improve quality, and if your system meets user […]

Read more →

Searching in

Tag: Production AI

Building LLM Agents with Tools: From Simple Loops to Production Systems

Document Processing with LLMs: Enterprise Parsing, Chunking, and Extraction (Part 2 of 2)

LLM Observability: Tracing, Metrics, and Logging for Production AI (Part 1 of 2)

LLM Evaluation Metrics: Automated Testing, LLM-as-Judge, and Human Assessment for Production AI