Artificial Intelligence(AI) – Page 22 – C4: Container, Code, Cloud & Context

LLM Evaluation Metrics: Automated Testing, LLM-as-Judge, and Human Assessment for Production AI

Posted on July 17, 2024 by Nithin Mohan TK 13 min read

Introduction: Evaluating LLM outputs is fundamentally different from traditional ML evaluation. There’s no single ground truth for creative tasks, quality is subjective, and outputs vary with each generation. Yet rigorous evaluation is essential for production systems—you need to know if your prompts are working, if model changes improve quality, and if your system meets user […]

Read more →

Fine-Tuning Large Language Models: A Complete Guide to LoRA and QLoRA

Posted on July 15, 2024 by Nithin Mohan TK 3 min read

Master parameter-efficient fine-tuning with LoRA and QLoRA. Learn how to customize LLMs like Llama 3 and Mistral on consumer hardware with step-by-step implementation guides.

Read more →

Text-to-SQL with LLMs: Building Natural Language Database Interfaces

Posted on July 12, 2024 by Nithin Mohan TK 10 min read

Introduction: Natural language to SQL is one of the most practical LLM applications. Business users can query databases without knowing SQL, analysts can explore data faster, and developers can prototype queries quickly. But naive implementations fail spectacularly—generating invalid SQL, hallucinating table names, or producing queries that return wrong results. This guide covers building robust text-to-SQL […]

Read more →

Generative AI Services in AWS

Posted on May 5, 2024 by Nithin Mohan TK 19 min read

A practitioner’s deep-dive into the complete AWS Generative AI stack: Amazon Bedrock foundation models, Knowledge Bases, Agents, Guardrails, Amazon Q Business and Q Developer, SageMaker fine-tuning with LoRA, Trainium and Inferentia custom silicon, multi-model routing patterns, and production observability. 3000+ words of enterprise-grade guidance.

Read more →

Generative AI in Healthcare: Revolutionizing Patient Care

Posted on May 3, 2024 by Nithin Mohan TK 4 min read

The first time I witnessed a generative AI system accurately synthesize a patient’s complex medical history into actionable clinical insights, I understood we were entering a new era of healthcare delivery. After two decades of architecting enterprise systems across industries, I can say that healthcare presents both the greatest challenges and the most profound opportunities […]

Read more →

Context Distillation Methods: Extracting Signal from Long Documents

Posted on July 1, 2019 by Nithin Mohan TK 2 min read

Introduction: Long contexts contain valuable information, but they also contain noise, redundancy, and irrelevant details that consume tokens and dilute model attention. Context distillation extracts the essential information from lengthy documents, conversations, or retrieved passages, producing compact representations that preserve what matters while discarding what doesn’t. This technique is crucial for RAG systems processing multiple […]

Read more →

Searching in

Category: Artificial Intelligence(AI)

LLM Evaluation Metrics: Automated Testing, LLM-as-Judge, and Human Assessment for Production AI

Fine-Tuning Large Language Models: A Complete Guide to LoRA and QLoRA

Text-to-SQL with LLMs: Building Natural Language Database Interfaces

Generative AI in Healthcare: Revolutionizing Patient Care

Context Distillation Methods: Extracting Signal from Long Documents