Cloud Computing – Page 2 – C4: Container, Code, Cloud & Context

Deploying LLM Applications on Cloud Run: A Complete Guide

Posted on November 5, 2024 by Nithin Mohan TK 6 min read

Last year, I deployed our first LLM application to Cloud Run. What should have taken hours took three days. Cold starts killed our latency. Memory limits caused crashes. Timeouts broke long-running requests. After deploying 20+ LLM applications to Cloud Run, I’ve learned what works and what doesn’t. Here’s the complete guide.

Figure 1: Cloud Run […]

Mastering AWS EKS Deployment with Terraform: A Comprehensive Guide

Posted on October 29, 2024 by Nithin Mohan TK 3 min read

Introduction: Amazon Elastic Kubernetes Service (EKS) simplifies the process of deploying, managing, and scaling containerized applications using Kubernetes on AWS. In this guide, we’ll explore how to provision an AWS EKS cluster using Terraform, an Infrastructure as Code (IaC) tool. We’ll cover essential concepts and Terraform configurations, and provide hands-on examples to help you get started […]

ML.NET for Custom AI Models: When to Use ML.NET vs Cloud APIs

Posted on September 5, 2024 by Nithin Mohan TK 6 min read

Six months ago, I faced a critical decision: build a custom ML model with ML.NET or use cloud APIs. The project required real-time fraud detection with zero latency tolerance. Cloud APIs were too slow. ML.NET was the answer. But when should you use ML.NET vs cloud APIs? After building 15+ production ML systems, here’s what […]

Multi-Cloud AI Strategies: Avoiding Vendor Lock-in

Posted on July 25, 2024 by Nithin Mohan TK 12 min read

Multi-cloud AI strategies prevent vendor lock-in and optimize costs. After implementing multi-cloud for 20+ AI projects, I’ve learned what works. Here’s the complete guide to multi-cloud AI strategies.

Figure 1: Multi-Cloud AI Architecture

Why Multi-Cloud for AI

Multi-cloud strategies offer significant advantages:

Vendor independence: Avoid lock-in to a single cloud provider
Cost optimization: Use best pricing […]

Generative AI Services in AWS

Posted on May 5, 2024 by Nithin Mohan TK 19 min read

A practitioner’s deep-dive into the complete AWS Generative AI stack: Amazon Bedrock foundation models, Knowledge Bases, Agents, Guardrails, Amazon Q Business and Q Developer, SageMaker fine-tuning with LoRA, Trainium and Inferentia custom silicon, multi-model routing patterns, and production observability. 3000+ words of enterprise-grade guidance.

AWS Cloud Platform Fundamentals: Account Structure, IAM, and Global Infrastructure (Part 1 of 6)

Posted on April 15, 2024 by Nithin Mohan TK 8 min read

Amazon Web Services (AWS) is the world’s most comprehensive and widely adopted cloud platform, offering over 200 fully featured services from data centers globally. This foundational guide covers the essential concepts every developer and architect needs to master before building on AWS.

📚 AWS FUNDAMENTALS SERIES

This is Part 1 of a 6-part series covering […]
