GenAI
- February 14, 2026
LLM Inference Internals: KV Cache, Flash Attention, and Optimizing for Apple Silicon
- February 14, 2026
I Ran an 80B Coding Model Locally with Claude Code. It Took 1 Hour Instead of 9 Minutes. Here's What Was Wrong.
- December 30, 2025
2025 Year in Review
- July 17, 2025
My Honest Take on Kiro, AI IDE from AWS
- July 7, 2025
Built an AI Agent using Strands Agents SDK
- June 25, 2025
Built an app using Lovable, vibecoding startup
- May 14, 2025
Built my first MCP Server for Kubernetes
- May 11, 2025
Deploying LLMs on Amazon EKS using NVIDIA GPUs
- May 4, 2025
Generate AWS Arch diagrams using AWS MCP server and Amazon Q CLI
- April 19, 2025
My Capstone project for Google GenAI course
- March 13, 2025
Creating decibel meter using Claude Code Agentic tool
- February 24, 2025
Claude Code Agentic CLI demo
- January 29, 2025
Running DeepSeek R1 locally
- January 28, 2025
New metric to measure the efficiency of AI models