Blog

Articles on AI/ML engineering, system design, and production deployments — grouped by topic

AI Agent Architecture

Building robust, production-ready AI agent systems: architecture, design, and orchestration fundamentals.

Title	Date	Categories
The Snapshot Tax: Why AG-UI's STATE_DELTA Drifts in Production	Jun 2, 2026	ai-engineering agentic-ai
Claude Code on Enterprise WSL: Eleven Errors, Eleven Fixes, and the One Flag Nobody Documents (--bare)	Jun 1, 2026	ai-engineering developer-tooling
Why Agent Memory Needs a Graph: Lessons from the Kumiho Architecture	Apr 2, 2026	agentic-ai ai-architecture agent-memory
Design Patterns for SLM-First Systems	Mar 14, 2026	ai-systems architecture small-language-model
Small Language Models Are Not Smaller GPTs - They're Infrastructure	Mar 13, 2026	ai-systems architecture small-language-model
Why Your AI Agent Finishes Tasks But Fails the Goal	Mar 10, 2026	agentic-ai langgraph ai-engineering
Claude Code Guide: Build Agentic Workflows with Commands, MCP, and Subagents	Mar 9, 2026	agentic-ai ai-engineering developer-tools
Orchestration in Agentic AI: Tool Selection, Execution, Planning Topologies, and Context Engineering	Mar 8, 2026	agentic-ai llm-engineering production-ml
Tool Use in LLM Agents: From Local Functions to the Model Context Protocol	Mar 7, 2026	agentic-ai langgraph ai-engineering
Designing User Experience for Agentic AI Systems	Mar 6, 2026	agentic-ai langgraph production-systems
Designing Agentic AI Systems That Survive Production	Mar 5, 2026	agentic-ai system-design production-ai
5 Principles for Building Production-Grade Agentic AI Systems	Mar 4, 2026	agentic-ai mlops system-design
The 7 GenAI Architectures Every AI Engineer Should Know	Mar 3, 2026	agentic-ai ai-infrastructure system-design
From LLMs to Agents: The Mindset Shift Nobody Talks About	Mar 2, 2026	agentic-ai ai-foundations langgraph
CopilotKit in Production: Where the Abstraction Holds and Where You're on Your Own	Mar 1, 2026	agentic-ai ai-infrastructure frontend-ai
Agentic AI Observability: Why Traditional Monitoring Breaks with Autonomous Systems	Feb 27, 2026	ai-engineering mlops ai-observability
Beyond Copy-Paste: Staying Relevant in the Age of AI Code Assistants	Feb 24, 2026	ai-engineering developer-productivity
Building Production-Ready AI Agent Services: FastAPI + LangGraph Template Deep Dive	Feb 15, 2026	ai-engineering production-systems agent-architecture
MLOps Foundation: What Actually Breaks When You Deploy ML Systems	Feb 6, 2026	mlops machine-learning-engineering production-ml
Stop Pasting Screenshots: How AI Engineers Document Systems with Mermaid	Dec 30, 2025	agentic-ai ai_ml ai-engineering
Building Production-Ready AI Agents with LangGraph: A Developer's Guide to Deterministic Workflows	Dec 29, 2025	agentic-ai ai-ml ai-engineering
Agent Building Blocks: Build Production-Ready AI Agents with LangChain \| Complete Developer Guide	Dec 22, 2025	agentic-ai ai_ml ai-engineering
Open Source AI's Original Sin: The Illusion of Democratization	Dec 10, 2025	ai-infrastructure open-source ai-economics
The Tyranny of the Mean: Population-Based Optimization in Healthcare and AI	Dec 8, 2025	ai_ml genai llms
The AI Ouroboros: How Gen AI is Eating Its Own Tail	Dec 5, 2025	agentic-ai ai_ml genai
Building Agents That Remember: State Management in Multi-Agent AI Systems	Nov 30, 2025	agentic-ai ai_ml ai-engineering
Building Production-Ready Agentic AI: The Infrastructure Nobody Talks About	Nov 27, 2025	agentic-ai ai_ml ai-engineering
Asynchronous Processing and Message Queues in Agentic AI Systems	Nov 27, 2025	agentic-ai ai_ml ai-engineering
Playwright + AI: The Ultimate Testing Power Combo Every Developer Should Use in 2025	Nov 24, 2025	ai_ml software-development software-testing
Prompt Engineering Deep Dive: Parameters, Chains, Reasoning, and Guardrails	Mar 9, 2025	ai_ml genai natural-language-processing-nlp
Fact-Checking in LLM Systems: From Hallucinations to Verifiable AI	Feb 5, 2025	ai_ml genai

LLM Applications & RAG

Practical guides for building and optimizing LLM-powered applications with RAG techniques.

Title	Date	Categories
HNSW Vector Search Recall Failures in Production	May 28, 2026	ai-engineering rag-systems
LLM Wiki Is Not a RAG Replacement - It's a Synthesis-Time Decision	Apr 20, 2026	ai-engineering agentic-ai knowledge-systems
Closing the Loop: How to Actually Measure RAG Quality in Production	Feb 26, 2026	rag-systems ai-observability production-ai
How Google's SynthID Actually Works: A Visual Breakdown	Dec 16, 2025	ai_ml genai llms
The Splintered Web: India 2025	Dec 7, 2025	ai_ml genai
When Models Stand Between Us and the Web: The Future of the Internet in the Age of Generative AI	Sep 26, 2025	ai_ml explainable-ai genai
Reranking for RAG: Boosting Answer Quality in Retrieval-Augmented Generation	Aug 11, 2025	ai_ml genai llms
Question Answer Chatbot using RAG, Llama and Qdrant	May 19, 2025	ai_ml genai llms
On Emergent Abilities of Large Language Models	Mar 26, 2025	ai_ml genai llms
LLM Text Clustering and Topic Modeling: HDBSCAN and BERTopic Tutorial	Mar 4, 2025	ai_ml genai natural-language-processing-nlp
Text Classification using Large Language Models (LLMs)	Mar 3, 2025	ai_ml genai natural-language-processing-nlp
Summary of the paper DeepSeek-R1	Jan 30, 2025	ai_ml genai
Hands-on Tutorial on Making an Audio Bot using LLM, and RAG	Jan 27, 2025	ai_ml genai natural-language-processing-nlp
My notes on AI-Generated Content (AIGC)	Jan 7, 2025	ai_ml computer-vision genai

Search Retrieval Methods

Practical guides to implementing and optimizing production search systems.

Title	Date	Categories
Local Binary Patterns: The Texture Descriptor That Deep Learning Hasn't Killed	Apr 14, 2026	computer-vision ml-engineering feature-engineering
BM25 for Developers: What Actually Matters in Production	Jan 17, 2026	information-retrieval search-engineering rag-systems
BM25 vs Dense Retrieval for RAG: What Actually Breaks in Production	Jan 17, 2026	genai information-retrieval rag-systems
Building Hybrid Search That Actually Works: BM25 + Dense Retrieval + Cross-Encoders	Jan 17, 2026	information-retrieval rag-systems search-engineering
A Deep Dive into Cross Encoders and How they work	Oct 2, 2025	genai information-retrieval llms
Fast Face Search (Billion-scale Face Recognition) using Vector DB (Faiss)	May 19, 2025	ai_ml computer-vision deep-learning
How do you choose among competing open-source products? Example comparison of open-source vector databases.	Jan 28, 2025	ai_ml genai
Fine-Tuning Cross-Encoders: When Accuracy Matters More Than Speed (A Practical Guide)	Jan 17, 2025	ai_ml deep-learning natural-language-processing

Agent Security Architecture

Securing autonomous AI agents through layered access control and execution boundaries.

Title	Date	Categories
Consequence Modeling for Agent Systems: Predicting Action Impact Before Execution	Feb 28, 2026	ai-safety agentic-systems risk-management
Multi-Party Authorization: Requiring Human Approval Without Killing Autonomy	Feb 25, 2026	ai-security agentic-systems authorization
Agent Audit Trails: Logging Context, Not Just Actions	Feb 23, 2026	ai-security agentic-systems observability
Credential Scoping for Agents: Why Temporary Keys Aren't Enough	Feb 22, 2026	ai-security agentic-systems credential-management
The Tool Execution Firewall: Pattern-Based Defense for Agent Actions	Feb 21, 2026	ai-security agentic-systems security-architecture
Trust Gradients: Dynamic Permission Scaling Based on Agent Behavior	Feb 17, 2026	ai-security agentic-systems access-control
Capability Tokens: Fine-Grained Authorization for Non-Deterministic Agents	Feb 16, 2026	ai-security agentic-systems authorization
Context Sandboxing: How to Prevent Tool Response Poisoning in Agentic Systems	Feb 14, 2026	ai-security agentic-systems production-security
The Agent DMZ: Isolating Decision-Making from Execution in Production AI	Feb 13, 2026	ai-security agentic-systems architecture
Zero Trust Agents: Why 'Verify Every Tool Call' Is the Only Defensible Architecture	Feb 12, 2026	ai-security agentic-systems zero-trust
Prompt Injection Is Just the Beginning: The Undefendable Attack Surface of Agentic AI	Feb 10, 2026	ai-security agentic-systems llm-security
The Agentic Security Divide: Why Only Rich Companies Can Deploy AI Agents Safely	Feb 9, 2026	ai-security agentic-systems production-ai
The Autonomous Credential Problem: When Your AI Needs Root Access	Feb 8, 2026	ai-security agentic-systems infrastructure
The Agent Trust Problem: Why Security Theater Won't Save Us from Agentic AI	Feb 5, 2026	agentic-ai ai-security llm-systems

MCP Implementation Guide

Practical guide to implementing Model Context Protocol in production systems.

Title	Date	Categories
Model Context Protocol (MCP): Architecture, Tradeoffs, and Production Realities	Feb 20, 2026	production-ai-engineering agentic-ai-systems ai-security-and-governance
Building a Production MCP Server: Architecture, Pitfalls, and Best Practices	Jan 23, 2026	ai-engineering mcp-protocol production-systems
Building an MCP Server for Non-LLM Clients (CLIs, IDEs, Pipelines)	Jan 23, 2026	mcp-implementation developer-tools infrastructure
Can MCP Replace Memory Systems? A Critical Analysis	Jan 23, 2026	agent-architecture memory-systems mcp-analysis
Designing Secure MCP Servers: Preventing Context Injection & Data Exfiltration	Jan 23, 2026	mcp-security llm-systems production-ai
How MCP Changes Agent Architecture (From Loops to Context Graphs)	Jan 23, 2026	agent-architecture mcp-systems llm-orchestration
Implementing MCP with LangGraph: A Practical Walkthrough	Jan 23, 2026	langgraph-integration mcp-implementation agent-systems
MCP as an AI Control Plane: Context Routing, Governance, and Policy	Jan 23, 2026	ai-infrastructure governance control-plane
MCP vs RAG vs Tools: When to Use Each (and When Not To)	Jan 23, 2026	llm-architecture production-ai system-design
Multi-Tenant MCP Servers: Isolating Context at Scale	Jan 23, 2026	multi-tenancy mcp-infrastructure enterprise-architecture
Observability for MCP Servers: Debugging Context, Not Prompts	Jan 23, 2026	observability mcp-operations debugging
Why MCP Servers Will Replace Most Agent Tool APIs	Jan 23, 2026	agent-architecture mcp-adoption api-design
Building a Local Banking Sandbox: Why I Created DevBankSDK	Jan 20, 2026	fintech-development developer-tools open-source

LLM Application Development

Building production systems with LLMs, from frontend to inference to deployment.

Title	Date	Categories
Frontend Architecture for GenAI: Why Your React Patterns Don't Work Anymore	Feb 19, 2026	frontend-architecture genai-engineering react-patterns
Building ChatGPT-Style Streaming in React: FastAPI + Next.js Production Guide	Feb 18, 2026	llm-engineering frontend-architecture streaming-apis
Choosing the Right LLM Is a Systems Decision, Not a Model Benchmark	Feb 6, 2026	ai-engineering llm-deployment production-ml
Choosing the Right LLM Inference Framework: A Practical Guide	Dec 24, 2025	ai_ml ai-engineering genai
When Your Chatbot Needs to Actually Do Something: Understanding AI Agents	Dec 19, 2025	agentic-ai ai_ml genai
Introducing My New Book: The ChatML (Chat Markup Language) Handbook	Nov 17, 2025	ai_ml genai llms
Cursor AI Code Editor: Boost Developer Productivity with MCP Servers	Sep 18, 2025	ai_ml genai llms
LLMs for SMEs - 001: How Small Businesses Can Leverage AI Without Cloud Costs	Aug 22, 2025	ai_ml genai llms
LLM-Powered Chatbots: A Practical Guide to User Input Classification and Intent Handling	Aug 12, 2025	agentic-ai ai_ml ai-engineering
Inside the LLM Inference Engine: Architecture, Optimizations, Tools, Key Concepts and Best Practices	Feb 9, 2025	ai_ml ai-engineering genai
Making a talking bot using Llama3.2:1b running on Raspberry Pi 4 Model-B 4GB	Jan 2, 2025	ai_ml genai iot-edge-computing
Python Audio Recording Tutorial - Record, Save & Play	Dec 5, 2024	python software-development unstructured-data

Agentic AI Safety

Safeguarding autonomous AI systems through monitoring, control, and ethical deployment practices.

Title	Date	Categories
The Panopticon Agent: How Agentic AI Makes Surveillance Trivial and Invisible	Feb 11, 2026	ai-ethics surveillance agentic-systems
Building Privacy-Preserving Machine Learning Applications in Python with Homomorphic Encryption	Sep 3, 2025	ai_ml deep-learning medical-imaging
Provenance in AI: Auto-Capturing Provenance with MLflow and W3C PROV-O in PyTorch Pipelines – Part 4	Aug 29, 2025	ai_ml deep-learning explainable-ai
Navigating AI Risks with NIST’s AI Risk Management Framework (AI RMF)	Aug 28, 2025	ai_ml deep-learning explainable-ai
Provenance in AI: Building a Provenance Graph with Neo4j – Part 3	Aug 28, 2025	ai_ml computer-vision deep-learning
Provenance in AI: Tracking AI Lineage with Signed Provenance Logs in Python - Part 2	Aug 28, 2025	ai_ml computer-vision deep-learning
Provenance in AI: Why It Matters for AI Engineers - Part 1	Aug 27, 2025	ai_ml computer-vision deep-learning

Local LLM Setup

Learn to set up and prompt local language models effectively.

Title	Date	Categories
ChatML Guide: Master Structured Prompts for LLMs	Aug 10, 2025	ai_ml genai llms
Install, run, and access Llama using Ollama	Dec 17, 2024	ai_ml genai

Other writings on AI, engineering, and technology

Title	Date	Categories
Fault Isolation and Circuit Breaking: Stop Retrying LLM Calls Like Microservices	Jun 16, 2026	ai-engineering agentic-ai systems-design
State Architecture for Agent Networks: The Resume Is the Dangerous Part	Jun 15, 2026	ai-engineering agentic-ai systems-design
Multi-Agent Topology Patterns: Every Topology Has a Tear Point	Jun 14, 2026	ai-engineering agentic-ai
Why Single Agents Fail at Scale: The Five-Mode Failure Taxonomy	Jun 8, 2026	ai-engineering agentic-ai