AI
3 articles
Building a RAG Pipeline That Actually Works
A production-focused guide to building Retrieval-Augmented Generation pipelines with Python and FastAPI. Covers chunking strategies, embedding models, vector search, prompt construction, and the failure modes nobody warns you about.
How Local LLMs Replaced $300/Month in OpenAI Costs
A practical breakdown of when and how to move AI workloads from OpenAI APIs to local models. Real cost math, production code, and an honest look at when it's not worth it.
How I Use AI to Write Better Code (Without Losing My Skills)
A senior engineer's practical approach to AI-assisted development: boosting productivity while maintaining technical depth and code quality.