I spent $80 and 14 hours to build this, welcome to my new website!
My refreshed personal blog, built with the help of Claude Code.
I write about software engineering. Find my articles here and across leading tech publications.
My refreshed personal blog, built with the help of Claude Code.
Docker images bloat from AI libraries and OS components. This articles suggests using docker history and dive tools to diagnose layer-by-layer bloat sources for targeted optimization.
A beginner-friendly guide showing developers how to easily deploy transformer models (like DistilBERT) using Docker, Flask, Gunicorn, and AWS SageMaker.
Unmasking AI’s Illusions: Inside Google’s Data Gemma
A Comprehensive Guide to Choosing the Right Vector Index for Efficient Similarity Search
A containerized solution for hosting transformer models using Flask, Gunicorn, and Docker with AWS SageMaker deployment support. Build once, run anywhere!
This project dives into the capabilities of Google's DataGemma LLM and demonstrates how to replicate similar behavior on the Claude LLM through prompt engineering. By combining large language models with real-time data retrieval from Data Commons, we aim to provide accurate and up-to-date responses to statistical queries.
A evolving codebase that demonstrates various techniques to optimize docker image for size and performance.