Articles · Page 2
Older posts from the archive.

The ESM Mess: JavaScript's Module System Is Still Broken and Here's Why
ES Modules have been the supposed future of JavaScript for nine years. Only 9-27% of the ecosystem has actually adopted them. Here's what's really going on, and how to survive until the ecosystem commits.

The Three Things Exceptional Engineering Leaders Do (And the One They Stop Doing)
Most engineering leaders excel at one of three pillars — providing direction, removing obstacles, or foreseeing change — and quietly fail at the other two. After 12 years and more management mistakes than I care to count, here's what I've learned about building strength across all three.

The Open-Weight LLM Landscape in 2026: What Engineers Actually Need to Know
The open-weight ecosystem has matured faster than most engineers realize. MoE proliferation, hybrid attention, and extended context windows are changing what's actually deployable on-premise — and that matters more than ever for healthcare AI.

Software 2.0 Is Here and It Changed How I Think About Programming
In 2017, a post called "Software 2.0" argued that neural networks would replace explicit logic as the dominant programming paradigm. Nine years later, that prediction has fully landed — and the implications for how we build software are bigger than most engineers want to admit.

Three Ways to Know If Your Career Is Actually Growing
Normal career metrics — title, pay, team size — tell you how you're doing relative to others. They don't tell you whether you're growing. Here are three that do.

RAG Isn't Dead. You're Just Using It Wrong.
The 'RAG is dead' narrative is wrong — but it's wrong in an interesting way. After building RAG systems in healthcare production, here's what actually kills LLM context quality and what to do about it.

Fine-Tuning a 70B Model on a Consumer GPU: The Q-LoRA Practical Guide
Q-LoRA + SFTTrainer + Flash Attention v2 means you can fine-tune a 70B parameter model on 24GB of VRAM. Here is what that actually looks like end-to-end, what it costs in quality, and when you should just use the API instead.

Time vs. Timing: The Career Framework I Wish I Had Earlier
I've made bets that paid off because of timing and bets that paid off because of compounding. Confusing the two is how careers stall.

The LLM Year in Review: What Actually Mattered in 2025 (And What Was Noise)
The prediction was: bigger models win. The reality was: DeepSeek R1 rewrote the rules in January and nothing was the same after that. Here is what 2025 actually taught us about reasoning, inference-time compute, and the changing economics of intelligence.








