Tagged: LLM
3 articles on llm.

RAG Isn't Dead. You're Just Using It Wrong.
The 'RAG is dead' narrative is wrong — but it's wrong in an interesting way. After building RAG systems in healthcare production, here's what actually kills LLM context quality and what to do about it.
EngineeringRead more →

Fine-Tuning a 70B Model on a Consumer GPU: The Q-LoRA Practical Guide
Q-LoRA + SFTTrainer + Flash Attention v2 means you can fine-tune a 70B parameter model on 24GB of VRAM. Here is what that actually looks like end-to-end, what it costs in quality, and when you should just use the API instead.
EngineeringRead more →

When Recommendations Meet Language: The LLM-RecSys Convergence
Most AI stacks treat the recommendation engine and the language model as two separate systems that hand off to each other. A new class of hybrid models eliminates that seam — and the implications for domain-specific AI are significant.
EngineeringRead more →


