Tagged: DeepSeek
2 articles on DeepSeek.

The LLM Year in Review: What Actually Mattered in 2025 (And What Was Noise)
The prediction was: bigger models win. The reality was: DeepSeek R1 rewrote the rules in January and nothing was the same after that. Here is what 2025 actually taught us about reasoning, inference-time compute, and the changing economics of intelligence.
Engineering

From GPT-2 to DeepSeek: The Architectural Changes That Actually Mattered
I've been reading ML papers for 10 years. Most don't matter. These architectural choices did. RoPE, GQA, SwiGLU — each one solved a real scaling problem. Here's what practitioners need to know when a new model claims 'better architecture.'
Engineering

