Fine-tuning has a marketing problem: it's the first thing people reach for when a model doesn't do what they want, and the last thing that actually fixes it. Most production teams would get better ROI from a well-structured prompt and a reranking layer.
The decision framework
Fine-tuning is worth it when: (1) you need consistent output format that prompting can't enforce reliably, (2) you have >10,000 high-quality examples of the exact task, or (3) you need to compress a complex multi-turn prompt into a faster, cheaper inference call. Everything else is a prompt engineering problem.
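The three criteria above can be sketched as a simple decision function. This is an illustrative sketch only: the `TaskProfile` fields, the `should_fine_tune` name, and the 10,000-example threshold mirror the text, not any real library API.

```python
from dataclasses import dataclass


@dataclass
class TaskProfile:
    # Hypothetical fields mirroring the three criteria in the text.
    needs_strict_format: bool      # prompting can't reliably enforce the output format
    example_count: int             # high-quality examples of the exact task
    compressing_long_prompt: bool  # want to fold a complex multi-turn prompt into one call


def should_fine_tune(task: TaskProfile, min_examples: int = 10_000) -> bool:
    """Return True if any of the three criteria holds;
    otherwise treat it as a prompt engineering problem."""
    return (
        task.needs_strict_format
        or task.example_count > min_examples
        or task.compressing_long_prompt
    )
```

For example, a team with 500 examples and no hard format requirement would get `False` here and should iterate on the prompt instead.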