Glossary ›
Distillation.
training a smaller, cheaper model to mimic a larger one, capturing much of its ability at a fraction of the running cost
training a smaller, cheaper model to mimic a larger one, capturing much of its ability at a fraction of the running cost.
Updated 2026-06-03