I am by no means an expert, but I think it is a process that lets you train an LLM from another LLM without needing anywhere near as much compute or data as training from scratch. I think this was the thing DeepSeek pioneered. Don’t quote me on any of that though.
No, distillation is far older than DeepSeek. DeepSeek was impressive because of algorithmic improvements that allowed them to train a model of that size with vastly less compute than anyone expected, even accounting for distillation.
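For anyone unfamiliar: classic knowledge distillation (going back to Hinton et al., 2015) trains a smaller "student" model to match a larger "teacher" model's output distribution instead of just the hard labels. A rough PyTorch sketch of the core loss, purely illustrative and not DeepSeek's actual pipeline:

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    # Soften both distributions with a temperature, then push the
    # student's predictions toward the teacher's via KL divergence.
    soft_targets = F.softmax(teacher_logits / temperature, dim=-1)
    student_log_probs = F.log_softmax(student_logits / temperature, dim=-1)
    return F.kl_div(student_log_probs, soft_targets,
                    reduction="batchmean") * temperature ** 2

# Toy example: a batch of 4 positions over a vocabulary of 10 tokens.
teacher_logits = torch.randn(4, 10)                        # from the frozen teacher
student_logits = torch.randn(4, 10, requires_grad=True)    # from the student
loss = distillation_loss(student_logits, teacher_logits)
loss.backward()
```

The student trains on the teacher's full probability distribution, which carries more signal per example than one-hot labels, which is part of why it needs less data.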
I also haven’t seen any hard data on how much they actually use distillation-like techniques. They for sure used a bunch of synthetically generated data to get better at reasoning, something that is now commonplace.
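In the LLM context the "distillation-like" part usually just means sampling outputs from a stronger model and fine-tuning on that text, rather than matching logits directly. A hedged sketch with the Hugging Face transformers API — the model name and prompts are placeholders, not anything DeepSeek actually used:

```python
from transformers import pipeline

# Placeholder teacher model; in practice you'd use whatever strong model you have access to.
teacher = pipeline("text-generation", model="gpt2")

prompts = [
    "Q: If a train travels 60 km in 45 minutes, what is its speed in km/h? A:",
    "Q: Solve for x: 2x + 3 = 11. A:",
]

# Sample completions from the teacher; these become the synthetic training targets.
synthetic_data = []
for prompt in prompts:
    out = teacher(prompt, max_new_tokens=64, do_sample=True, temperature=0.7)
    synthetic_data.append({"prompt": prompt, "completion": out[0]["generated_text"]})

# The (prompt, completion) pairs would then feed ordinary supervised fine-tuning
# of the student model, e.g. with the Trainer API or TRL's SFTTrainer.
```

That's why the line between "distillation" and "training on synthetic data" is blurry, and why it's hard to pin down how much of either went into any given model.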