It's basically continual learning. This is beyond a hard problem; it's currently an impossible one. I know of no system that solves CL even at small scale, let alone in large models.
Annoyingly, they have SOME inherent capability to do it. It's really easy to get sucked down this path because of that glimmer of hope, but the longer you play with it, the more annoying it becomes.
SSI seems to be focused on this problem directly, so maybe they'll discover something?
So, surprisingly, that is not completely true - I know of 2 HFT finance firms that do CL at scale, and it works - but in a relatively narrow context of predicting profitable actions. It is still very surprising that it works, and the compute required is impressively large - but it does work. I do have some hope of it translating to the wider energy landscapes we want AI to work over…
During covid almost every prediction model like that exploded; everything went out of distribution really fast. In the sense you mean, we've been doing "CL" for a decade or more. It can also be cheap if you use smaller models.
But true CL is the ability to learn out-of-distribution information on the fly.
The only true solution I know of to continual learning is to completely retrain the model from scratch on every new example you encounter. That is technically achievable now, but it is also effectively useless.
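A toy sketch of what that looks like, using a scikit-learn LogisticRegression purely for illustration (the model choice is my assumption, nothing specific): every new example triggers a full fit over the entire history, so the cost grows with everything seen so far.

    import numpy as np
    from sklearn.linear_model import LogisticRegression

    X_seen, y_seen = [], []

    def learn_one(x, y):
        """'Continual learning' by brute force: retrain on the full history each time."""
        X_seen.append(x)
        y_seen.append(y)
        if len(set(y_seen)) < 2:      # need at least two classes before fitting
            return None
        # Cost of this call grows with everything seen so far.
        return LogisticRegression().fit(np.array(X_seen), np.array(y_seen))

    learn_one([0.0, 0.0], 0)
    model = learn_one([1.0, 1.0], 1)  # one new example -> full retrain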
Yes and no - the ones that exploded - and there were many - got shut down by the orchestrator model, and within 2 weeks there was a new ensemble of winners, with some overlap with the prior winners. To your point, it did in fact take 2-3 weeks, so one could claim this is retraining...
Ehhh KNN doesn’t have a training phase, so it’s really more that the concept of continual learning doesn’t apply. You have to store your entire dataset and recalculate everything from scratch every time anyway.
Yes, that's basically the point. You get 'free' continual learning just by throwing the new data into the pool. Needing an explicit training step is a weakness that makes CL hard to make work for many other approaches.
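A minimal sketch of that 'free' CL with a brute-force NumPy k-NN (a toy example of my own, not anyone's production system): learning a new point is literally just appending it to the pool.

    import numpy as np

    class OnlineKNN:
        """Brute-force k-NN where 'continual learning' is just appending data."""
        def __init__(self, k=3):
            self.k = k
            self.X, self.y = [], []

        def add(self, x, label):      # the entire "training" step
            self.X.append(np.asarray(x, dtype=float))
            self.y.append(label)

        def predict(self, x):
            dists = np.linalg.norm(np.stack(self.X) - np.asarray(x, dtype=float), axis=1)
            nearest = np.argsort(dists)[:self.k]
            labels, counts = np.unique([self.y[i] for i in nearest], return_counts=True)
            return labels[np.argmax(counts)]

    knn = OnlineKNN(k=3)
    knn.add([0.0, 0.0], "a"); knn.add([0.1, 0.2], "a"); knn.add([5.0, 5.0], "b")
    print(knn.predict([0.2, 0.1]))    # -> "a"
    knn.add([6.0, 5.5], "b")          # a new, possibly out-of-distribution point: just append it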
For any practical application, KNN will need some kind of accelerated search structure (e.g. a k-d tree for < ~7 dimensions), which then requires support for dynamic insertions. But this is an engineering problem, not a data science problem; it works and is practical. For example, this has been used by the top systems in Robocode for 15+ years at this point; it's just that academia doesn't find the approach novel enough to bother pursuing.
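One common engineering workaround, sketched here with scipy's cKDTree (which is static, so this amortized-rebuild scheme is my assumption, not the Robocode approach): buffer new points, search both the tree and the buffer, and rebuild once the buffer gets big.

    import numpy as np
    from scipy.spatial import cKDTree

    class InsertableKNN:
        """Static k-d tree plus a brute-force buffer; rebuild when the buffer grows."""
        def __init__(self, points, rebuild_at=1024):
            self.points = np.asarray(points, dtype=float)
            self.tree = cKDTree(self.points)
            self.buffer = []                  # recent insertions, searched linearly
            self.rebuild_at = rebuild_at

        def insert(self, p):
            self.buffer.append(np.asarray(p, dtype=float))
            if len(self.buffer) >= self.rebuild_at:    # amortize the rebuild cost
                self.points = np.vstack([self.points, np.stack(self.buffer)])
                self.tree = cKDTree(self.points)
                self.buffer = []

        def nearest(self, q):
            q = np.asarray(q, dtype=float)
            dist, idx = self.tree.query(q)             # best candidate in the tree
            best = (dist, self.points[idx])
            for p in self.buffer:                      # also check recent inserts
                d = np.linalg.norm(p - q)
                if d < best[0]:
                    best = (d, p)
            return best

    knn = InsertableKNN(np.random.rand(1000, 3))
    knn.insert([0.5, 0.5, 0.5])
    print(knn.nearest([0.5, 0.5, 0.5]))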
>Needing an explicit training step is a weakness that makes CL hard to make work for many other approaches.
On the other hand, not having an explicit training step is a huge weakness of KNN.
Training-based methods scale better because the storage and runtime requirements are independent of dataset size. You can compress 100TB of training data down into a 70GB LLM.
A KNN on the same data would require keeping around the full 100TB, and it would be intractably slow.
Feature engineering is a thing; you don't need the full data source for KNN to search in. This is already used extensively in RAG-type lookup systems, for example.
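A rough sketch of that idea (the hashed bag-of-words embed() is just a stand-in for a real embedding model, so the sketch runs on its own): store compact feature vectors instead of the raw data, do the nearest-neighbour search over those, and 'learn' a new document by appending its vector.

    import numpy as np

    def embed(text):
        # Stand-in feature extractor: hashed bag-of-words, L2-normalized.
        v = np.zeros(64)
        for tok in text.lower().split():
            v[hash(tok) % 64] += 1.0
        return v / (np.linalg.norm(v) + 1e-9)

    corpus = ["continual learning is hard", "HFT models retrain weekly",
              "KNN just appends new data"]
    index = np.stack([embed(d) for d in corpus])   # compact features, not the raw 100TB

    def lookup(query, k=2):
        sims = index @ embed(query)                # cosine similarity on unit vectors
        return [corpus[i] for i in np.argsort(-sims)[:k]]

    print(lookup("how does KNN learn new data"))
    corpus.append("a new document arrives")        # 'learning' = append its vector
    index = np.vstack([index, embed("a new document arrives")])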