Hacker News

It takes a few hours to compute the imatrix on some calibration dataset, since we use 1-3 million or more tokens of high-quality data. Then we have to decide which layers to quantize to higher bits, which takes more time. Creating the quantizations themselves also takes some hours, and uploading takes time as well! Overall, maybe 8 hours minimum?
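For reference, the imatrix-then-quantize pipeline described above looks roughly like this with llama.cpp's stock tools (file names here are placeholders, not Unsloth's actual setup, and exact flags can vary by llama.cpp version):

```shell
# 1. Compute the importance matrix over a calibration text file.
#    This is the multi-hour step when the file holds millions of tokens.
./llama-imatrix -m model-f16.gguf -f calibration.txt -o imatrix.dat

# 2. Quantize, using the imatrix to pick better per-tensor scales.
#    The target type (Q4_K_M here) is just an example.
./llama-quantize --imatrix imatrix.dat model-f16.gguf model-Q4_K_M.gguf Q4_K_M

# 3. Upload the result, e.g. to Hugging Face (repo name is hypothetical):
# huggingface-cli upload my-org/model-GGUF model-Q4_K_M.gguf
```

Deciding which layers get higher-bit treatment is a separate, model-specific step on top of this; the commands above only show the generic flow.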


What cluster do you have to do the quantizing? I'm guessing you're not using a single machine with a 3090 in your garage.


Oh definitely not! I use some spot cloud instances!


But you can get one of these quantized models to run effectively on a 3090?

If so, I'd love detailed instructions.

The guide you posted earlier goes over my (and likely many others') head!


Oh yes, definitely! Oh wait, is the guide too long / wordy? This section shows how to run it on a 3090: https://docs.unsloth.ai/basics/qwen3-coder-how-to-run-locall...
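The short version of running a quantized GGUF on a single 24 GB card looks something like this (a minimal sketch, assuming llama.cpp; the model filename is a placeholder, and you'd pick a quant small enough to fit in VRAM):

```shell
# -ngl 99 offloads as many layers as possible to the GPU;
# lower --ctx-size if you run out of VRAM on a 3090.
./llama-cli -m Qwen3-Coder-Q4_K_M.gguf -ngl 99 --ctx-size 8192 \
  -p "Write hello world in C"
```

If the model is larger than VRAM, llama.cpp keeps the remaining layers on the CPU, so it still runs, just slower.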


Kind of you to respond! Thanks!

I have pretty bad ADHD. And I've only run models locally using Kobold; I'm a dilettante at DIY AI.

So, yeah, I'm a bit lost in it.


Oh sorry - for Kobold - I think it uses llama.cpp under the hood? I think Kobold has some guides on using custom GGUFs
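Since KoboldCpp wraps llama.cpp, pointing it at a custom GGUF is usually just a matter of passing the file path. A hedged sketch (paths are placeholders; check KoboldCpp's own docs for current flag names):

```shell
# Load a custom GGUF in KoboldCpp, offloading layers to the GPU
# and serving the usual local web UI on port 5001.
python koboldcpp.py --model Qwen3-Coder-Q4_K_M.gguf \
  --gpulayers 99 --contextsize 8192 --port 5001
```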



