I've tried both, and I'm still not sure. Claude Code steers more towards a hands-off, vibe coding approach, which I often regret later. With Copilot I'm more involved, which feels less 'magical' and takes me more time, but generally does not end in misery.
I'm more curious how Gemini 3 Flash Lite performs and is priced when it comes out, because for most non-coding tasks the real distinction may not be between Pro and Flash but between Flash and Flash Lite.
Token usage also needs to be factored in, especially when thinking is enabled: these newer models find difficult problems easier and use fewer tokens to solve them.
Thanks, that was a great breakdown of the costs. I had just assumed the pricing was the same. The premium probably comes from the confidence and buzz around Gemini 3.0 as one of the best-performing models. But competition in this area is hot, and it won't be long before we get similarly performing models at a cheaper price.
The price increase sucks, but you really do get a whole lot more. There's also the "Flash Lite" series; 2.5 Flash Lite is $0.10/M, so hopefully we see something like a 3.0 Flash Lite for $0.20-0.25.
Mostly at the time of release, except for 1.5 Flash, which got a price drop in Aug 2024.
Google has been discontinuing older models after a transition period of several months, so I would expect the same for the 2.5 models. But that process only starts once the release versions of the 3.0 models are out (Pro and Flash are in preview right now).
There are plenty. But it's not the comparison you want to be making. There is too much variability in the number of tokens used for a single response, especially once reasoning models became a thing. And it gets even worse when you put the models into a variable-length output loop.
You really need to look at the cost per task. artificialanalysis.ai has a good composite score, measures the cost of running all its benchmarks, and has a 2D intelligence-vs-cost graph.
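Back-of-envelope, the difference looks something like this (made-up prices and token counts, purely illustrative; thinking tokens are billed at the output rate on most APIs):

    # Cost per task, not cost per token: a model that is cheap per
    # token but thinks a lot can cost more per task than a pricier
    # model that answers directly.
    def cost_per_task(in_tok, out_tok, thinking_tok,
                      in_price_per_m, out_price_per_m):
        billed_out = out_tok + thinking_tok  # thinking billed as output
        return (in_tok * in_price_per_m + billed_out * out_price_per_m) / 1e6

    print(cost_per_task(5_000, 800, 20_000, 0.30, 2.50))   # ~$0.054, "cheap" heavy thinker
    print(cost_per_task(5_000, 800, 2_000, 1.25, 10.00))   # ~$0.034, "pricey" light thinker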
Claude is just so good. Every time I try moving to ChatGPT or Gemini, they end up making concerning decisions. Trust is earned, and Claude has earned a lot of trust from me.
Honestly, Google models have this mix of smart and dumb that is scary. If the universe ever gets turned into paperclips, it'll probably be a Google model doing it.
Well, it depends. Just recently I had Opus 4.1 spend 1.5 hours looking at 600+ sources while doing deep research, only to get back to me with a report consisting of a single sentence: "Full text as above - the comprehensive summary I wrote". Anthropic acknowledged that it was a problem on their side but refused to do anything to make it right, even though all I asked them to do was to adjust the counter so that this attempt doesn't count against their incredibly low limit.
Same here. They have been aggressively increasing prices with each iteration (maybe because they started so low). I still hope that's not the case this time. GPT-5.1 is priced pretty aggressively, so maybe that's an incentive to keep the current Gemini API prices.
The prompt caching change is awesome for any agent. Claude is far behind, with increased costs for caching and manual cache checkpoints. It certainly depends on your application, but prompt caching is also ignored in a lot of cost comparisons.
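For anyone who hasn't used it, here's roughly what the manual checkpoint looks like with the Anthropic Python SDK (a sketch; the model id is just an example, and note that cache writes are billed at a premium over base input):

    import anthropic

    client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY

    long_system_prompt = "..."  # imagine thousands of tokens of tools/instructions

    response = client.messages.create(
        model="claude-sonnet-4-5",  # example model id
        max_tokens=1024,
        system=[{
            "type": "text",
            "text": long_system_prompt,
            # manual checkpoint: the prefix up to and including this
            # block is written to the cache and reused on later calls
            "cache_control": {"type": "ephemeral"},
        }],
        messages=[{"role": "user", "content": "First user turn"}],
    )

Gemini's implicit caching needs none of this markup; repeated prefixes are detected and discounted automatically.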
Though to be fair, thinking tokens are also ignored in a lot of cost comparisons, and in my experience Claude generally uses fewer thinking tokens for the same intelligence.
Since we have Cursor people joining the thread, let me bring up my constant problems with applying code changes. For background, I mostly work with "chat":
1. The apply button does not appear. This used to be mostly a problem with Gemini 2.5 Pro and GPT-5, but now it sometimes happens with all models. Very annoying, because I have to apply the changes manually.
2. Cursor doesn't recognize which file the changes should apply to and just uses the currently open file. Also very annoying, and it's impossible to redirect the changes to the file I actually wanted once they've been applied to the wrong one.
Both of these seem to happen when the context window is getting full and the conversation is summarized. Responding with the right file usually works, e.g. "great, let's apply those changes in @path/to/file". It may also be a good time to rewind to an earlier point in the conversation by editing one of your previous messages: edit the one that produced the response with changes not linked to a specific file, and including the file path in that prompt will usually get you back on track.
Voyage models are great in my experience, and I'm planning to test 3.5. Almost more interested in 3.5-lite, though. Great price.
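The client is minimal if anyone wants to kick the tires; a sketch, assuming the model ids are "voyage-3.5" / "voyage-3.5-lite" (check their docs):

    import voyageai

    vo = voyageai.Client()  # reads VOYAGE_API_KEY from the environment

    result = vo.embed(
        ["The quick brown fox"],
        model="voyage-3.5-lite",  # assumed model id
        input_type="document",    # use "query" at retrieval time
    )
    print(len(result.embeddings[0]))  # embedding dimension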
My concern: the Voyage API has been unreliable. And they were bought by MongoDB, which makes me a little uneasy.
Gemini embeddings look like a great model, but it's in preview and there haven't been any updates for a while (including at I/O). I'm also not sure how committed Google is to embedding models.