Ask HN: Is GPT-4o overall worse than GPT-4 for you?

aurareturn · on May 22, 2024

Small anecdote but I asked an SQL query and GPT4o got it wrong. It didn't work when I ran it. Pasted the same question into GPT4 and it got it right.

roncesvalles · on May 30, 2024

Maybe it's a placebo but I switched back to GPT-4. Something about GPT-4o's responses, it's too verbose and rambles on about generalities instead of capturing the nuance of the topic of question, which is what I really want to know. Almost like 3.5 in that regard.

serulin · on May 29, 2024

4o is completely trash, it literally won't listen, cant reason, talks non stop like an idiot gushing you with usless info you didnt ask for. Its like that one kid that over explains and thinks hes smart. Its not even that much faster, quality sacrifice is not usable.

joeythedolphin · on May 29, 2024

It's nearly useless for me. GPT-4 used to be so good, 3.5 even, but I believe they have nerfed processing power per request, and the OpenAI stack is virtually useless to me, I tend to rely on Claude.

meiraleal · on May 22, 2024

That's not my experience at all.

I was using Claude Opus to code before and now I'm back to ChatGPT. GPT-4o is faster, doesn't generate placeholders and works way better for me because of the larger context.

speedgoose · on May 22, 2024

I find it better overall, it's my default.

It's also what people think in blind tests: https://arena.lmsys.org

nullbio · on May 22, 2024

Take those results with a large grain of salt. It's dead simple to figure out which response is the new model purely by the speed at which the result is returned, and as such, this is incredibly easy to bot in an effort to manipulate the rankings.

They should really be streaming the content at the same time, based on the slowest responder.

speedgoose · on May 22, 2024

Who would manipulate the rankings to make OpenAI look better? Are you sure you can tell the difference between a small llama3 8b or a fast gemini?

btbuildem · on May 28, 2024

Came here from a google search with a similar sentiment..

I'm getting the strong impression that 4o is significantly weaker than 4, at least for dealing with coding snippets.

runjake · on May 22, 2024

I find 4o better than 4 in my experiences. Mostly doing code generation/correction in Python/JS, and asking science, business, finance, management, and other non-creative questions.

nullbio · on May 22, 2024

I've found it is a lot worse in general. I use GPT-4 90% of the time now, and 4o when I need something answered quickly that has a very simple answer.

atleastoptimal · on May 24, 2024

Half say 4o is better, others say it's worse. I'd wager it's probably about as good as 4T on average then.

EISENFELD · on May 22, 2024

In my experience GPT-4o is better in coding. I tested it with old C and Go. Both gave me better results.

Turboblack · on May 22, 2024

maybe someone else is delighted with this, but for me it’s all still the stone age

lmiller1990 · on May 22, 2024

I noticed this too, I find 4.0 is still better for giving me what I ask for.

EchoStar27 · on May 23, 2024

No issues so far. Quality seems to be similar as 4 but way faster

tikkun · on May 24, 2024

4o feels worse than 4 for me.

ciprianx · on May 24, 2024

It's worse.

muzani · on May 22, 2024

Yeah, it's much worse for me, worse than 3.5 even. Almost at the level of GPT-3 curie at worst.

I suspect it could be related to whatever it's using as language detection, because many others don't experience this. It glitches hard on language, often responding in the wrong one.

wruza · on May 22, 2024

Sorry for a tangent, but also is gpt-4 better for you than 8x7b?

When I return to 8x7b from gpt-4 it feels like I just shook off an unbearably boring guy and met a normal one, both very similar in knowledge (and unable to perform complex tasks).

kelsier1 · on May 22, 2024

Their claim hasn't been that 4o is better than 4. Just that it's faster and cheaper. So it's better than 3.5-turbo but not as good as 4, atleast from the examples I've tried out for summarization, code gen etc.

mnk47 · on May 22, 2024

No, their site literally says "our most powerful model" as the description for GPT4o, and it scores slightly higher than GPT-4 in their benchmarks: https://openai.com/index/hello-gpt-4o/

alecsm · on May 22, 2024

Why would they give free access to GPT4 when it's more expensive than 4o?