Hacker News | nickandbro's comments

It does well on SVGs beyond the "pelican riding on a bicycle" test. For example, this prompt:

"create a svg of a unicorn playing xbox"

https://www.svgviewer.dev/s/NeKACuHj

The final result still needs some tweaks, but given how much the ARC-AGI benchmark jumped, I'm guessing the model's improved visual abilities are what allow it to do this well.
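For anyone curious why "tweaks to the final result" are easy: an SVG is just XML text, so a model's drawing can be inspected and edited directly. A minimal Python sketch with a toy hand-written SVG (not the linked unicorn file):

```python
import xml.etree.ElementTree as ET

# A toy SVG of the kind a model might emit: plain XML text.
svg_text = """<svg xmlns="http://www.w3.org/2000/svg" width="100" height="100">
  <circle cx="50" cy="40" r="20" fill="white" stroke="black"/>
  <rect x="35" y="60" width="30" height="25" fill="green"/>
</svg>"""

# Because it's just XML, tweaking the result is an ordinary tree edit:
root = ET.fromstring(svg_text)
ns = "{http://www.w3.org/2000/svg}"
for rect in root.iter(ns + "rect"):
    rect.set("fill", "gray")  # recolor one element in place

print(ET.tostring(root, encoding="unicode").count("gray"))  # 1
```

The same approach works for deleting stray shapes or nudging coordinates the model got slightly wrong.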


Animated SVGs are one of the examples in the press release. Which is fine; I just think the quirky SVG benchmark is now dead. Gemini has beaten it, and the remaining differences come down to taste.

I don't know if it gained these abilities through generalization or if Google gave it a dedicated animated-SVG RL suite that drove this much improvement between models.

Regardless, we need a new vibe-check benchmark à la the pelican on a bicycle.


What benchmark, though? There is very clearly a lot of room for improvement in its SVG making capabilities. The fact that it can now, finally, make a pelican on a bike that isn’t completely wrong is not an indicator that SVG generation is now a solved problem.

Interesting how it went a bit more 3D with the style of that one compared to the pelican I got.

Unfortunately, it still fails my personal SVG benchmark (an educational 2D cross-section of the human heart), even after multiple iterations and rounds of screenshot feedback. Oh well, back to the (human) drawing board.

I'm thinking now that as models get better and better at generating SVGs, there could be a point where we can use them to just make arbitrary UIs and interactive media with raw SVGs in realtime (like flash games).

> there could be a point where we can use them to just make arbitrary UIs and interactive media with raw SVGs

So render ui elements using xml-like code in a web browser? You’re not going to believe me when I tell you this…


Or quite literally a game where SVG assets are generated on the fly using this model
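The game idea above boils down to asset generation at runtime. A minimal sketch in Python, assuming the model (or any generator) picks parameters rather than pixels; the `make_asset` helper and its values here are hypothetical, not model output:

```python
def make_asset(cx: int, cy: int, r: int, color: str) -> str:
    """Build a game sprite as an SVG string from runtime parameters.

    In the scenario above, a model would choose these parameters (or
    emit the markup itself) during gameplay; here they are hard-coded.
    """
    return (
        f'<svg xmlns="http://www.w3.org/2000/svg" width="{2*r+4}" height="{2*r+4}">'
        f'<circle cx="{cx}" cy="{cy}" r="{r}" fill="{color}"/>'
        "</svg>"
    )

# Each frame or game event can request a fresh, resolution-independent asset:
coin = make_asset(cx=20, cy=20, r=18, color="gold")
print("circle" in coin)  # True
```

Because SVG is vector-based, each generated asset scales cleanly at any resolution, which is part of the appeal over generating raster sprites on the fly.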

That's one step before another long-term milestone: realtime generation of 3D mesh content during gameplay.

That's the "left brain" approach, versus the "right brain" approach of coming at dynamic video games from the diffusion-model direction, which is what the Gemini Genie thing seems to be about.


On the other hand, generation of other vector image formats (e.g. "create a postscript file showing a walrus brushing its teeth") hasn't improved nearly as much.

Perhaps they're deliberately optimising for SVG generation.


Can we move on from SVGs to 3D models at some point?

Image to model is already a thing, and it's pretty good.

Currently working on:

https://vimgolf.ai

to show newbies how to use Vim. It's not complete yet and has major issues, so feel free to give it a go, but please hold your judgement: not all shortcuts have been added.


I have found GPT 5.3-Codex to do exceedingly well when working with graphics rendering pipelines. They must have better training data or RL approaches than Anthropic: I gave the same prompt and config to Opus 4.6, and it added unwanted rendering artifacts. This may just be an issue specific to my use case, but since OpenAI is partnered with MSFT, which makes lots of games, I wonder if this is an area they invested in heavily.

That's insane. Deflock is a map of Flock cameras.

The definition of terrorism is:

the unlawful use of violence and intimidation, especially against civilians, in the pursuit of political aims.

Deflock couldn't be further from that.


While I think the use of the term “terrorist” is unwarranted, I do think Deflock is seeking political change. The decision to use Flock is a government policy choice, right?


Just the people’s choice, right? They voted for this government policy, right???!? https://www.coloradopolitics.com/2025/10/22/denver-mayor-ext...

>> “I was stunned to learn late yesterday that after convening a task force of local and national experts, Mayor Johnston has been negotiating secretly with the discredited CEO of Flock Safety and signing another unilateral extension of this mass surveillance contract with no public process and no vote from the City Council or input from his own task force,” Councilmember Sarah Parady told The Denver Gazette.


What is the point of this comment? Are you saying that deflock are not terrorists but are terrorist adjacent? Why respond to someone defining terrorism by pointing out that 2 words at the end of the definition also apply to deflock? Do those not apply to basically everyone who participates in their country's society, including literally everyone who votes and all politicians?

Political parties seek political change too, but that doesn't make them terrorists. Deflock isn't trying to intimidate or cause violence to citizens.

If corporations can be people, cameras can be people too! Think of the cameras! /s


Please don't give them ideas like that, even in jest.

I am very curious whether this app is making money, or whether users are just using the two generators and then leaving. If it's the former, I am very impressed with your wrapper around the image-gen models.


I can imagine the reverse model could be very profitable with every real estate agent using it to make dreary photos look great.


Reverse model aimed at estate agents already posted in this thread by someone: https://news.ycombinator.com/item?id=46829566


This landing page is a lead-gen tool for the architect at the bottom.


Ahh, I see that. Thanks


This could be the future of film. Instead of prompting blind, not knowing what the model will produce, you could use fine-grained motion controls to get the shot you are looking for. If you wanted to adjust the shot afterward, you could checkpoint the model there by taking a screenshot and rerunning. Crazy.


I feel like people are already currently doing this. Essentially storyboarding first.

This guy a month ago for example: https://youtu.be/SGJC4Hnz3m0


Great work! I really respect AI2; they open-source everything: the model, the weights, the training pipeline, the inference stack, and the corpus.


Interesting, I use cloudflare containers and it takes roughly 6-7 seconds to boot up using a very lightweight image.


Maybe show how it works instead of making the home page a login screen.


I wonder if some of the docs from https://app.wafer.ai/docs could be used to make the model better at writing GGML kernels. Interesting use case.

