Hacker Newsnew | past | comments | ask | show | jobs | submit | rohitpaulk's commentslogin

Saw this on Twitter a few weeks ago, interesting approach.

The post links to the GitHub repo, but imo the website does a better job of explaining what it does: https://ataraxy-labs.github.io/weave/


Thanks for sharing the website as well here. Appreciate it!

Two questions I'd have expected the article to answer:

(a) how did they identify the employee, and (b) how come they weren't sent to jail


Surprised that none of the comments here are comparing this to Bootstrap.


Yeah reminds me of early Bootstrap


Wouldn't a comparison to Bulma be more apt?


This sounds like a good optimization task to give a long-running agent. Ask it to come up with experiments and maximize the % of successful edits.


In case anyone finds it useful, we (CodeCrafters) built a coding challenge as a companion to this book. The official repository for the book made this very easy to do since it has tests for each individual chapter.

Link: https://app.codecrafters.io/courses/interpreter/overview


Not sure why this ad (access needs paid membership) is the top comment


Wild how this beats 2.5 Pro in every single benchmark. Don't think this was true for Haiku 4.5 vs Sonnet 3.5.


Sonnet 3.5 might have been better than opus 3. That's my recollection anyhow


For those who haven't seen the actual issue that links to, I'd take a closer look. Pretty insane: https://github.com/actions/runner/issues/3792#issuecomment-3...


Yeah, calling the authors of this code losers and monkeys is being kind. There is zero excuse for ever writing code like this, the incompetence is staggering.


Always felt dependency updates are a perfect fit for AI agents:

(a) they’re broadly similar across companies,

(b) they aren’t time-sensitive, so the agent can take hours without anyone noticing, and

(c) customers are already accustomed to using bots here, just bad ones


One would imagine they are broadly similar; but that's off the assumption that codebases are similar as well.

Migrations between versions can have big variance largely as a function of the parent codebase and not the dependency change. A simple example of this would be a supported node version bump. It's common to lose support for older node runtimes with new dependency versions, but migrating the parent codebase may require large custom efforts like changing module systems.


I noticed this with Harvey / Legal.


Where do I buy one?



Love to know as well!


Link added above :)


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: