adastra22 · 2025-12-09T17:15:49 1765300549

That’s the joke…

lagniappe · 2025-12-09T17:59:10 1765303150

Really? What's the punchline? I like jokes.

adastra22 · 2025-12-09T17:12:48 1765300368

There is no technical difference.

adastra22 · 2025-12-08T23:26:55 1765236415

Mount the parent read-only.

adastra22 · 2025-12-08T23:25:07 1765236307

Frustratingly, it isn’t an original idea either, and there were better implementations even at the time.

adastra22 · 2025-12-08T23:15:42 1765235742

Is there a jj solution for git LFS?

steveklabnik · 2025-12-09T15:54:42 1765295682

There’s a PR open with basic support, but it hasn’t been merged yet.

Large files in general is something the project would like to solve but it’s gonna take time.

adastra22 · 2025-12-08T23:05:40 1765235140

It usually doesn’t introduce grammatical mistakes at the same time though.

adastra22 · 2025-12-08T23:01:14 1765234874

These aren’t companies doing the patient storage. They are nonprofits set up and run by people who are signed up themselves.

FridayoLeary · 2025-12-08T23:05:48 1765235148

It's basically the same thing. What's stopping them from eventually losing interest, or the nonprofit getting hijacked? I personally think the whole thing is a huge waste of money, and I can imagine some guys in 50 years will think so too.

adastra22 · 2025-12-09T01:16:37 1765242997

That they and the people they care about are stored there. This is a far stronger incentive than any profit motive.

michaelt · 2025-12-09T00:47:47 1765241267

Wasn't OpenAI a non-profit?

adastra22 · 2025-12-09T02:45:22 1765248322

The second part of that sentence was the important bit.

IAmBroom · 2025-12-09T14:29:27 1765290567

And since all things under that umbrella are exactly the same...

adastra22 · 2025-12-08T04:36:26 1765168586

The timelines are increasing powers of 2. It’ll take much longer to colonize all asteroids than to settle Mars.

adastra22 · 2025-12-07T09:30:05 1765099805

Yes, training is considered fair use, and output is non-copyrightable / public domain. With many asterisks and footnotes, of course.

Madmallard · 2025-12-07T09:35:07 1765100107

Don't see how output being public domain makes sense when they could be outputting copyrighted code.

Shouldn't the rights extend forward and simply require the LLM code to be deleted?

adastra22 · 2025-12-07T09:54:06 1765101246

With many asterisks and footnotes. One of which being that if it literally output the exact code, of course that would be copyright infringement. Something that greatly resembled the original but with minor changes would be a gray area.

Those kinds of cases, although they do happen, are exceptional. A typical output that doesn't resemble a single training input line-for-line is considered a new, but non-copyrightable, work.

vegardx · 2025-12-07T17:46:17 1765129577

(I'm not a lawyer)

You should be careful about speaking in absolute terms when talking about copyright.

There is nothing that prevents multiple people from owning copyright to identical works. This is also why copyright infringement is such a mess to litigate.

I'd also be interested in knowing why you think code generated by LLMs can't be copyrighted. That's quite a statement.

There's also the problem with copyright law and different jurisdictions.

adastra22 · 2025-12-08T04:10:25 1765167025

It is the official stance of the US Copyright Office.

It was upheld by Thaler v. Perlmutter.

Bartz v. Anthropic and Kadrey v. Meta confirmed with similar rulings.

menaerus · 2025-12-07T09:50:38 1765101038

First, you have to prove that it produced the copyrighted code. The question is, what is copyrighted code in the first place? A literal copy-paste from the source is easy to show, but I think 99% of the time this isn't the case.

adastra22 · 2025-12-06T04:33:29 1764995609

This is a great article. Does anyone know of more content like this?