The Internet Is About to Get Much Worse

Jure Repinc · 1 year ago

The Internet Is About to Get Much Worse

FaceDeer · 1 year ago

The code that AI produces isn’t “copied” from those original authors, though. The AI learned how to code from them, it isn’t literally copying and pasting from them.

If you think a bit of code is “really from” XYZ open-source project, that’s a copyright violation and you can pursue that legally. But you’ll need to actually show that the code is a copy.

@NeoNachtwaechter@lemmy.world · 1 year ago

The copyright violation has happened when the code got fed into that AI’s greedy gullet, not when it came out of it’s rear end.

FaceDeer · 1 year ago

That remains to be tested legally speaking, and I don’t think it’s likely to pass muster. If it was trained correctly (ie, no overfitting) the resulting AI model does not contain a copy of the training inputs in any identifiable sense.

@NeoNachtwaechter@lemmy.world · 1 year ago

Yes, the laws are probably muddy in Usa as usual, but rather clear here in the EU. But legal proceedings are slow, and Big Tech is making haste with their feeding.

FaceDeer · 1 year ago

There are many jurisdictions beyond the US and EU, Japan in particular has been very vocal about going all-in on allowing AI training. And I wouldn’t say the EU’s laws are “clear” until they are actually tested.

@kibiz0r@midwest.social · 1 year ago

Your justification seems to rest on whether LLM training technically passes the legal standard of violating IP.

That’s not a super compelling argument to me, because:

Nobody designed current IP law with LLMs in mind
I would wager that a vast majority of creators whose works were consumed by LLMs did not consider whether their license would permit such an act, and thus didn’t meaningfully consent to have their work used this way (whether or not the law would agree)
I would argue that IP law is heavily stacked in favor of platforms (who own IP, but do not create it) and against creators (who create, but do not own IP) and consumers

I don’t think that there is fundamentally anything wrong with LLMs as a technology. My problem is that the economic incentives are misaligned with long-term stability of the creative pools that fuel these things in the first place.