Gentoo bans AI-created contributions
Gentoo bans AI-created contributions
Posted Apr 19, 2024 17:09 UTC (Fri) by ballombe (subscriber, #9523)In reply to: Gentoo bans AI-created contributions by kleptog
Parent article: Gentoo bans AI-created contributions
But this is not uniform compression. The most relevant parts are kept mostly verbatim, the least relevant part are ignored. The AI trick is that there is no easy way to find which parts are kept verbatim, and there is no easy way to find the source.
Posted Apr 19, 2024 17:28 UTC (Fri)
by mb (subscriber, #50428)
[Link] (4 responses)
This is not a technical problem at all.
But. Machine learning breaks Copyright in a fundamental way in that it is very similar to human learning so that one can use human learning reasoning, but at the same time it is fast and cheap.
While you need to put significant effort into your work when human-learning from others and creating new non-derived work, with MI this is just a click of a button.
A human "filter" processing ("learning") work set "A" into non-derived work "B" is expensive. So it's almost never done just for copying and Copyright erasing.
Copyright is fundamentally broken. It's not a technical problem.
Posted Apr 19, 2024 20:17 UTC (Fri)
by kleptog (subscriber, #1183)
[Link] (1 responses)
This is absurd. The value of a copyrighted work is not dependant on the amount of effort that went into it.
If anything LLMs are a great equaliser. It used to be that to be a great writer you needed to have a great idea for a story and the skill to execute it. Now people with a great idea but not quite as good writing skills get a chance they might not otherwise.
Copyright protects the economic value and moral value of a work. The fact that other people can now also create new works easier does not reduce the value of copyright at all (or break it). Copyright does not protect all uses of your work, only those that reduce the economic value of the original.
Posted Apr 19, 2024 20:49 UTC (Fri)
by mb (subscriber, #50428)
[Link]
I did never claim that.
Posted Apr 21, 2024 22:37 UTC (Sun)
by NYKevin (subscriber, #129325)
[Link] (1 responses)
This is prosaically true in the sense that it's ultimately a judgment call on the part of the trier of fact, but as a matter of law, there absolutely is an answer, and the law generally expects you to know it. You can wave your hands about copyright "breaking" all you want, but the legal system is not going to be impressed.
You are correct, however, that AI does put the legal system in a bit of an awkward spot. Up until now, derivative works have been decided by the trier of fact (judge or jury) looking at the original and the allegedly infringing work side by side, and seeing if they're close enough that copying can be inferred. The legal term used in the US is "substantially similar" (or "strikingly similar"), but most countries are going to use a similar method in their courts.
That works fine when you have one original. When you have two billion originals, and an unbounded set of potentially infringing works, it's a bit impractical. Right now, the unspoken expectation is that the plaintiff has to do the leg work of figuring out which images to put side by side in this comparison (US courts would say the plaintiff is "master of their complaint" and thus responsible for deciding exactly what is and is not in scope). That's not easy in the case of AI, and it's the main reason (or at least, a major reason) that artists have struggled to sue image generators successfully.
But that does not imply that an image generator "erases" copyright as you have put it. If an artist is able to find a specific infringing output that closely resembles their art, and their art was used as a training input, then the artist might have a case. Saying "the AI breaks copyright" is not going to be an effective defense.
Posted Apr 21, 2024 22:39 UTC (Sun)
by NYKevin (subscriber, #129325)
[Link]
Gentoo bans AI-created contributions
It's not relevant how you compressed the data to be a derived work.
And there is no single right or wrong answer to whether something is a derived works. Always been like that.
This is where Copyright breaks. It's hard for a human to create new non-derived work.
But it's cheap for machine learning to do the same thing.
A machine learning filter, however, is cheap and is easy to erase Copyright in that way.
Gentoo bans AI-created contributions
Gentoo bans AI-created contributions
Gentoo bans AI-created contributions
Gentoo bans AI-created contributions