How to check copyright?
How to check copyright?
Posted Oct 2, 2025 17:16 UTC (Thu) by Wol (subscriber, #4433)In reply to: How to check copyright? by farnz
Parent article: Fedora floats AI-assisted contributions policy
It also "conveniently forgets" that any developer worth their salt is exposed to a lot of code for which they do not hold the copyright, and may not even be aware of the fact that they are recalling verbatim chunks of code they memorised at Uni / another place of work / a friend showed it to them.
So all this complaining about AI-generated code could also be applied pretty much the same to developer-generated code, it's just that we don't think it's a problem if it's a developer, some people think it is if it's an AI.
Personally, I'd be quite to happy to ingest AI-generated code into my brain, and then regurgitate the gist of it (suitably modified for corporate guidelines/whatever). By the time you've managed to explain in excruciating detail to the AI what you want, it's probably better to give it a simple explanation and rewrite the result.
Okay, that end result may not be "clean room" copyright compliant, but given the propensity for developers to remember code fragments, I expect very little code is.
We have a problem with musicians suing each other for copying fragments of songs (which the "copier" was probably unaware of - which the copyright *holder* probably copied as well without being aware of it!!!), how can we keep that out of computer programming? We can't, and that's assuming AI had no hand in it!
Cheers,
Wol
Posted Oct 3, 2025 13:20 UTC (Fri)
by alex (subscriber, #1355)
[Link]
Patents where a separate legal rabbit hole.
Posted Oct 3, 2025 15:12 UTC (Fri)
by stefanha (subscriber, #55072)
[Link] (5 responses)
In the original comment I linked to a paper about extracting copyrighted content from LLMs. A web search brings up a bunch more in this field that I haven't read. Here is one explicitly about generated code (https://arxiv.org/html/2408.02487v3) that says "we evaluate 14 popular LLMs, finding that even top-performing LLMs produce a non-negligible proportion (0.88% to 2.01%) of code strikingly similar to existing open-source implementations".
I think AI policies are getting ahead of themselves when they assume that a contributor can vouch for license compliance. There needs to be some kind of lawyer-approved solution to this so that the open source community is protected from a copyright mess.
Posted Oct 3, 2025 15:25 UTC (Fri)
by farnz (subscriber, #17727)
[Link] (4 responses)
We know that humans accidentally and unknowingly infringe, too. Why can't we reuse the existing lawyer-approved solution to that problem for LLM output?
Posted Oct 3, 2025 16:47 UTC (Fri)
by Wol (subscriber, #4433)
[Link] (3 responses)
If I had an LLM and found myself sued like that, I'd certainly want to drag the querier into it ...
Cheers,
Posted Oct 6, 2025 14:24 UTC (Mon)
by stefanha (subscriber, #55072)
[Link] (2 responses)
Hence why contributors need a way to check copyright compliance.
Posted Oct 6, 2025 14:29 UTC (Mon)
by farnz (subscriber, #17727)
[Link]
Posted Oct 6, 2025 14:36 UTC (Mon)
by pizza (subscriber, #46)
[Link]
This is a legal problem, and cannot be solved via (purely, or even mostly) technical means.
How to check copyright?
How to check copyright?
There's a critical piece of data missing - what proportion of human-written code is strikingly similar to existing open-source implementations?
How to check copyright?
How to check copyright?
Wol
How to check copyright?
TBF, you also need such a mechanism to check copyright compliance of any code you've written yourself - you are also quite capable of accidental infringement (where having seen a particular way to write code before, you copy it unintentionally), and to defend yourself or the project you contribute to, you have to prove either that you never saw the original code that you're alleged to have copied (the clean room route) or that this code is on the "idea" side of the idea-expression distinction (however that's expressed in local law).
How to check copyright?
How to check copyright?
