No disclosure for LLM-generated patch?
No disclosure for LLM-generated patch?
Posted Jun 27, 2025 9:11 UTC (Fri) by Funcan (subscriber, #44209)In reply to: No disclosure for LLM-generated patch? by drago01
Parent article: Supporting kernel development with large language models
I vaguely remember some llm providers include legal wavers for copyright where they take on the liability, but I can't find one for e.g. copilot right now
Posted Jun 27, 2025 10:51 UTC (Fri)
by mb (subscriber, #50428)
[Link] (3 responses)
If you as a human learn from proprietary code and then write Open Source with that knowledge, it's not copying unless you actually copy code sections. Same goes for LLMs. If it produces a copy, then it copied. Otherwise it didn't.
Posted Jun 27, 2025 11:47 UTC (Fri)
by laarmen (subscriber, #63948)
[Link] (1 responses)
Posted Jun 27, 2025 12:57 UTC (Fri)
by mb (subscriber, #50428)
[Link]
It's in no way required to avoid copyright problems.
And you can also use that concept with LLMs, if you want.
Posted Jul 1, 2025 9:51 UTC (Tue)
by cyphar (subscriber, #110703)
[Link]
You could just as easily argue that LLMs produce something equivalent to a generative collage of all of their training data, which (given the current case law on programs and copyright) would mean that the copyright status of the training data would be transferred to the collage. You would thus need to make an argument for a fair use exemption for the output, which your example would not pass muster.
However, this is not the only issue at play here -- to submit code to Linux you need to sign the DCO, which the commit author did with their Signed-off-by line. However, none of the sections of the DCO can be applied to LLM-produced code, and so the Signed-off-by is invalid regardless of the legal questions about copyright and LLM code.
Posted Jun 27, 2025 16:57 UTC (Fri)
by geofft (subscriber, #59789)
[Link]
https://blogs.microsoft.com/on-the-issues/2023/09/07/copi...
"Specifically, if a third party sues a commercial customer for copyright infringement for using Microsoft’s Copilots or the output they generate, we will defend the customer and pay the amount of any adverse judgments or settlements that result from the lawsuit, as long as the customer used the guardrails and content filters we have built into our products."
See also https://learn.microsoft.com/en-us/legal/cognitive-service... . The exact legal text seems to be the "Customer Copyright Commitment" section of https://www.microsoft.com/licensing/terms/product/ForOnli...
No disclosure for LLM-generated patch?
No disclosure for LLM-generated patch?
No disclosure for LLM-generated patch?
Just don't copy and then you are safe.
Learning is not copying.
Just feed the output from one LLM into the input of another LLM and you basically get the same thing as with two human clean-room teams.
No disclosure for LLM-generated patch?
No disclosure for LLM-generated patch?