LWN: Comments on "DeVault: GitHub Copilot and open source laundering" https://lwn.net/Articles/898772/ This is a special feed containing comments posted to the individual LWN article titled "DeVault: GitHub Copilot and open source laundering". en-us Tue, 28 Oct 2025 23:20:49 +0000 Tue, 28 Oct 2025 23:20:49 +0000 https://www.rssboard.org/rss-specification lwn@lwn.net Not trained on MS closed source https://lwn.net/Articles/914673/ https://lwn.net/Articles/914673/ glenn <div class="FormattedComment"> It took some time, but there's this one: <a href="https://lwn.net/Articles/914150">https://lwn.net/Articles/914150</a><br> <p> It's not unthinkable that users of the tool who for commercial purposes could also be sued.<br> </div> Fri, 11 Nov 2022 22:16:17 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/906482/ https://lwn.net/Articles/906482/ mathstuf <div class="FormattedComment"> IIRC, search was hampered with rate limits and &quot;must be logged in to search repo contents&quot; since before the acquisition.<br> </div> Tue, 30 Aug 2022 15:22:18 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/906418/ https://lwn.net/Articles/906418/ scientes <div class="FormattedComment"> I think there is too much attention to this feature rather the freedom-stealing aspects of github.<br> <p> What I noticed is that github has serious rate-limiting on their search functionality, but google&#x27;s codesearch project (which was a nerdy project and the RE2 regex library was built for it) embraced these type of searches. It looks like Microsoft is hostile to technical proficiency and just wants control and classical Microsoft things.<br> </div> Tue, 30 Aug 2022 13:27:03 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899583/ https://lwn.net/Articles/899583/ nim-nim <div class="FormattedComment"> It would be very hard to argue in court that code produced by an AI trained on an existing code base is anything else than a derivation of this code base, since there is no human inventivity involved to muddy up waters.<br> <p> Despite years of commercial pretense to the reverse professionals know “smart” systems are no smarter than the human who coded them.<br> </div> Fri, 01 Jul 2022 08:32:20 +0000 Not trained on MS closed source https://lwn.net/Articles/899488/ https://lwn.net/Articles/899488/ nim-nim <div class="FormattedComment"> FOSS is not the same thing as public domain, it is all copyrighted<br> </div> Thu, 30 Jun 2022 13:45:46 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899342/ https://lwn.net/Articles/899342/ farnz <p>It's also worth noting in this context that part of the reason to have vendor scripts of some form (be they AtomBIOS, ACPI or others) is that power delivery changes can't happen instantly - and different board manufacturers will have done different transient analysis to determine what their hardware can reliably support. The results of that analysis (whether it's a "rule of thumb" assessment or a proper calculation) need communicating to the driver somehow - and a small scripting language is as good as any other way to deal with it, especially since the edge cases get complex if you're doing a per-board calculation based on measuring the final system during post-manufacture testing of a board. <p>That said, given the quality of some vendor code, I understand Luc's reluctance to trust it - I've encountered one vendor who asserted that the <em>CPU</em> would detect an <tt>OUT 0xCF8, EAX</tt> instruction in userspace and then ensure that nothing else accessed PCI configuration space until the same userspace process executed either <tt>OUT 0xCFC, EAX</tt> or <tt>IN EAX, 0xCFC</tt> later, on the basis that if the CPU didn't do that, it would be possible for their userspace driver to crash. I'm not even sure how this could work under Linux… Wed, 29 Jun 2022 11:31:39 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899338/ https://lwn.net/Articles/899338/ mjg59 <div class="FormattedComment"> The easiest way to think about Atom is like ACPI - it&#x27;s a lump of bytecode that&#x27;s interpreted in kernel context, and which talks to hardware the kernel could already talk to. We could rip out the ACPI interpreter in the kernel and hardcode knowledge of every singly ACPI board instead, and we&#x27;d have successfully replaced non-free code with free code. But we&#x27;d also have a kernel that didn&#x27;t boot on a bunch of new systems until people had reverse engineered the relevant ACPI code and reimplemented it, and so nobody has seriously suggested doing that.<br> <p> There&#x27;s tradeoffs. I&#x27;d love to avoid having to rely on non-free code to make hardware work, and I&#x27;m not going to criticise people for writing drivers that do that. But the reality is that any such driver is going to work less well than a driver that uses the defined interface to call non-free firmware (in much the same way that we call into non-free UEFI code to set boot variables these days), and so there&#x27;s value in the driver that calls non-free code existing, and also it&#x27;s unsurprising that distros would pick the one that works with more hardware.<br> <p> Luc&#x27;s priorities on -radeonhd were probably based on his experience with the VIA chipsets that were extremely limited by what the BIOS permitted (and yeah it turns out that not being able to set any modes other than those that are hardcoded in the BIOS is not good!), but the outcome was also that his driver for those chipsets simply didn&#x27;t work on all hardware - I had a VIA-based laptop that would just give a black screen with his driver, because the BIOS didn&#x27;t match his expectations. To be completely fair, on Radeon I hit some similar constraints when I was researching reclocking the RAM for power management - the Atom scripts simply took too long, so I took out a bunch of the wait statements and hardcoded those into the kernel and it was great, and then after a couple of days of uptime the card would wedge during a reclock and also it didn&#x27;t work on all hardware, so it turns out there was a reason that those were there in the first place. So I absolutely understand the desire to have native code for all of this, but also in the absence of vendors providing explicit contracts about hardware behaviour, a driver that doesn&#x27;t use the defined interfaces is inherently going to break things.<br> </div> Wed, 29 Jun 2022 10:31:06 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899337/ https://lwn.net/Articles/899337/ flussence <div class="FormattedComment"> That first paragraph&#x27;s a letdown, but...<br> <p> I learned something today. Thanks for explaining all that to my dumb ass.<br> <p> This is the kind of thing that makes me keep my subscription to the site going.<br> </div> Wed, 29 Jun 2022 10:12:22 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899215/ https://lwn.net/Articles/899215/ Wol <div class="FormattedComment"> (And to gspr)<br> <p> Oh the joys of the ambiguity of English ...<br> <p> As I read it &quot;copyright attaches to the model&quot; - in other words there is no copyright *in* the model. But if there is copyright in the *original*, then that applies *to* the model as well ...<br> <p> I think we&#x27;re talking at cross purposes ... :-)<br> <p> Cheers,<br> Wol<br> </div> Tue, 28 Jun 2022 08:00:05 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899213/ https://lwn.net/Articles/899213/ gspr <div class="FormattedComment"> By this logic, unlicensed redistribution of copyrighted images, video and audio are fine as long as the material is recompressed. It&#x27;s just math!<br> <p> Clearly absurd.<br> </div> Tue, 28 Jun 2022 06:37:56 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899210/ https://lwn.net/Articles/899210/ hummassa <div class="FormattedComment"> I will offer an informed alternative to your pov, Wol:<br> <p> If a mathematical model produces as its output a perfect reproduction of an copyrightable and copyrighted work (something novel produced by the human mind), then said mathematical model is nothing but a copying apparatus. There is no difference between the neural model of Copilot and a big HD containing all the works it&#x27;s seen, just well indexed. The output is just a copy of the copyrighted work, subject to the same protections under the laws and treaties.<br> </div> Tue, 28 Jun 2022 01:26:16 +0000 Not trained on MS closed source https://lwn.net/Articles/899191/ https://lwn.net/Articles/899191/ bluca <div class="FormattedComment"> Or more accurately, it is trained on publicly (legally) available corpora, as the law requires to be excepted from copyright restrictions<br> </div> Mon, 27 Jun 2022 18:31:00 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899185/ https://lwn.net/Articles/899185/ Wol <div class="FormattedComment"> Are you talking about a real-world model - like you made an aircraft model from a photo of an aircraft? In that case it&#x27;s a physical object and patent rules should apply, or design rules, and NOT copyright rules.<br> <p> If, however, you&#x27;re referring to a model like most people here are - computer science, otherwise known as maths - then equally copyright should NOT apply, because it&#x27;s maths. Or &quot;sweat of the brow&quot;. Or a whole other bunch of doctrines that lawyers do their best to mis-understand but that state quite clearly it is not copyrightable material.<br> <p> Cheers,<br> Wol<br> </div> Mon, 27 Jun 2022 16:51:40 +0000 Not trained on MS closed source https://lwn.net/Articles/899171/ https://lwn.net/Articles/899171/ excors <div class="FormattedComment"> I think there are significant non-copyright reasons to exclude non-public code. E.g. The Copilot FAQ says:<br> <p> <font class="QuotedText">&gt; Because GitHub Copilot was trained on publicly available code, its training set included public personal data included in that code. From our internal testing, we found it to be rare that GitHub Copilot suggestions included personal data verbatim from the training set. [...] We have implemented a filter that blocks emails when shown in standard formats, but it’s still possible to get the model to suggest this sort of content if you try hard enough. We will keep improving the filter system to be more intelligent to detect and remove more personal data from the suggestions.</font><br> <p> &quot;rare&quot; != &quot;never&quot;, and if someone stores sensitive personal data in a private GitHub repository then they absolutely don&#x27;t want Copilot to reveal that information publicly to anyone who tries hard enough.<br> <p> I expect the same applies to other confidential information, like yet-to-be-announced product names that companies might store in private repositories, or secret keys, or algorithms that they&#x27;re protecting as trade secrets, etc.<br> <p> Since the Copilot training data apparently includes public repositories even if they have a restrictive license, but excludes private repositories even if they have a very permissive license, it sounds like GitHub is confident that there are no copyright issues but is concerned about those other privacy issues.<br> </div> Mon, 27 Jun 2022 14:44:36 +0000 Not trained on MS closed source https://lwn.net/Articles/899168/ https://lwn.net/Articles/899168/ geert <div class="FormattedComment"> &quot;trained on A, including B&quot;, does not mean that it was trained on B only.<br> So it may have been trained on any source code that is publicly available; we don&#x27;t know what exactly, the description in the FAQ is very vague (deliberately?).<br> <p> FWIW, it might have been trained on whatever proprietary code that was ever leaked to the Internet, which might even include the sources of some version of Microsoft Windows ;-)<br> <p> </div> Mon, 27 Jun 2022 14:26:28 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899166/ https://lwn.net/Articles/899166/ anselm <blockquote><em>For the purposes of determining whether X is a derivative work of Y, the judge looks at X, Y (its contents, not its license), and the copyright statute. Nothing else.</em></blockquote> <p> I think it would still be of some interest whether X resulted from Y through “cp Y X” or through a query to Copilot whose model was trained on Y. In the first case, X is pretty clearly a derived work of Y. In the second case, Microsoft, at least, would probably like to claim it isn't. </p> Mon, 27 Jun 2022 14:18:26 +0000 Not trained on MS closed source https://lwn.net/Articles/899118/ https://lwn.net/Articles/899118/ nim-nim <div class="FormattedComment"> Or more accurately, they are *very* sure of the (il)legality of their approach and only dare to use it against projects that they feel won’t fight back.<br> <p> Microsoft has access to plenty enough of proprietary code to train a model on, that they chose to use other people’s code instead says volumes.<br> </div> Mon, 27 Jun 2022 12:45:01 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899116/ https://lwn.net/Articles/899116/ mathstuf <div class="FormattedComment"> Eh. `git-remote-hg` exists and works well enough that it&#x27;s the only way that I interact with Mercurial repos at least.<br> </div> Mon, 27 Jun 2022 09:22:24 +0000 Not trained on MS closed source https://lwn.net/Articles/899112/ https://lwn.net/Articles/899112/ NYKevin <div class="FormattedComment"> GitHub does not require uploaded code to be FOSS, as long as it allows the use of GitHub&#x27;s &quot;fork&quot; button (loosely equivalent to git clone), and one or two other pieces of site functionality. In theory, they could have limited their training set to only include FOSS repositories, and not proprietary or no-license repositories (in most jurisdictions, no license means the same thing as &quot;all rights reserved&quot;).<br> <p> But I couldn&#x27;t find any statement in their FAQ one way or the other - it just refers to &quot;public repositories on GitHub,&quot; a category including both FOSS and proprietary code. It&#x27;s entirely possible that they are using all of that code, and IMHO that seems like the most straightforward way to read the sentence (which doesn&#x27;t mean that it is the intended meaning, of course).<br> </div> Mon, 27 Jun 2022 05:41:28 +0000 Not trained on MS closed source https://lwn.net/Articles/899110/ https://lwn.net/Articles/899110/ bluca <div class="FormattedComment"> Exactly which proprietary stuff is publicly available and wasn&#x27;t used?<br> </div> Mon, 27 Jun 2022 00:31:42 +0000 Not trained on MS closed source https://lwn.net/Articles/899105/ https://lwn.net/Articles/899105/ LtWorf <div class="FormattedComment"> There&#x27;s plenty of proprietary stuff on github that they didn&#x27;t dare to use.<br> <p> I think this alone shows that they are not very sure about the legality of what they are doing, but trust that developers won&#x27;t be able to do anything about it (unlike the paying customers).<br> </div> Sun, 26 Jun 2022 22:27:38 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899103/ https://lwn.net/Articles/899103/ sjj <div class="FormattedComment"> Isn&#x27;t 20 years enough to carry a grudge? Those middle managers have moved on a long time ago already.<br> </div> Sun, 26 Jun 2022 20:16:01 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899100/ https://lwn.net/Articles/899100/ mjg59 <div class="FormattedComment"> <font class="QuotedText">&gt; the R200/(reverse-engineered)R300 driver also used to be blob-free</font><br> <p> The DRM side of the R2/3/400 driver always required a firmware blob - but for a long time it was just embedded inside the kernel driver, so wasn&#x27;t user visible. My recollection (which seems to be supported by the driver, but it&#x27;s been a long time since I looked at this properly so I could be wrong) is that a bunch of the 2D acceleration in that driver depended on DRM, so effectively the 2D driver also had a blob dependency if you wanted it to work properly.<br> <p> The difference between -radeonhd and -ati as far as reliance on firmware goes was that the defined interface to various pieces of card functionality was to execute interpreted scripts present in the card flash. These scripts didn&#x27;t do anything that the driver couldn&#x27;t, so you could absolutely reimplement that functionality in the driver - the problem is that card vendors used these scripts as a way to abstract hardware differences (eg, using RAM from different vendors with different timing constraints), and ignoring Atom would mean having to have card-specific data in the driver before that card would work correctly. A hybrid approach is to use Atom for data but not for code, but even then there are still risks due to the fact that the defined interface is the scripts and not the data tables. A card vendor could modify the way the script interpreted the tables (or even hardcode stuff directly into the script) and again you&#x27;d need card-specific knowledge to avoid that. -radeonhd spent a while trying to avoid executing any Atom code, but effectively relied on it anyway - it couldn&#x27;t program the card from cold and so depended on the system firmware having executed the scripts before it ran. In any case, support for executing Atom code (including running the ASIC init function) was added to -radeonhd by September of 2007.<br> <p> Looking at the initial commits to support r500 in the -ati driver, I think the only time it would ever call int10 is if the card was entirely uninitialised. -radeonhd would do exactly the same if it was configured without support for doing Atom-based init.<br> </div> Sun, 26 Jun 2022 19:40:50 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899101/ https://lwn.net/Articles/899101/ Cyberax <div class="FormattedComment"> <font class="QuotedText">&gt; Not many people may remember this now, but the R200/(reverse-engineered)R300 driver also used to be blob-free. Strangely that stopped being the case after AMD took over, even though it was feature-complete.</font><br> <p> Because it made no sense to reimplement the critical power management and link training code multiple times, instead of doing it once in AtomBIOS. <br> </div> Sun, 26 Jun 2022 18:49:02 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899073/ https://lwn.net/Articles/899073/ flussence <div class="FormattedComment"> I&#x27;m still a bit distrustful of them after what they did to undermine the xf86-video-radeonhd project.<br> <p> Rumour has it that AMD middle management wanted the FOSS option they were due to announce to be kept slightly inferior to fglrx for Reasons, and having an independent effort that didn&#x27;t depend on firmware blobs or the decrepit x86-only int10 VBIOS like that did was severely embarrassing them.<br> <p> Not many people may remember this now, but the R200/(reverse-engineered)R300 driver also used to be blob-free. Strangely that stopped being the case after AMD took over, even though it was feature-complete.<br> </div> Sun, 26 Jun 2022 10:00:23 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899074/ https://lwn.net/Articles/899074/ bluca <div class="FormattedComment"> The TDM provisions in the copyright directive apply only for legally accessible corpora. A stolen body of code cannot be data mined legally just because it&#x27;s available, it needs to be legally available.<br> </div> Sun, 26 Jun 2022 09:46:07 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899072/ https://lwn.net/Articles/899072/ gspr <div class="FormattedComment"> <font class="QuotedText">&gt; It&#x27;s not clear to me whether this claim is actually correct. A model is ultimately &quot;just&quot; a big bag of statistical information, and I honestly don&#x27;t know whether (US) copyright law attaches to such things in the first place, but I&#x27;m skeptical (see e.g. Feist v. Rural).</font><br> <p> Surely it does apply in one extreme, namely that of a really good model! If I take a copyrighted picture and create a model that very accurately reproduces said picture, I don&#x27;t think it&#x27;s very unlikely that my model runs afoul of the original&#x27;s copyright.<br> <p> In the other extreme—that of a really terrible model—it probably doesn&#x27;t, but we probably shouldn&#x27;t write off the models in-between those extremes.<br> </div> Sun, 26 Jun 2022 09:40:48 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899068/ https://lwn.net/Articles/899068/ gdt <p>The deeper point about differing copyright laws is not so much interpretation of copyright licenses, but the distinction between "fair use" and "fair dealing".</p> <p>Copilot claims its actions are fair use, and therefore the license is irrelevant. However in fair dealing jurisdictions Copilot's use of the program source must either meet the copyright license or one of the black-letter list of allowed uses in the fair dealing exceptions of that juridiction's copyright law.</p> Sun, 26 Jun 2022 03:22:32 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899063/ https://lwn.net/Articles/899063/ Cyberax <div class="FormattedComment"> ATI had closed the driver for R300 video cards, even though the driver for R200 was open. The opened up only after being acquired by AMD.<br> </div> Sun, 26 Jun 2022 00:23:15 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899062/ https://lwn.net/Articles/899062/ salimma <div class="FormattedComment"> I&#x27;m surprised Mercurial is not mentioned. Sourcehut supports it, and GitHub doesn&#x27;t, so merely by using it you set up some barrier against people accidentally uploading a fork to GH.<br> <p> It will probably also curtail your network effect even further though.<br> </div> Sun, 26 Jun 2022 00:11:11 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899060/ https://lwn.net/Articles/899060/ salimma <div class="FormattedComment"> <font class="QuotedText">&gt; The GPL2 didn&#x27;t make nVidia *or* AMD play nice with Linux </font><br> <p> Any specific example for AMD here? They seem to be much better citizens when it comes to the GPL, at least compared to nVidia (and even nVidia is finally open sourcing kernel drivers)<br> </div> Sun, 26 Jun 2022 00:07:59 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899049/ https://lwn.net/Articles/899049/ Vipketsh <div class="FormattedComment"> I think rgmoore and NYKevin answered that question pretty convincingly in their replies to my comment above: if you believe that your code exists in something it shouldn&#x27;t you argue that and courts don&#x27;t care how it got to where it shouldn&#x27;t have, only if it is there or not. Perhaps I can put it another way: what the AI model produces doesn&#x27;t matter, only where that output ends up in.<br> <p> Even so, I still think that the legal status of the &quot;AI model&quot; warrants further introspection. When talking about open source code the &quot;AI model&quot; isn&#x27;t much of a consideration because at worst it stores code in some cryptic way that is available in a much more easier digested form. But that changes a whole lot if the training material is some propriety code stolen from somewhere. If your &quot;AI model&quot; is able to reproduce the stolen code verbatim (or sufficient parts for copyright to apply) and training of the &quot;AI model&quot; is &quot;give an exception on copyright rules when doing text and data mining, for any purpose&quot; (bluca&#x27;s words) that should mean that this trained &quot;AI model&quot; is fully legal and thus a legal distribution mechanism for the stolen code. Surely there is a legal principal to prevent things to work out this way ? At what point does an &quot;AI model&quot; turn into a &quot;distribution mechanism&quot; ?<br> <p> </div> Sat, 25 Jun 2022 21:27:31 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899038/ https://lwn.net/Articles/899038/ khim <font class="QuotedText">&gt; But that applies only if the broad and narrow ruling turn out the same way.</font> <p>Where does <b>that</b> idea comes from? Narrow ruling is used <b>precisely</b> to ensure the possibility of broad ruling (made in a different case by a different judge later) to proclaim the opposite outcome!</p> <font class="QuotedText">&gt; If the broad and narrow grounds for the ruling have opposite results, the judge has to go based on which one seems to be a better reading of the law and situation, not just on narrow versus broad.</font> <p>Narrow is almost always better. Because, well, it's narrow. It describes the situation more precisely. The only save you can have is to proclaim that narrow reading is so narrow it's not applicable to your case <b>at all</b>.</p> <p>That often happens with patents (judge is presented with half-dozen of patents which can be, theoretically, be treated as prior art and eliminate the patent completely, but 9 times out of 10 judge doesn't do that, but only just proclaim that yes, patent is still valid, just not applicable for your case).</p> <font class="QuotedText">&gt; That's the way these things usually go.</font> <p>I would say it's the way these things usually <b>don't go</b>. 99% of time decision doesn't reach high enough courts to decide anything definitively. Usually it takes dozens of cases and decades of litigation for that to happen.</p> Sat, 25 Jun 2022 17:05:33 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899029/ https://lwn.net/Articles/899029/ eduperez <div class="FormattedComment"> <font class="QuotedText">&gt; Training of the model is not done under the terms of any license. It&#x27;s done under the copyright directive laws, which give an exception on copyright rules when doing text and data mining, for any purpose. I don&#x27;t see how there&#x27;s any &#x27;legal mush&#x27;.</font><br> <p> I do not think anybody is arguing about the model training, but the output from the model once it has been trained.<br> <p> If the model produces an output that is a verbatim copy of some GPL&#x27;d code (see <a rel="nofollow" href="https://twitter.com/mitsuhiko/status/1410886329924194309">https://twitter.com/mitsuhiko/status/1410886329924194309</a> for an example), is that code free now, just because it was produced by some AI? Or is it still protected by the GPL, because it is a derivative work? When can we consider that the output has been produced by the AI, and when can we consider it is still a derived work? This is the legal mush.<br> </div> Sat, 25 Jun 2022 14:23:39 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899027/ https://lwn.net/Articles/899027/ Wol <div class="FormattedComment"> <font class="QuotedText">&gt; Quite. Also, many putative authorities on the GPL seem to forget that there are many legal systems.</font><br> <p> And far too many authorities read what they want to see, not what&#x27;s actually there. I&#x27;ve sure been guilty of that. I think my knowledge of the GPL now is pretty good, precisely because I&#x27;ve had plenty of people call me out on my mistakes.<br> <p> How many &quot;experts&quot; have NOT been through that learning experience? The majority of them?<br> <p> Cheers,<br> Wol<br> </div> Sat, 25 Jun 2022 11:47:47 +0000 Not trained on MS closed source https://lwn.net/Articles/899016/ https://lwn.net/Articles/899016/ bluca <div class="FormattedComment"> What lawsuits?<br> </div> Sat, 25 Jun 2022 09:18:09 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899013/ https://lwn.net/Articles/899013/ NYKevin <div class="FormattedComment"> Clarification: Copyright is path-independent, provided that a path actually exists. In theory, you do have to show that the defendant had access to the copyrighted work, and that this somehow affected the contents of the allegedly infringing work. But in practice, this is a very weak requirement. You don&#x27;t have to make specific claims about exactly what the defendant did or the path they took from A to B. You just have to show that the path probably exists. OTOH, if the defendant can plausibly argue lack of access (e.g. &quot;Your code was proprietary, and I couldn&#x27;t read it even if I wanted to&quot;), then that is a valid defense to copyright infringement (provided, of course, that the finder of fact believes this argument).<br> </div> Sat, 25 Jun 2022 06:00:00 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899006/ https://lwn.net/Articles/899006/ gerdesj <div class="FormattedComment"> &quot;A lot of engineers have very strange ideas about how the GPL works.&quot;<br> <p> Quite. Also, many putative authorities on the GPL seem to forget that there are many legal systems. If you are going to dive in and be authoritative on he GPL then you really should present an argument that works for all legal systems that the GPL attempts to work within. Quite a job! <br> <p> Legal is as legal does: Some legal systems have a concept of &quot;reasonable&quot; or what a &quot;reasonable&quot; person would do and I think that is what the GPL is riffing off. There&#x27;s also the concept of being able to &quot;quietly enjoy [something]&quot;. I&#x27;m a Brit. so my local legal system informs my knowledge here. Not all legal systems work like that. <br> <p> I think it is fair to say that we all have strange ideas about how the GPL works. There&#x27;s no need to call out end users.<br> </div> Sat, 25 Jun 2022 01:00:53 +0000 DeVault: GitHub Copilot and open source laundering https://lwn.net/Articles/899002/ https://lwn.net/Articles/899002/ mathstuf <div class="FormattedComment"> <font class="QuotedText">&gt; What was that about the ?Pennsylvania legislature trying to define &quot;pi = 3&quot;?</font><br> <p> Hey, PA has its problems. But this one came from Indiana. Luckily there was a school teacher that was there to help teach some math.<br> <p> <a href="https://www.straightdope.com/21341975/did-a-state-legislature-once-pass-a-law-saying-pi-equals-3">https://www.straightdope.com/21341975/did-a-state-legisla...</a><br> </div> Fri, 24 Jun 2022 22:52:28 +0000 Not trained on MS closed source https://lwn.net/Articles/899001/ https://lwn.net/Articles/899001/ glenn <div class="FormattedComment"> Maybe we can take Copilot seriously once Microsoft pledges to indemnify their users from lawsuits related to the code Copilot generates.<br> </div> Fri, 24 Jun 2022 22:46:49 +0000