I wonder if we should have #GPL 4 that would cover machine learning. Something like "if this code is used to train an LLM, then the code produced by the LLM must be released under the same license". I know there are many challenges, such as effective enforcement, but if this issue remains unaddressed, I believe LLMs may become a way to evade license virality.
Ondřej Caletka reshared this.
James Just James
in reply to Jiří Eischmann • • •I kind of agree, but at the same time, we aren't actually enforcing the copyright laws we already have, because OpenAI isn't actually open, and in fact, is worth so much money that lawmakers are kind of letting them do whatever they want.
It's really about corruption, and not a lack of laws or GPL version IMHO.
Vadim Rutkovsky
in reply to Jiří Eischmann • • •Petr Ferschmann
in reply to Jiří Eischmann • • •I don't agree. We need to train LLM on something legally and open source will benefit from it aswell. And is big difference between using code directly and thru LLM.
The main problem with LLM is that they are using unlicensed documents to train (eg books)
Jiří Eischmann
in reply to Petr Ferschmann • • •Petr Ferschmann
in reply to Jiří Eischmann • • •Mormegil
in reply to Jiří Eischmann • • •Jiří Eischmann
in reply to Mormegil • • •Miroslav Suchý
in reply to Jiří Eischmann • • •AGRO TURBO.EXE SATAN 🇺🇦🇨🇿
in reply to Miroslav Suchý • • •@mirek that is very curious, i'm wondering how familiar the lawyers were with the actual technology.
i don't use llms too much, so i'm not familiar myself -- but to me it looks like some creative work goes into writing the prompt?
i don't use these tools too much, but just a few days ago i ended up using one and it turned out that a good way to avoid shit output is feeding in pseudocode or commanding it to make the very same sort of edits as one would otherwise make manually. it'd be surprised if someone considered this not a creative process?
Jiří Eischmann
in reply to AGRO TURBO.EXE SATAN 🇺🇦🇨🇿 • • •AGRO TURBO.EXE SATAN 🇺🇦🇨🇿
in reply to Jiří Eischmann • • •Glyph
in reply to Jiří Eischmann • • •Copyright licenses are not a magic spell. If LLMs are adjudicated to be derivative works of their inputs, no additional license is needed; existing GPL (or indeed, even Apache or MIT, given the lack of license text reproduction in LLM output) is fine. If LLMs are adjudicated to be fair use, no additional license will help.
Focusing on licensing is fighting the last war. Public communications and movement-building is what is needed now.