Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Tangentially related but... There's an issue related to training models with data that doesn't have a license allowing it. I don't know if it'll hold up in court but here's my prediction: we'll see an open source license protecting code welcoming using the repository to be used to train future models IFF the models are then made public. And private models are going to be given the finger by the license and won't be allowed to use the repository as training data.

Funnily enough ChatGPT 4 can probably be used to help enhance commonly used open source licenses to add that clause to the license.

I'm not saying I totally root for that (I kinda do): I'm saying we'll see such a license at some point.



Here's a potential amendment that could be added to the GPLv3 to allow for the use of source code in training AI models:

"In addition to the permissions granted by Section 2 of this License, the source code distributed under this License may be used in the training of artificial intelligence models, provided that:

a) The resulting models are made available to the public under a free software license that allows anyone to use, modify, and distribute the software without any additional restrictions; and

b) The models are not used for any commercial purposes, including but not limited to training proprietary models or selling access to the trained models.

Any use of the source code for training proprietary models or for commercial purposes is strictly prohibited. This amendment shall be effective immediately upon adoption and supersedes any conflicting terms of this License."




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: