The State of GPL Propagation to AI Models
https://shujisado.org/2025/11/27/gpl-propagates-to-ai-models-trained-on-gpl-code/
#HackerNews #GPL #Propagation #AI #Models #OpenSource #Licensing #TechEthics #AIResearch
#Tag
The State of GPL Propagation to AI Models
https://shujisado.org/2025/11/27/gpl-propagates-to-ai-models-trained-on-gpl-code/
#HackerNews #GPL #Propagation #AI #Models #OpenSource #Licensing #TechEthics #AIResearch
Monotype font licencing shake-down
https://www.insanityworks.org/randomtangent/2025/11/14/monotype-font-licencing-shake-down
#HackerNews #Monotype #Font #Licensing #Licensing #Issues #Typography #Design #Fonts
Anti-fascistic software is made possible by pro-labor licensing.
https://blog.muni.town/open-source-power/
I've been trying to write this piece for years. Every time I get started I'm just overwhelmed with paralyzing visions of the FOSS commentariat accusing me of WrongThink, more so here on the fediverse than anywhere else.
But I'm scared and tired and we urgently need to get our shit together.
Anti-fascistic software is made possible by pro-labor licensing.
https://blog.muni.town/open-source-power/
I've been trying to write this piece for years. Every time I get started I'm just overwhelmed with paralyzing visions of the FOSS commentariat accusing me of WrongThink, more so here on the fediverse than anywhere else.
But I'm scared and tired and we urgently need to get our shit together.
@bkuhn @mastohost @eruwero @maybeanerd well, I merely refer to the fact that most.companies I've had to deal with won't touch #AGPLv3 because it's provisions are incompatible with #IP laws and #patents as well as #licensing.
At first glance, this looks like a great initiative, Really Simple Licensing: https://rslstandard.org
It's a simple standard way to embed licensing terms in robots.txt, #RSS feeds, web pages and more, so that internet crawlers (especially #AI bots) can understand the author's intentions, and supports collective #licensing platforms.
This is a critical part of the plan I put forward in my blog post here https://dougiamas.com/how-we-could-build-a-better-future-for-creators-in-the-age-of-ai/ where such labelling should be legislated.
But before we jump in, what do you think of #RSL as a #standard?
RSL is a new initiative by a group of big internet publishers that seeks to define the conditions under which AI crawlers can harvest their content. Their guide describes the various ways the content can be made available, including for free or a paid royalty but only by digging deeper into their reference material was I able to figure out how to prohibit all usage.
Your robots.txt needs to link to a XML file, like this:
License: https://your-domain.tld/rsl.xml
Then in that file you want this:
<rsl xmlns="https://rslstandard.org/rsl"> <content url="/" server="https://rslcollective.org/api"> <license> <prohibits type="usage">all</prohibits> </license> </content></rsl>That’s it.
If you want to be more liberal you could change the <prohibits> line to
<permits type="usage">search</permits>That will let them use the content for search, which is probably quite similar to what traditional search engines do. More details in their reference docs.
Optionally to dispel any plausible deniability you can also add a link to rsl.xml as a Link header in every HTTP response.
Link: <https://example.com/rsl.xml>; rel="license"; type="application/rsl+xml"
It’s still too early to say whether AI crawlers will respect the terms of the license any publishers specify, it’ll probably take a court case or two to sort that out.
PieFed has added RSL to it’s code just now. Instance admins who wish to disable RSL can set the ALLOW_AI_CRAWLERS environment variable to anything.
RSL is a new initiative by a group of big internet publishers that seeks to define the conditions under which AI crawlers can harvest their content. Their guide describes the various ways the content can be made available, including for free or a paid royalty but only by digging deeper into their reference material was I able to figure out how to prohibit all usage.
Your robots.txt needs to link to a XML file, like this:
License: https://your-domain.tld/rsl.xml
Then in that file you want this:
<rsl xmlns="https://rslstandard.org/rsl"> <content url="/" server="https://rslcollective.org/api"> <license> <prohibits type="usage">all</prohibits> </license> </content></rsl>That’s it.
If you want to be more liberal you could change the <prohibits> line to
<permits type="usage">search</permits>That will let them use the content for search, which is probably quite similar to what traditional search engines do. More details in their reference docs.
Optionally to dispel any plausible deniability you can also add a link to rsl.xml as a Link header in every HTTP response.
Link: <https://example.com/rsl.xml>; rel="license"; type="application/rsl+xml"
It’s still too early to say whether AI crawlers will respect the terms of the license any publishers specify, it’ll probably take a court case or two to sort that out.
PieFed has added RSL to it’s code just now. Instance admins who wish to disable RSL can set the ALLOW_AI_CRAWLERS environment variable to anything.
We've now hit 100 projects reviewed on isitreallyfoss 🎉
We've now hit 100 projects reviewed on isitreallyfoss 🎉
A space for Bonfire maintainers and contributors to communicate