Like many others, I am frustrated by corporations that have ingested immense amounts of user generated content and are using/selling it to train LLMs1. However, deleting the content now is counter productive, let’s protest more effectively by building a better web.

Bad luck Brian meme format with 'When does a narwhal bacon?' on top and '*Comment has been deleted*' at the bottom
Bad Luck Brian answers important questions (via quickmeme.com)

Context and Concern

I’m seeing many influential people on the Fediverse urging others to go register their displeasure by deleting the knowledge they contributed via answers and comments. This is especially puzzling in the case of the current Stack Overflow and OpenAI partnership announcement that has caused blowback on Mastodon2 because these folks are tech savvy and should understand that deleting their contribution from showing up on the front-end does not delete the data. It’s a bit crazy making because I feel like I’m missing something here, am I?

Please don’t advise people to delete their existing answers. Not contributing new answers and not logging in/deleting your account are great ways to register protest but deleting data is self-defeating. These companies already have the data backed up and will use it anyway, deleting answers only hurts humans who are searching for the answers through traditional searches. Removing the information by deleting it or issuing GDPR requests only locks up the knowledge and makes it exclusive to the “AI”.

Potential Solutions

A dramatic analogy before we get to solutions… If a marauding force is razing your city and driving you away, you absolutely shouldn’t come back to help them rebuild the city where you can’t live anymore. But you also don’t burn your books in the fire on the way out. You take your books with and build a better city with fortified libraries!

Just in case it’s not clear, I’m not siding with the platforms here. Even if they have a legal right to the use the data as they see fit, they’re violating trust with the community by going against its wishes. And in the case of Stack Overflow all contributions are CC-BY-SA so anyone can use the data provided they follow the attribution requirement3.

Instead of letting a good crisis go to waste, let’s use this (as yet another) impetus to leave walled gardens and join community spaces. Instead of encouraging others to delete answers, direct them to community forums, write the answer on your own website and post a link to the platform (POSSE style), stop contributing further to closed platforms and liberate existing data by using export tools where available.

I hope we can learn these lessons from the on-going Enshittification and start taking back the web instead of furthering Enshittification by protesting it ineffectively.

Discussion: This site doesn’t have comments but happy to chat via this Mastodon post.


  1. Large Language Model, falls under the umbrella of the colloquial Artificial Intelligence (AI) terminology. ↩︎

  2. As of 2024-05-07, I don’t want to single out individual posters. ↩︎

  3. Citing attribution would be difficult at best and while I’m not a lawyer I can’t see how their response could help legally. ↩︎