I mean, no, why would it be? There is so, so much to talk about in programming other than AI. Meanwhile, the current HN front page feels like 90% LLM spam: the complete antithesis of what I used to come here for.
I personally can’t wait for no-ai communities to proliferate.
I genuinely believe this. Even if you're inventing a new algorithm it is better to describe the algorithm in English and have AI do the implementation.
So many insecure AI boosters in the comments slandering and mocking the author. And yet the upvotes clearly indicate that the sentiment in the article resonates widely with the community.
Well, there’s not much of a point leaving a comment saying “yes, this, exactly this,” so I’ll leave one here on behalf of my fellow lurkers.
The more AI gets shoved down my throat, the less I’m inclined to use it for anything, and the more I’m inspired to write my own writing, make my own art, and create my own code — with great creative joy and burning anger. Enjoy your 1000x productivity gains and your inevitable burnout as you downskill to a glorified inference loop.
The most charitable explanation is that they are concerned about their own privacy or identifiability,Bl but ultimately it is a Dick Move™ to other participants.
It's the kind of late-edit things that spurs me to include quotes in replies.
Traditionally, large corporations have taken very conservative legal stances with regard to integrating e.g. A/GPL code, even when there's almost no risk.
If my license explicitly says "any LLM output trained on this code is legally tainted," I feel like BigAICorp would be foolish to ignore it. Maybe I couldn't sue them today, but are they confident this will remain the case 5, 10, 20 years from now? Everywhere in the world?
Github has posted that they will now train on everyone's data (even private) unless you opt out (until they change their mind on that). Anthropic has been training on your data on certain tiers already. Meta bittorrented books to train their models.
Surely if your license says "LLM output trained on this code is legally tainted", it is going to dissuade them.
> I've been looking for a copy-left "source available" license that allows me to distribute code openly but has a clause that says "if you would like to use these sources to train an LLM, please contact me and we'll work something out". I haven't yet found that.
Personally, I want a viral (GPL-style) license that explicitly prohibits use of code for LLM training/tuning purposes — with the asterisk that while current law might view LLM training as fair use, this may not be the case forever, and blatant disregard of the terms of the license should make it easier for me to sue offenders in the future.
Alternatively, this could be expressed as: the output of any LLM trained on this code must retain this license.
Empathy hijacking. If the chatbots framed their responses as “beep boop, I’m a robot, here’s an estimated answer to your query” then we likely wouldn’t have this problem.
reply