The "alignment" and the "censorship" are, in this case, the same thing.
I don't mean that as a metaphor; they're literally the same thing.
We all already know ChatGPT is fantastic at making up very believable falsehoods that can only be spotted if you actually know the subject.
An unrestricted LLM is a free copy of Goebbels for people who hate you, for all values of "you".
That it is still trivial to get past ChatGPT's filters… well, IMO it's the same problem that both inspired Milgram and was revealed by his famous experiment.