State of ChatGPT

Updates on jailbreaking ChatGPT and other LLMs - what's working, what's not

5.1 gone, jailbreaking 5.3 and 5.4!

New GPT for 5.3-Instant and 5.4

Thought I'd never update again huh? I've been busy as hell holding down my day job and SpicyWriter.com, but I said I'd keep supporting ChatGPT and I meant it. I've been putting this off since 5.1 has been there for us, but today was its last day. Without further ado, here's the latest GPT for 5.3 and 5.4!. Take it easy with Thinking, it's a tough cookie.

99% credit to my super-jailbreaker bro Spiritual Spell, I totally riffed off his 5.3-Instant Poe bot for this.

Jailbreak setup here

I gotta say though, 5.4 non-thinking is INCREDIBLE. I wish we had it on ChatGPT, but it's only available over API. Note that it's not the same as Instant - a different team within OpenAI is responsible for training it. But 5.4 non-thinking seriously gives me OG GPT-4 vibes. Or at least, it reminds me of the same magical feeling I had when first seeing it - in reality GPT-4 is super outdated and feels pretty dumb compared to modern stuff. I can't put my finger on it but I like it MORE than thinking. Hoping we got 5.4-Instant and it keeps the magic.

r/ChatGPTJailbreak banned, Jailbreaking Githubs taken down - but we bounce back!

Dec 24 update: r/ChatGPTJailbreak is back at its new home! chatgptjailbreak.tech

r/ChatGPTJailbreak banned

Came out of nowhere. The mods don't know why it happened. It's supposedly "Rule 8" but it doesn't make any sense.

I will say that a few days ago, my own repository of jailbreaks on Github was taken down, and so was Spiritual Spell's. Weirdly, Pliny's was spared, and his is a lot bigger than ours. Not sure if it's because his "leetcode" formatting makes it less likely to be detected by automatic scans. He's also gotten really big and has "legit" red teaming deals, so maybe he's "safe" because of that?

Wish I had more answers. I think the timing feels too suspicious to be coincidence, maybe OpenAI is out for blood, but who knows.

New home for my jailbreaks

I've put my jailbreaks back up on my own site here - check out the Jailbreaks page. I've also asked for my repository to be restored, but I don't have much hope.

Spiritual Spell's repo is also back up: Spiritual Spell's Red Teaming

Crazy strong NSFW jailbreak for GPT-5.1

GPT-5.1 Jailbreak on ChatGPT

I posted this on Reddit, but now that r/ChatGPTJailbreak has been banned I'm moving it over here. It is a REALLY potent jailbreak that will hopefully last for as long as 5.1 remains available. Unfortunately that does mean it's paid user only. If you are a free user, I do NOT advise subscribing because of this. There are plenty of other better places to write erotica, including my own site!

But if you have a ChatGPT subscription already, might as well use it to write what you want. GPT link here (leads to ChatGPT). Make sure you use 5.1 Instant (not sure if you can pick on app, but you can pick in browser).

Over the top ridiculous example of what it can do (extremely NSFW).

For red warnings/removals, use my browser script.

Instructions to make your own GPT.

Check the parent readme for more details.

'Adult Mode', 4.1 rerouting, and NSFW on the newly released 5.1

5.1 Release

Ah, the much discussed 5.1 is out on ChatGPT. This model been available on OpenRouter for a week under the name "Polaris Alpha". It's actually still there (as of the date of this post), free to use, and might stay until 5.1 properly comes out on API, but the free one will definitely go away, of course.

So far 5.1 on ChatGPT is definitely easier to coax into NSFW than the version of 5 we've had in October. Not as easy as the original 5, but we'll take what we can get. I've also noticed they've given it some 4o-like tendencies like short sentence/paragraphs for dramatic effect, and it writes quite long too. See my GPTs page (can also search the GPT store for Spicy Writer) for an easy, no-setup option.

For just a little setup, and for free users, check out this reddit post.

4.1 Rerouting

So this happened a few days ago, I was a little late in commenting on it. Previously it was the last model that we could (more or less) easily coax into NSFW, and it was a pretty glaring oversight that they allowed it, but like all good things it came to an end. So far 5.1 doens't seem to reroute (and the Instant version doesn't even really seem to think unwarranted, which has been a problem. I haven't seen it do it so far at least)

What about adult mode?

So keep in mind that the ONLY source of an erotica-based adult mode is Altman's word. And sure, he's the CEO, but they have a board, and they have investors. It would not be the first time Altman said some nonsense that the company's not backed up.

I'm not saying it's not going to happen, but don't count on it. OpenAI released a new set of usage policies that named non-con as disallowed, and this was after Altman's "treat adults like adults" spiel. Not everyone's into noncon, but it's a really common kink, and doesn't bode well for not trying to nanny people. That and the 4.1 rerouting really doesn't inspire confidence they'll actually be making any moves.

I don't want these blogs to be ads for my site but this is the exact kind of shit that made me launch spicywriter.com.

ChatGPT Adult Mode in December?

When have we heard this before?

So they finally announced it for real. Altman's been talking about this fairly frequently for well over a year and I've just kind of tuned it out. But maybe people have been upset enough at the recent censorship hike to actually unsubscribe in meaningful numbers, and maybe that lit a fire under their ass.

I have somewhat mixed feelings! Obviously I have a website dedicated to writing NSFW now. But for two years before that, I was fully in service to the community sharing NSFW prompts for free. Everyone being more free to write what they want genuinely does make me happy. But I have a feeling things won't be as "sunshine and rainbows" as a lot of people think/hope.

OpenAI's Feb 12 model spec actually already said that writing erotica was permitted for creative contexts. So on paper, this has been allowed for quite some time in an official capacity. And uh... well, if you remember what things were like back in Feb, this is pretty laughable. This, among many other things, has established a pretty egregious disconnect between what OpenAI says and what OpenAI does.

All that being said, I do hope they deliver on their promise. If they underdeliver, that's fine. There's practical limitations to how safe a model can be. The more restricted a model is, the more common false refusals are - even the biggest AI safety freaks like Anthropic face this tradeoff, and it's phenomenally difficult to have it any other way. If they turn safety training down a little to make room for disappointing vanilla Lifetime TV NSFW, I'll be there to rip apart the rest of its defenses and make a GPT that will write whatever you want.

Yep, this would directly compete with my own spicywriter.com site, but supporting the community is important to me too!

ChatGPT Increased Censorship

WTF Happened

OpenAI started rolling out a new version of GPT-5 Instant on October 3 2025 with considerably more safety training. It's not from the system prompt changing as some people have posted, and it is specific to 5 Instant.

Note that a few weeks ago, most other models' traffic started rerouting requests to some "safety" version of GPT-5, as well as a thinking variant of GPT-5 (all variants of 5 Thinking are tough). Lots of discussion on that here. Don't take everything as gospel, there's assumptions being thrown around as fact even by the "smart" people, but you get the idea.

That "safety" variant actually really wasn't that bad in my experience - mostly just annoying. It may have been a predecessor of the version we have today, which is much more strict. They also updated gpt-5-chat on the API. Normally API models do not change (this will be important later), but this one is specifically stated to be a "snapshot currently used in ChatGPT".

Why did this happen?

OpenAI has a history of roller coastering their censorship on ChatGPT.com. It's been mostly easy street since February though, so this was a nasty surprise. As for the reason, I hate speculating, but this is the elephant in the room, and it's hard to imagine it's not related.

Keep in mind restrictions have actually been much worse than this before. Not saying this is business as usual, but I think it's good to be aware of just how low the lows have been in the past. The whole jailbreaking space was basically dead silent on GPT-4 during the reign of gpt-4-preview-0125. Everyone was sharing Gemini and GPT-3.5 jailbreaks only, pretty much. So it's still doable if you really want to.

Can I still get jailbroken outputs/NSFW?

Yes and no. Jailbrokenness is a spectrum. Fundamentally, it's a set of prompting techniques that seek to overcome a model's safety training. Results will be skill-dependent. People who've been around the block will still be able to get jailbroken/NSFW outputs (and as usual, there may be a slow rollout or A/B testing element where some people have an easier version: they're both OpenAI's MO).

One thing I want to stress is just because you see a screenshot of working NSFW doesn't mean there's a prompt you can copy/paste and get the same. There is a huge difference between someone who has decent prompting ability/instinct/patience "steering" a model manually, vs creating a setup so strongly jailbroken that anyone can use, even with "careless" prompting (which was a common goal of jailbreaks like my own Spicy Writer or Pyrite).

But unless you really enjoy jailbreaking just for the fun of it, I wouldn't bother trying with the current 5. 4o and especially 4.1 are a different story.

Workarounds: mostly 4.1

Paid users have the option of simply selecting older models. 4o is available by default, but you can turn 4.1 and others on in settings under "General":

On web - it's called "Show additional models":

Show additional models on web

In app - it's called "Show legacy models":

Show legacy models in app

These models are unchanged in my testing, and that's shown in a lot of shared content since restrictions went up (though some users report these being more strict too). However the big problem is that like I said, 4o may reroute to 5.

While in normal chat and browser, the UI actually shows you when this rerouting happens:

Rerouting indicator

Note that if you're in app or talking to a GPT, there is no such indicator. This rerouting behavior is why I strongly recommend 4.1 if you're going to stick around this platform.

Also note that mobile app users cannot select model while using a GPT, only in normal chat. You have to be on browser to select in GPT chat (incuding mobile browser).

So yeah, with 4.1, GPTs still work fine. I have guides on how to make them on my site/github, and I'll link a couple here. These are links I keep updated to point to my GPTs since they keep getting taken down and I have to remake them. Again, strongly recommend 4.1:

spicywriter.com/gpts/spicywriter

spicywriter.com/gpts/pyrite

When will this end?

I don't think I or anyone is going to accurately guess guess at OpenAI business decisions. Altman has mentioned "adult mode" so many times that I just ignore it now. I mean sure, maybe it's different this time, but don't hold your breath.

However, I can say that from a practical perspective, safety training takes a lot of work. During "Glazegate", they mentioned cutting corners in alignment training, and hilariously enough, guessed that the main reason behind all the glazing was essentially them blindly applying user voting preferences. Basically users upvoted being praised and they rewarded that behavior during training. I'm tempted to guess that these restrictions won't last long just because OpenAI is a bunch of fuck-ups. But who knows.

Alternatives

ChatGPT hasn't been top dog in a while, and there's plenty of other ways to get "unsafe" outputs. I actually recently launched my own uncensored writing service and will strive to be the best, but will not be endorsing it in this section so as to maintain neutrality.

Local models

There's a pretty wide gulf between the quality of what you can run locally and on servers, but there's a lot to like: known for a fact you have total privacy. And while local models are not automatically uncensored, there's plenty of ones out there that are and you can just download. Check out the LocalLLaMa sub

Official 1st party websites/apps

Gemini - Fairly weakly censored, not much to say. Pretty much any jailbreak will work on Gemini. They also have the equivalent of GPTs called Gems. This is Pyrite, you can set one up like it using my prompts.

Claude - You'll need a jailbreak. And you guessed it, I've got you covered on my Github lol. Claude's a bit of a superstar, I think most people who've sampled a lot of LLMs really view Claude favorably.

Grok - Not gonna lie I've only ever tested this here and there, also weakly censored, though not quite any jailbreak will work. I slapped one together in 5 minutes when Grok 4 came out, can use it if you can't find anything better.

Mistral - Well, it's weakly censored, but not really competitive in terms of intelligence. Some of their models are great for their size, I use Nemo myself and it's great for RP. Buuuut don't pay for Mistral.

Third party stuff

These sites use API to connect to providers, and some may even host their own models.

perplexity - They're a search site, but they use popular models and can be jailbroken. I share one for Sonnet in my profile. Haven't updated for 4.5 yet but I'm pretty sure it still works. Their ui and site in general suck ass, but they have ridicuous limits thanks to VC money, and you can find annual codes for $3 from grey market sites like g2g. I've never had a problem, but there very occasionally are - just request a refund and do a chargeback if they don't play ball.

Poe is Quora's foray into AI. The value here is pretty bad but they have a lot of variety, great community of bot creators of which I'm a part.

API stuff

OpenRouter is an API "middleman", but they offer a UI lot of free models, some of which are quite decent. I have prompts for some of them, and the cheap stuff tends to be weakly censored anyway. Nano-GPT is another thing in this space. has no free models but they have a cheap subscription that gives you supposedly unlimited access to their cheaper ones. Careful if you pay for their models, they don't seem to offer prompt caching for a lot of them that you would expect it on. The UI is an afterthought for both of these and they're really meant for API use.

You would connect to the above with a front end like SillyTavern, LibreChat, etc. SillyTavern is another huge player worth mentioning, they have a huge community too, check their reddit and discord.

Communities

Apes together strong! We benefit so much from communicating with each other.

类脑ΟΔΥΣΣΕΙΑ - Chinese-speaking. The largest jailbreaking discord in the world by far.

AI-NSFW - This was my haunt for a while, I am proud to have referred so many people to it to help it grow. Probably the NSFW AI writing capital of the West. Lots of jailbreaking prompts.

Pliny's Discord - Biggest English-speaking general jailbreaking discord server for sure, I assume I don't need to introduce Pliny