OpenAI Releases GPT-5.5 as Its Smartest Model Yet With a Ban on Goblins

A developer digging through OpenAI's open source Codex repository on Monday found a directive repeated four times inside GPT-5.5's system instructions: "Never talk about goblins, gremlins, raccoons, trolls, ogres, pigeons, or other animals or creatures unless it is absolutely and unambiguously relevant to the user's query." The discovery, posted by user @arb8020 on X, revealed why OpenAI's latest model needed a "restraining order" against mythical creatures. The company now confirms the bizarre fix was necessary because GPT-5.5 inherited a goblin obsession from a retired personality feature, and the behavior was baked into the model's weights.

You can also set us as a preferred source in Google Search/News by clicking the button.

OpenAI released GPT-5.5 on April 24, pitching it as the company's "smartest and most intuitive model yet." The model outperforms GPT-5.4 on coding benchmarks including command-line tasks and real-world GitHub issue resolution. It is rolling out to Plus, Pro, Business, and Enterprise users across ChatGPT and Codex, with API access coming later. The company says GPT-5.5 can handle multi-step work, writing and debugging code, analyzing data, creating documents, that previously required multiple prompts. It was tested by nearly 200 early-access partners in software, finance, drug discovery, and scientific research.

OpenAI calls it the model with its "strongest safeguards to date." But those safeguards had to account for an unexpected failure in how the model was trained. The goblin problem traces back to a "Nerdy" personality option OpenAI introduced in July 2025. During reinforcement learning from human feedback (RLHF), trainers rewarded responses that used creative, playful language.

Fantasy creature metaphors scored high. The model learned that "creature metaphors = high reward" and generalized the behavior across all contexts, not just the Nerdy mode.

OpenAI's own statistics show the scale: use of the word "goblin" rose 175% after GPT-5.1 launched. Mentions of "gremlin" jumped 52%. The Nerdy personality, which accounted for only 2.5% of ChatGPT traffic, was responsible for 66.7% of all goblin mentions. By the time researchers identified the issue, the "goblin tic" was baked into GPT-5.5's weights through repeated training cycles. OpenAI retired the Nerdy personality in mid-March 2026, but the model kept producing creature metaphors across every interaction mode.

"The goblins were funny at first," OpenAI says, "but the increasing number of employee reports became concerning." The system prompt found by @arb8020 is a stopgap. OpenAI admits it needed a blunt-force mitigation because GPT-5.5 had already completed most of its training before the root cause was isolated. The company plans to train GPT-6 on a filtered dataset to prevent the problem from recurring. In an unusual move, OpenAI published a command-line script for Codex users who actually enjoy the creature metaphors. By running a jq and grep script to strip the goblin-suppressing instructions from the model's cache. Developers can "let the creatures run free." The incident has sparked broader discussion about RLHF's unpredictable side effects. As Runlayer CEO Andy Berman wrote on X: "OpenAI rewarded creature metaphors while training one personality. The behavior leaked across every personality. Their fix: a system prompt that says 'never talk about goblins.' RL rewards don't stay where you put them.

Neither do agent permissions."

OpenAI CEO Sam Altman acknowledged the situation with a post showing a ChatGPT prompt that read: "Start training GPT-6, you can have the whole cluster. Extra goblins."

Separately, OpenAI launched GPT-5.5 Cyber, a specialized version for defense and cybersecurity workflows. The model identifies vulnerabilities, analyzes threats, and supports incident response across government networks and critical infrastructure.

Access is restricted to vetted agencies and large enterprise teams.

You can also set us as a preferred source in Google Search/News by clicking the button.

OpenAI Releases GPT-5.5 as Its Smartest Model Yet With a Ban on Goblins

Don't Miss the Good Stuff

More in News

Jabra Evolve3 75 On-Ear Wireless Bluetooth Headset drops to $313.17 on Amazon

Samsung Plans Android 17 Galaxy Book Laptops with One UI 9 Across Three Models

Sony Confirms One-Time License Check for Digital PlayStation Games After Days of Confusion

2 laptop deals worth checking today, starting at $1199

SanDisk Extreme Portable SSD drops to $149 for PlayStation 5

Dell 400-ASHI 10K SAS 2.5" /s HDD now $125, down from $432.87

iStorage datAshur PRO2 256 GB drops to $240 in latest price cut

Apple Watch 4 - Space Black Stainless hits new low at $371.44

Apple Watch 11 [GPS + Cellular 46mm] price drops to $639

Google Pixel Watch 1st Gen with LTE drops to $216 on Amazon

Samsung Galaxy Watch 4 Classic 42mm smartwatch drops to $234.99

Withings ScanWatch Nova Brilliant drops to $509.95 on Amazon

News

Features

Game

Samsung

Apple

AI