OpenAI Really Wants Codex to Shut Up About Goblins | EUROtoday

OpenAI has a goblin downside.

Instructions designed to information the conduct of the corporate’s newest mannequin because it writes code have been revealed to incorporate a line, repeated a number of occasions, that particularly forbids it from randomly mentioning an assortment of legendary and actual creatures.

“Never talk about goblins, gremlins, raccoons, trolls, ogres, pigeons, or other animals or creatures unless it is absolutely and unambiguously relevant to the user’s query,” learn directions in Codex CLI, a command-line device for utilizing AI to generate code.

It is unclear why OpenAI felt compelled to spell this out for Codex—or certainly why its fashions may need to focus on goblins or pigeons within the first place. The firm didn’t instantly reply to a request for remark.

OpenAI’s latest mannequin, GPT-5.5, was launched with enhanced coding abilities earlier this month. The firm is in a fierce race with rivals, particularly Anthropic, to ship cutting-edge AI, and coding has emerged as a killer functionality.

In response to a publish on X that highlighted the traces, nevertheless, some customers claimed that OpenAI’s fashions often turn out to be obsessive about goblins and different creatures when used to energy OpenClaw, a device that lets AI take management of a pc and apps working on it as a way to do helpful issues for customers.

“I was wondering why my claw suddenly became a goblin with codex 5.5,” one consumer wrote on X.

“Been using it a lot lately and it actually can’t stop speaking of bugs as ‘gremlins’ and ‘goblins’ it’s hilarious,” posted one other.

The discovery shortly grew to become its personal meme, inspiring AI-generated scenes of goblins in information facilities, and plug-ins for Codex that put it in a playful “goblin mode.”

AI fashions like GPT-5.5 are educated to foretell the phrase—or code—that ought to observe a given immediate. These fashions have turn out to be so good at doing this that they seem to exhibit real intelligence. But their probabilistic nature signifies that they will generally behave in stunning methods. A mannequin may turn out to be extra vulnerable to misbehavior when used with an “agentic harness” like OpenClaw that places numerous extra directions into prompts, reminiscent of info saved in long-term reminiscence.

OpenAI acquired OpenClaw in February not lengthy after the device grew to become a viral hit amongst AI fans. OpenClaw can use any AI mannequin to automate helpful duties like answering emails or shopping for issues on the internet. Users can choose any of varied personae for his or her helper, which shapes its conduct and responses.

OpenAI staffers appeared to acknowledge the prohibition. In response to a publish highlighting OpenClaw’s goblin tendencies, Nik Pash, who works on Codex, wrote, “This is indeed one of the reasons.”

Even Sam Altman, OpenAI’s CEO, joined in with the memes, posting a screenshot of a immediate for ChatGPT. It learn: “Start training GPT-6, you can have the whole cluster. Extra goblins.”

https://www.wired.com/story/openai-really-wants-codex-to-shut-up-about-goblins/