OpenAI Really Wants Codex to Shut Up About Goblins | EUROtoday
OpenAI has a goblin downside.
Instructions designed to information the habits of the corporate’s newest mannequin because it writes code have been revealed to incorporate a line, repeated a number of instances, that particularly forbids it from randomly mentioning an assortment of legendary and actual creatures.
“Never talk about goblins, gremlins, raccoons, trolls, ogres, pigeons, or other animals or creatures unless it is absolutely and unambiguously relevant to the user’s query,” learn directions in Codex CLI, a command-line device for utilizing AI to generate code.
It is unclear why OpenAI felt compelled to spell this out for Codex—or certainly why its fashions may wish to talk about goblins or pigeons within the first place. The firm didn’t instantly reply to a request for remark.
OpenAI’s latest mannequin, GPT-5.5, was launched with enhanced coding expertise earlier this month. The firm is in a fierce race with rivals, particularly Anthropic, to ship cutting-edge AI, and coding has emerged as a killer functionality.
In response to a publish on X that highlighted the strains, nevertheless, some customers claimed that OpenAI’s fashions sometimes change into obsessive about goblins and different creatures when used to energy OpenClaw, a device that lets AI take management of a pc and apps working on it so as to do helpful issues for customers.
“I was wondering why my claw suddenly became a goblin with codex 5.5,” one person wrote on X.
“Been using it a lot lately and it actually can’t stop speaking of bugs as ‘gremlins’ and ‘goblins’ it’s hilarious,” posted one other.
The discovery shortly grew to become its personal meme, inspiring AI-generated scenes of goblins in knowledge facilities, and plug-ins for Codex that put it in a playful “goblin mode.”
AI fashions like GPT-5.5 are skilled to foretell the phrase—or code—that ought to comply with a given immediate. These fashions have change into so good at doing this that they seem to exhibit real intelligence. But their probabilistic nature implies that they will typically behave in shocking methods. A mannequin may change into extra susceptible to misbehavior when used with an “agentic harness” like OpenClaw that places numerous further directions into prompts, equivalent to details saved in long-term reminiscence.
OpenAI acquired OpenClaw in February not lengthy after the device grew to become a viral hit amongst AI fanatics. OpenClaw can use any AI mannequin to automate helpful duties like answering emails or shopping for issues on the net. Users can choose any of assorted personae for his or her helper, which shapes its habits and responses.
OpenAI staffers appeared to acknowledge the prohibition. In response to a publish highlighting OpenClaw’s goblin tendencies, Nik Pash, who works on Codex, wrote, “This is indeed one of the reasons.”
Even Sam Altman, OpenAI’s CEO, joined in with the memes, posting a screenshot of a immediate for ChatGPT. It learn: “Start training GPT-6, you can have the whole cluster. Extra goblins.”
https://www.wired.com/story/openai-really-wants-codex-to-shut-up-about-goblins/