Anthropic publishes the 'system prompts' that make Claude tick

Kyle Wiggers

Updated 27 August 2024 at 12:26 pm·3-min read

Generative AI models aren't actually humanlike. They have no intelligence or personality -- they're simply statistical systems predicting the likeliest next words in a sentence. But like interns at a tyrannical workplace, they do follow instructions without complaint -- including initial "system prompts" that prime the models with their basic qualities and what they should and shouldn't do.

Every generative AI vendor, from OpenAI to Anthropic, uses system prompts to prevent (or at least try to prevent) models from behaving badly, and to steer the general tone and sentiment of the models' replies. For instance, a prompt might tell a model it should be polite but never apologetic, or to be honest about the fact that it can't know everything.

But vendors usually keep system prompts close to the chest -- presumably for competitive reasons, but also perhaps because knowing the system prompt may suggest ways to circumvent it. The only way to expose GPT-4o's system prompt, for example, is through a prompt injection attack. And even then, the system's output can't be trusted completely.

However, Anthropic, in its continued effort to paint itself as a more ethical, transparent AI vendor, has published the system prompts for its latest models (Claude 3 Opus, Claude 3.5 Sonnet and Claude 3 Haiku) in the Claude iOS and Android apps and on the web.

Alex Albert, head of Anthropic's developer relations, said in a post on X that Anthropic plans to make this sort of disclosure a regular thing as it updates and fine-tunes its system prompts.

https://twitter.com/alexalbert__/status/1828107230656471442

The latest prompts, dated July 12, outline very clearly what the Claude models can't do -- e.g. "Claude cannot open URLs, links, or videos." Facial recognition is a big no-no; the system prompt for Claude Opus tells the model to "always respond as if it is completely face blind" and to "avoid identifying or naming any humans in [images]."

But the prompts also describe certain personality traits and characteristics -- traits and characteristics that Anthropic would have the Claude models exemplify.

The prompt for Claude 3 Opus, for instance, says that Claude is to appear as if it "[is] very smart and intellectually curious," and "enjoys hearing what humans think on an issue and engaging in discussion on a wide variety of topics." It also instructs Claude to treat controversial topics with impartiality and objectivity, providing "careful thoughts" and "clear information" -- and never to begin responses with the words "certainly" or "absolutely."

It's all a bit strange to this human, these system prompts, which are written like an actor in a stage play might write a character analysis sheet. The prompt for Opus ends with "Claude is now being connected with a human," which gives the impression that Claude is some sort of consciousness on the other end of the screen whose only purpose is to fulfill the whims of its human conversation partners.

But of course that's an illusion. If the prompts for Claude tell us anything, it's that without human guidance and hand-holding, these models are frighteningly blank slates.

With these new system prompt changelogs -- the first of their kind from a major AI vendor -- Anthropic is exerting pressure on competitors to publish the same. We'll have to see if the gambit works.

BuzzFeed
"It's A HUGE Insult": People Are Sharing The Things That Are Considered "Taboo" In Their Country But Are Completely Normal Elsewhere
"I'm Canadian and have been living in the US for a decade now. I will never get used to this."
The Independent
Monica Lewinsky reveals who she is voting for in the presidential election
The former White House intern urged her followers to get out and vote as America goes to the polls
CNN
Trump says he ‘shouldn’t have left’ the White House as he closes campaign with increasingly dark message
Donald Trump, who said in Pennsylvania on Sunday that he regrets leaving the White House in 2021, is ending the 2024 campaign the way he began it – dishing out a stew of violent, disparaging rhetoric and repeated warnings that he will not accept defeat if it comes.
BuzzFeed
5 Celebrities Who Endorsed Trump This Week, And 1 Celeb Who Renounced Their Trump Endorsement
This is certainly an eclectic group of folks.
InStyle
Beyoncé Went Topless for Her Second, Surprise Halloween Costume
Two is always better than one.
The Independent
Harrison Ford makes presidential endorsement days before 2024 election
The Indiana Jones actor took the opportunity to speak out despite ‘never really wanting to talk’ about politics
Yahoo Lifestyle
MKR judges 'threaten to quit' as new stars tipped to join 2025 season
EXCLUSIVE: Manu Feildel and Colin Fassnidge aren’t too happy with Channel Seven's plan for next year's season. Read more.
NewsWire
One thing these Aussie workers dread
We rely on them every day, but this cohort of iconic Aussie workers are suffering appalling attacks as they go about their day.
The Daily Beast
Rattled Trump Rages After Shock Iowa Poll Favors Harris: ‘Trump Hater’
Last minute polling out of Iowa appears to have rattled Donald Trump, who was initially projected to win the deep-red Hawkeye State. The GOP presidential nominee slammed unfavorable numbers for his campaign released Saturday and accused the pollster Ann Selzer, who is regarded as being highly accurate with last-minute polling in Iowa, of being a “Trump hater.” “No President has done more for FARMERS, and the Great State of Iowa, than Donald J. Trump. In fact, it’s not even close! All polls, exce
The Daily Beast
Frantic Aides Narrowly Stopped Trump Calling His Rival Appalling Slur
Donald Trump has long had a penchant for nicknaming his political adversaries, coining the phrases Sleepy Joe, Crooked Hillary, and Ron DeSanctimonious. But one Trump moniker for President Joe Biden allegedly went beyond the former president’s typical antagonism. The Republican presidential nominee, who has repeatedly referred to Biden as Sleepy Joe, Slow Joe, and Crooked Joe, wanted to add “Retarded Joe Biden” to his nickname arsenal, a new report by The Atlantic claims. “The guy’s a retard. He
Rolling Stone
Trump Boasts ‘Every Rally Is Full’ as Camera Immediately Pans to Empty Seats, People Leaving
The former president falsely claimed his rallies "do not have any seats that are empty"
The Independent
Trump appears to emulate ‘sex act’ on microphone after he melts down over technical difficulties
Viewers were stunned at the former president’s apparent gesture during his Milwaukee rally in Wisconsin
GOBankingRates
I’m a Mechanic: 9 Cars I Would Never Buy and Why They Aren’t Worth It
Consumers often consider the sticker price, features, and design when deciding which car to buy. Find Out: The 20 Cars Seeing the Biggest Price Drops in 2024 Discover More: 9 Things You Must Do...
HuffPost
'I'm Done!': Conservative Columnist Quits Washington Post After Livestream Meltdown
Hugh Hewitt removed his earpiece and stormed off a Post live show amid a discussion about Donald Trump's rhetoric.
HuffPost
Kamala Harris Jokes About The 1 Thing Trump Can't Do In Surprise 'SNL' Cameo
Harris appeared alongside Maya Rudolph, who plays the vice president on "Saturday Night Live," in the show's last episode before Election Day.
The Daily Beast
‘Come on, Senator’: Dana Bash Loses Cool Over Tim Scott’s Election Claims
Dana Bash ran out of patience while pressing Republican Sen. Tim Scott about Donald Trump and his pals' recent hints that this election may already be subject to voter fraud. The CNN host demanded to know whether Trump would honor the results of the election in the event of defeat, but the South Carolina senator neatly sidestepped the repeated questioning. “One of Donald Trump‘s allies, Steve Bannon, who was released from prison this week, told the New York Times that Trump should simply declare
BuzzFeed
Martha Stewart Said Ryan Reynolds Isn't Funny In Real Life, And Ryan Had A Pretty Good Response
"He can act funny, but he isn't funny."
BBC
Kim Jong Un is China's ally - but has become the 'comrade from hell'
Beijing is caught between its sanctioned neighbours, whose new alliance is causing concern.
BuzzFeed
33 Experiences Of Womanhood That May Just Blow Mens' Minds Wide Open
"A friend of mine once said: Men have to assess IF there is danger. Women have to assess HOW MUCH danger there is."
The Daily Beast
Trump Raged at Daily Beast Revelation That Campaign Boss Got $22 Million
Donald Trump considered firing his campaign manager Chris LaCivita after a bombshell report by the Daily Beast enraged the former president in the final stretch of his 2024 White House bid. Sources told The Atlantic allegations that LaCivita had pocketed $22 million from his work on the Trump campaign and related super PACs, left Trump “fuming” and feeling like the story “made him look like a fool.” The Beast’s story, published on Oct. 15, reportedly fueled the GOP presidential nominee‘s paranoi

Latest stories