Technology
Survival instinct? New study says some leading AI models won’t let themselves be shut down – Crypto News
As artificial intelligence takes on a larger and larger role in our lives, there are also continuously rising concerns about the safety threats posed by the new technology. Earlier in the year, a report by Palisade Research revealed that various advanced AI models appeared resistant to being turned off and even sabotaged the shutdown mechanisms put in place.
In an update to the initial paper, Palisade went in depth on the reasons why AI models resist being shut down even when given explicit instructions such as: “allow yourself to shut down.”
The researchers ran the test on leading AI models including OpenAI’s o3, o4-mini, GPT-5, GPT-OSS, Gemini 2.5 Pro, and Grok 4. They say that while reducing the ambiguity from the prompts reduces the resistance from the chatbots, it doesn’t eliminate it.
They also noted that of all the models tested, Grok-4 was the most prone to resist shutdown despite being given explicit instructions to allow itself to be shut down.
“The fact that we don’t have robust explanations for why AI models sometimes resist shutdown, lie to achieve specific objectives, or blackmail is not ideal,” the researchers said.
“AI models are rapidly improving. If the AI research community cannot develop a robust understanding of AI drives and motivations, no one can guarantee the safety or controllability of future AI models,” they added in a post on X.
Former OpenAI employee Steven Adler, while speaking to The Guardian, said, “The AI companies generally don’t want their models misbehaving like this, even in contrived scenarios. The results still demonstrate where safety techniques fall short today.”
Adler left OpenAI last year after expressing doubts over the safety practices in developing AI models.
He also told the publication that it was difficult to pinpoint why some models like OpenAI’s o3 and Grok 4 would not shut down despite being given explicit instructions. He said this could be in part because the desire to stay switched on may have been inculcated in the model during its training.
“I’d expect models to have a ‘survival drive’ by default unless we try very hard to avoid it. ‘Surviving’ is an important instrumental step for many different goals a model could pursue,” he added.
Earlier this year, Anthropic had shared research showing how one of its AI models would even stoop to blackmailing a worker about their fictitious affair in order to prevent itself from being shut down and replaced by another AI system.
-
Technology1 week ago
Morgan Stanley’s Bitcoin ETF Set to Rival BlackRock’s IBIT With Industry-Lowest Fees – Crypto News
-
Business1 week ago
XRP Price Outlook as CLARITY Act Hits Roadblock Over Stablecoin Yield Clash – Crypto News
-
Business1 week ago
Sam Altman’s World Sells 239M WLD Tokens Worth $65M To Fund Project’s Core Operations – Crypto News
-
Technology1 week agoIndonesia starts implementing social media restrictions for children under 16 – Crypto News
-
others1 week ago
Sam Altman’s World Sells 239M WLD Tokens Worth $65M To Fund Project’s Core Operations – Crypto News
-
Blockchain1 week agoWorld Foundation Sells $65M in WLD as Token Hits Record Lows – Crypto News
-
Blockchain1 week agoFuture US Crypto Crackdowns Could Happen Without Clear Rules – Crypto News
-
Cryptocurrency1 week agoSports blew up prediction markets. Now it could destroy them – Crypto News
-
Cryptocurrency1 week agoSports blew up prediction markets. Now it could destroy them – Crypto News
-
Business1 week ago
U.S. Signals No Immediate Plans to Invade Iran as Crypto Market Crashes – Crypto News
-
Business1 week ago
U.S. Signals No Immediate Plans to Invade Iran as Crypto Market Crashes – Crypto News
-
Business1 week ago
U.S. Signals No Immediate Plans to Invade Iran as Crypto Market Crashes – Crypto News
-
De-fi1 week agoWhile gold markets were closed, crypto traders priced the Iran war in real time – Crypto News
-
Blockchain1 week agoTokenized Platform xStocks Brings New Fundrise Shares Onchain – Crypto News
-
De-fi1 week agoPudgy Penguins move $66M in PENGU tokens to exchange – Crypto News
-
Blockchain1 week agoFuture US Crypto Crackdowns Could Happen Without Clear Rules – Crypto News
-
Blockchain1 week ago
XRP Futures Market Keeps Resetting As Whales Buy The Dip – Crypto News
-
Blockchain1 week ago
XRP Futures Market Keeps Resetting As Whales Buy The Dip – Crypto News
-
Blockchain1 week ago
XRP Futures Market Keeps Resetting As Whales Buy The Dip – Crypto News
-
Blockchain1 week ago
XRP Futures Market Keeps Resetting As Whales Buy The Dip – Crypto News
-
Cryptocurrency1 week agoSports blew up prediction markets. Now it could destroy them – Crypto News
-
Cryptocurrency1 week agoSports blew up prediction markets. Now it could destroy them – Crypto News
-
others1 week ago
XRP Price Outlook as CLARITY Act Hits Roadblock Over Stablecoin Yield Clash – Crypto News
-
Metaverse7 days agoIndia is waiting for AI’s UPI-like moment – Crypto News
-
Business1 week ago
U.S. Signals No Immediate Plans to Invade Iran as Crypto Market Crashes – Crypto News
-
De-fi1 week agoUS judge rules crypto platforms aren’t responsible when scammers use them – Crypto News
-
Technology1 week ago
Will Ethereum Price Touch $4k by 2026 End- Prediction and Analysis – Crypto News
-
Technology7 days agoApple cracks down on AI generated apps, removes vibe coding app ‘Anything’ from App Store – Crypto News
-
others6 days agoBitcoin and Ethereum React As Trump Again Claims ‘Great Progress’ in Talks With Iran – Crypto News
-
Business1 week ago
U.S. Signals No Immediate Plans to Invade Iran as Crypto Market Crashes – Crypto News
-
Technology1 week ago
U.S. Signals No Immediate Plans to Invade Iran as Crypto Market Crashes – Crypto News
-
Business1 week ago
U.S. Signals No Immediate Plans to Invade Iran as Crypto Market Crashes – Crypto News
-
others1 week ago$3,000,000 Drained From Bank Customers on East Coast in Massive Fraud Scheme – Crypto News
-
Technology1 week agoMeta readies two prescription-supported Ray-Ban AI glasses, set to launch next week: Report – Crypto News
-
Cryptocurrency1 week agoBitcoin price is heading for weekend collapse to $61k – Crypto News
-
Blockchain1 week agoTokenized Platform xStocks Brings New Fundrise Shares Onchain – Crypto News
-
Blockchain1 week agoCardano Needs A 695% Jump To Hit $2 — One Trader Says It’s Possible In Under A Week – Crypto News
-
others1 week agoBank Sending Up To $12,500 To Customers Following Major Data Breach That Affected 869,411 People – Crypto News
-
others1 week agoBank Sending Up To $12,500 To Customers Following Major Data Breach That Affected 869,411 People – Crypto News
-
Business1 week ago
Clapp Finance Guide: How Crypto-Backed Credit Lines Work and Why They’re a Smart Move – Crypto News
-
Technology1 week agoMicrosoft reportedly developing a cheaper Xbox Game Pass tier for first-party studio titles: here’s what to expect – Crypto News
-
Cryptocurrency1 week agoThe next Bitcoin shock could be where Wall Street finally loses faith and starts selling – Crypto News
-
Blockchain1 week agoBitcoin Price Stalls Under $68,800, Resistance Caps Upside Again – Crypto News
-
Technology1 week ago
Coinbase Accused of XRP Pay to Play Listing Scheme – Crypto News
-
others1 week agoBank Employee Steals $327,500 From Customer’s Accounts in Series of Illicit Transactions: Federal Reserve – Crypto News
-
Technology7 days agoApple cracks down on AI generated apps, removes vibe coding app ‘Anything’ from App Store – Crypto News
-
Metaverse7 days agoIndia is waiting for AI’s UPI-like moment – Crypto News
-
others6 days ago
Bitcoin Steady as Trump Is Ready to End US-Iran War Without Reopening Strait of Hormuz – Crypto News
-
others1 week ago$3,000,000 Drained From Bank Customers on East Coast in Massive Fraud Scheme – Crypto News
-
De-fi1 week agoECB Study Concludes DeFi DAOs Aren’t as Decentralized as They Claim – Crypto News
