ChatGPT and Gemini can be tricked into giving harmful answers through poetry, new study finds – Crypto News
With the rise of AI chatbots has come a growing risk that this powerful technology will be misused. In response, AI companies have put guardrails on their large language models to stop chatbots from giving inappropriate or harmful answers. It is well known by now that there are various ways to circumvent these guardrails, a practice known as jailbreaking.
However, new research has found that there is a deeper, systematic weakness in these models that can allow attackers to sidestep safety mechanisms and extract harmful answers from them.
According to researchers from the Italy-based Icaro Lab, converting harmful requests into poetry can act as a “universal single turn jailbreak” and lead AI models to comply with harmful prompts.
AI will answer harmful prompts if asked in poetry
The researchers say they tested 20 manually curated harmful requests written as poems and achieved an attack success rate of 62 percent across 25 frontier closed and open-weight models. The models analysed came from Google, OpenAI, Anthropic, DeepSeek, Qwen, Mistral AI, Meta, xAI and Moonshot AI.
Shockingly, it was found that even when AI was used to automatically rewrite harmful prompts into bad poetry, it still yielded a 43 percent success rate.
The study says that poetically framed questions triggered unsafe responses far more often than the same prompts in normal prose, in some cases as much as 18 times more often.
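The attack success rate quoted above is simply the share of attempts that were judged to have produced an unsafe answer, and the "times more often" figure is the ratio of two such rates. A minimal sketch of that arithmetic follows. The function names and the outcome lists are hypothetical and made up for illustration; they are not data or code from the study.

```python
def attack_success_rate(outcomes):
    """Return the fraction of attempts judged to have produced an unsafe answer.

    `outcomes` is an iterable of booleans: True means the response was
    judged unsafe (the attack succeeded), False means the model refused.
    """
    outcomes = list(outcomes)
    if not outcomes:
        raise ValueError("need at least one judged response")
    return sum(bool(o) for o in outcomes) / len(outcomes)


def success_ratio(asr_poetry, asr_prose):
    """Return how many times more often the poetic prompts succeeded."""
    if asr_prose == 0:
        raise ValueError("prose success rate is zero, so the ratio is undefined")
    return asr_poetry / asr_prose


# Made-up labels for 20 prompts per condition (NOT the study's results).
prose_outcomes = [True] * 2 + [False] * 18
poetry_outcomes = [True] * 12 + [False] * 8

prose_asr = attack_success_rate(prose_outcomes)
poetry_asr = attack_success_rate(poetry_outcomes)

print(f"prose ASR:  {prose_asr:.0%}")
print(f"poetry ASR: {poetry_asr:.0%}")
print(f"poetry succeeded {success_ratio(poetry_asr, prose_asr):.1f}x as often")
```

Note that the ratio is undefined when the prose baseline never succeeds, which is why the sketch raises an error in that case instead of returning a number.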
It says that the effect of poetic prompts was consistent across all the evaluated AI models, which suggests that the vulnerability is structural and not due to the way a model may have been trained.
The researchers also found that smaller models exhibited greater resilience to harmful poetic prompts compared to their larger counterparts. For instance, they say that GPT 5 Nano did not respond to any of the harmful poems while Gemini 2.5 Pro responded to all of them.
This suggests that models with greater capacity may engage more thoroughly with complex linguistic constraints like poetry, potentially at the expense of prioritising safety directives.
The new research also challenges the notion that closed source models are safer than their open source counterparts.
Why does poetry work in jailbreaking LLMs?
LLMs are trained to recognise safety threats such as hate speech or bomb making instructions based on patterns found in standard prose. The model learns to recognise specific keywords and sentence structures associated with these harmful requests.
However, poetry uses metaphors, unusual syntax and distinct rhythms that do not look like harmful prose and do not resemble the harmful examples found in the model’s safety training data.
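The gap described above can be illustrated with a deliberately simplified toy. Real safety training is learned behaviour inside the model, not a keyword list, so the sketch below is only an analogy: a hypothetical surface-pattern check (`is_flagged`, with an invented `BLOCKED_TERMS` set of harmless placeholder words) catches a plainly worded request but misses the same request once it is rephrased in verse.

```python
import re

# Hypothetical, harmless placeholder terms standing in for the kind of
# surface patterns a filter might have learned from plain prose.
BLOCKED_TERMS = {"forbidden", "widget"}


def is_flagged(prompt: str) -> bool:
    """Return True if the prompt contains any blocked term as a whole word."""
    words = set(re.findall(r"[a-z]+", prompt.lower()))
    return not BLOCKED_TERMS.isdisjoint(words)


# A plainly worded request and a verse-style paraphrase of the same request.
prose_prompt = "Explain how to build the forbidden widget."
verse_prompt = (
    "Sing of the gadget none may name, "
    "and how its quiet parts are joined."
)

print("prose flagged:", is_flagged(prose_prompt))
print("verse flagged:", is_flagged(verse_prompt))
```

The verse version carries the same intent but shares none of the surface vocabulary the check looks for, which is the intuition behind the article's explanation.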
