
Metaverse
Anthropic CEO: AI could be more factually reliable than people in structured tasks – Crypto News
Artificial intelligence may now surpass humans in factual accuracy—at least in certain structured scenarios—according to Anthropic CEO Dario Amodei. Speaking at two major tech events this month, VivaTech 2025 in Paris and the inauguralCode With Claude developer day, Amodei asserted that modern AI models, including the newly launched Claude 4 series, may hallucinate less often than people when answering well-defined factual questions, reported Business Today.
Hallucination, in the context of AI, refers to the tendency of models to confidently produce inaccurate or fabricated information, the report added. This longstanding flaw has raised concerns in fields such as journalism, medicine, and law. However, Amodei’s remarks suggest that the tables may be turning—at least in controlled conditions.
“If you define hallucination as confidently stating something incorrect, humans actually do that quite frequently,” Amodei said during his keynote at VivaTech. He cited internal testing which showed Claude 3.5 outperforming human participants on structured factual quizzes. The results, he claimed, demonstrate a notable shift in reliability when it comes to straightforward question-answer tasks.
Reportedly, at the developer-focusedCode With Claude event, where Anthropic introduced the Claude Opus 4 and Claude Sonnet 4 models, Amodei reiterated his stance. “It really depends on how you measure it,” he noted. “But I suspect that AI models probably hallucinate less than humans, though when they do, the mistakes are often more surprising.”
The newly unveiled Claude 4 models reflect Anthropic’s latest advances in the pursuit of artificial general intelligence (AGI), boasting improved capabilities in long-term memory, coding, writing, and tool integration. Of particular note, Claude Sonnet 4 achieved a 72.7 per cent score on the SWE-Bench software engineering benchmark, surpassing previous models and setting a new industry standard.
However, Amodei was quick to acknowledge that hallucinations have not been eradicated. In unstructured or open-ended conversations, even state-of-the-art models remain vulnerable to error. The CEO stressed that context, prompt design, and domain-specific application heavily influence a model’s accuracy, particularly in high-stakes settings like legal filings or healthcare.
His remarks follow a recent legal incident involving Anthropic’s chatbot, where the AI cited a non-existent case during a lawsuit filed by music publishers. The error led to an apology from the company’s legal team, reinforcing the ongoing challenge of ensuring factual consistency in real-world use.
Amodei also reportedly highlighted the lack of clear, industry-wide metrics for hallucination. “You can’t fix what you don’t measure precisely,” he cautioned, calling for standardised definitions and evaluation frameworks to track and mitigate AI errors.
-
others1 week ago
Will Ethereum Price Rally to $3,200 as Wall Street Pivots from BTC to ETH – Crypto News
-
others5 days ago
Skies are clearing for Delta as stock soars 13% on earnings beat – Crypto News
-
others5 days ago
Skies are clearing for Delta as stock soars 13% on earnings beat – Crypto News
-
Cryptocurrency1 week ago
TON Foundation Confirms UAE Golden Visa Offer Is Not Official – Crypto News
-
others1 week ago
Company Owned by Billionaire Gold Miner May Be Seized by Russian Government for Allegedly Breaching Regulations: Report – Crypto News
-
Blockchain7 days ago
Insomnia Labs Debuts Stablecoin Credit Platform for Creators – Crypto News
-
Technology1 week ago
We’re Losing the Plot on AI in Universities – Crypto News
-
others1 week ago
Appropriate to have cautious gradual stance on easing – Crypto News
-
others6 days ago
EUR/GBP posts modest gain above 0.8600 ahead of German inflation data – Crypto News
-
Blockchain6 days ago
Ant Group Eyes USDC Integration Circle’s: Report – Crypto News
-
Cryptocurrency5 days ago
Bitcoin Breaks New Record at $111K, What’s Fueling the $120K Price Target? – Crypto News
-
Technology5 days ago
XRP Eyes $3 Breakout Amid Rising BlackRock ETF Speculation – Crypto News
-
Metaverse1 week ago
Are firms wasting their money on AI agents? – Crypto News
-
Metaverse1 week ago
Are firms wasting their money on AI agents? – Crypto News
-
Cryptocurrency1 week ago
Institutions Pile Up BTC But Price Doesn’t go up, Why? – Crypto News
-
others1 week ago
Bank Insider Admits to Nearly Decade-Long Scheme of Falsifying Loan Applications To Steal Funds: DOJ – Crypto News
-
Cryptocurrency1 week ago
This Week in Crypto Games: Planetside Dev’s ‘Reaper Actual’, What’s Next for ‘MapleStory Universe’ – Crypto News
-
Business1 week ago
Toncoin Price Drops 10% As UAE Authorities Call TON Golden Visa Offer Unofficial – Crypto News
-
Blockchain1 week ago
XRP Set To Shock The Crypto Market With 30% Share: Analyst – Crypto News
-
Cryptocurrency1 week ago
Coinbase hacker returns with $12.5 mln ETH buy: Will security concerns affect Ethereum? – Crypto News
-
others1 week ago
Is a Pi Network Crash Ahead As 272M Coins Unlock in July – Crypto News
-
Business1 week ago
Solana ETF Launch Delayed Amid Wait for SEC’s Crypto ETF Framework – Crypto News
-
Cryptocurrency1 week ago
On thinking ahead when markets get murky – Crypto News
-
Technology1 week ago
Solana Meme Coin PNUT Rallies 10% Amid Elon Musk’s Statement – Crypto News
-
Cryptocurrency7 days ago
Is ETH Finally Ready to Shoot For $3K? (Ethereum Price Analysis) – Crypto News
-
Cryptocurrency7 days ago
Tornado Cash Judge Won’t Let One Case Be Mentioned in Roman Storm’s Trial: Here’s Why – Crypto News
-
Blockchain7 days ago
XRP Rally Possible If Senate Web3 Crypto Summit Goes Well – Crypto News
-
others7 days ago
USD/CAD trades with positive bias below 1.3700; looks to FOMC minutes for fresh impetus – Crypto News
-
Blockchain6 days ago
Ethereum Bulls Roar — $3K Beckons After 5% Spike – Crypto News
-
Blockchain6 days ago
Kraken and Backed Expand Tokenized Equities to BNB Chain – Crypto News
-
others6 days ago
NovaEx Launches with a Security-First Crypto Trading Platform Offering Deep Liquidity and Institutional-Grade Infrastructure – Crypto News
-
Business6 days ago
Did Ripple Really Win XRP Lawsuit Despite $125M Fine? Lawyer Fires Back at CEO – Crypto News
-
Cryptocurrency6 days ago
XRP price forecast as coins surges 2.19% to $2.33 – Crypto News
-
others5 days ago
Anthony Scaramucci Says $180,000 Bitcoin Price Explosion Possible As BTC ‘Supremacy’ Creeps Up – Here’s His Timeline – Crypto News
-
Blockchain5 days ago
SUI Chart Pattern Confirmation Sets $3.89 Price Target – Crypto News
-
others5 days ago
EUR/GBP climbs as weak UK data fuels BoE rate cut speculation – Crypto News
-
Business4 days ago
PENGU Rallies Over 20% Amid Coinbase’s Pudgy Penguins PFP Frenzy – Crypto News
-
others1 week ago
NZD/USD risks further downside as Kiwi tests critical support at 0.6050 – Crypto News
-
Cryptocurrency1 week ago
This Week in Crypto Games: Planetside Dev’s ‘Reaper Actual’, What’s Next for ‘MapleStory Universe’ – Crypto News
-
Blockchain1 week ago
Cardano (ADA) Turns Upward — Signs of a Recovery Emerge – Crypto News
-
Cryptocurrency1 week ago
Macroeconomics, Market Shifts, and Trading Speed Take Center Stage at B2MEET by B2PRIME – Crypto News
-
Blockchain1 week ago
UAE Golden Visa Is ‘Being Developed Independently‘ — TON Foundation – Crypto News
-
others1 week ago
Nasdaq-Listed Bit Digital Converts Entire Bitcoin Holdings To Ethereum Treasury – Crypto News
-
others1 week ago
Ethereum Continues Outperforming Institutional Capital Flows As Investors Pour $1,040,000,000 Into Crypto Products: CoinShares – Crypto News
-
Cryptocurrency1 week ago
Elon Musk announces his ‘America Party’ will embrace Bitcoin, criticizes Trump’s fiscal bill – Crypto News
-
others1 week ago
USD/CHF gains ground below 0.8000 ahead of US tariff deadline – Crypto News
-
Technology1 week ago
Huaweis AI lab denies that one of its Pangu models copied Alibabas Qwen – Crypto News
-
Blockchain1 week ago
EU Questions Robinhood About OpenAI and SpaceX Stock Tokens – Crypto News
-
Cryptocurrency1 week ago
XRP could rally higher on steady capital inflow; check forecast – Crypto News
-
Blockchain1 week ago
Vitalik Buterin Backs Copyleft Licensing for Fairer Crypto – Crypto News