
Metaverse
Anthropic CEO: AI could be more factually reliable than people in structured tasks – Crypto News
Artificial intelligence may now surpass humans in factual accuracy—at least in certain structured scenarios—according to Anthropic CEO Dario Amodei. Speaking at two major tech events this month, VivaTech 2025 in Paris and the inauguralCode With Claude developer day, Amodei asserted that modern AI models, including the newly launched Claude 4 series, may hallucinate less often than people when answering well-defined factual questions, reported Business Today.
Hallucination, in the context of AI, refers to the tendency of models to confidently produce inaccurate or fabricated information, the report added. This longstanding flaw has raised concerns in fields such as journalism, medicine, and law. However, Amodei’s remarks suggest that the tables may be turning—at least in controlled conditions.
“If you define hallucination as confidently stating something incorrect, humans actually do that quite frequently,” Amodei said during his keynote at VivaTech. He cited internal testing which showed Claude 3.5 outperforming human participants on structured factual quizzes. The results, he claimed, demonstrate a notable shift in reliability when it comes to straightforward question-answer tasks.
Reportedly, at the developer-focusedCode With Claude event, where Anthropic introduced the Claude Opus 4 and Claude Sonnet 4 models, Amodei reiterated his stance. “It really depends on how you measure it,” he noted. “But I suspect that AI models probably hallucinate less than humans, though when they do, the mistakes are often more surprising.”
The newly unveiled Claude 4 models reflect Anthropic’s latest advances in the pursuit of artificial general intelligence (AGI), boasting improved capabilities in long-term memory, coding, writing, and tool integration. Of particular note, Claude Sonnet 4 achieved a 72.7 per cent score on the SWE-Bench software engineering benchmark, surpassing previous models and setting a new industry standard.
However, Amodei was quick to acknowledge that hallucinations have not been eradicated. In unstructured or open-ended conversations, even state-of-the-art models remain vulnerable to error. The CEO stressed that context, prompt design, and domain-specific application heavily influence a model’s accuracy, particularly in high-stakes settings like legal filings or healthcare.
His remarks follow a recent legal incident involving Anthropic’s chatbot, where the AI cited a non-existent case during a lawsuit filed by music publishers. The error led to an apology from the company’s legal team, reinforcing the ongoing challenge of ensuring factual consistency in real-world use.
Amodei also reportedly highlighted the lack of clear, industry-wide metrics for hallucination. “You can’t fix what you don’t measure precisely,” he cautioned, calling for standardised definitions and evaluation frameworks to track and mitigate AI errors.
-
Technology5 days ago
Meet Matt Deitke: 24-year-old AI whiz lured by Mark Zuckerberg with whopping $250 million offer – Crypto News
-
Cryptocurrency7 days ago
XRP inflows drop 95% since July spike, while Chaikin data signals possible rally – Crypto News
-
Blockchain6 days ago
Bank of America Sees Interest in Tokenization of Real-World Assets – Crypto News
-
others7 days ago
Breaking: Strategy Files $4.2 Billion STRC Offering To Buy More Bitcoin – Crypto News
-
others6 days ago
XRP NIGHT Token Airdrop: Snapshot, Claim Date and What to Expect? – Crypto News
-
Technology1 week ago
Is AI causing tech worker layoffs? Thats what CEOs suggest, but the reality is complicated – Crypto News
-
Blockchain5 days ago
Altcoin Rally To Commence When These 2 Signals Activate – Details – Crypto News
-
Cryptocurrency4 days ago
Cardano’s NIGHT Airdrop to Hit 2.2M XRP Wallets — Find Out How Much You Can Get – Crypto News
-
Business1 week ago
Chase Launches $4 Million Grant Program as Restaurants Struggle – Crypto News
-
others1 week ago
Ripple Swell 2025: Top Speakers and Panelists to Watch this November – Crypto News
-
Blockchain7 days ago
SEC Crypto ETFs Ruling Brings Structural Fix, Not Retail Shakeup – Crypto News
-
Business7 days ago
Breaking: Solana ETFs Near Launch as Issuers Update S-1s With Fund Fees – Crypto News
-
Technology6 days ago
Oppo K13 Turbo series confirmed to launch in India with in-built fan technology: Price, specs and everything expected – Crypto News
-
others1 week ago
Why Does Jim Cramer Think the Market’s Slow Pace is Actually Good Sign? – Crypto News
-
others1 week ago
Blockchain Gaming Is Growing Up – What’s Behind the Sector’s Quiet Comeback – Crypto News
-
Business1 week ago
Stablecoins Won’t Boost Treasury Demand, Peter Schiff Warns – Crypto News
-
Business6 days ago
Bitpanda Co-Founder & Co-CEO Paul Klanschek Steps Down as Firm Eyes Frankfurt IPO – Crypto News
-
others1 week ago
EUR/USD dives as the US Dollar outperforms with all eyes on the Fed decision – Crypto News
-
Metaverse1 week ago
OpenAI rolls out ‘Study Mode’ in ChatGPT: What is it? How to use? All your questions answered… – Crypto News
-
Technology1 week ago
Breaking: BlackRock’s Ethereum ETF Staking Proposal Advances As SEC Acknowledges Filing – Crypto News
-
Technology1 week ago
Ethereum Price Prediction- Bulls Target $5,400 Amid DeFi Revival and Soaring TVL – Crypto News
-
Technology1 week ago
Coinbase exchange targets alleged cybersquatter in lawsuit – Crypto News
-
De-fi7 days ago
White House Crypto Report Recommends Expanding CFTC’s Role in Crypto Regulation – Crypto News
-
Technology7 days ago
Coinbase to Offer Tokenized Stocks and Prediction Markets in U.S. – Crypto News
-
others7 days ago
Canadian Dollar under pressure amid weak GDP, Trump tariff threat, and strong US data – Crypto News
-
Technology6 days ago
Big Tech’s Big Bet on AI Driving $344 Billion in Spend This Year – Crypto News
-
Cryptocurrency6 days ago
CME XRP Futures Hit Record Highs in July Amid ETF Approval Optimism – Crypto News
-
Cryptocurrency5 days ago
Stablecoins Are Finally Legal—Now Comes the Hard Part – Crypto News
-
Cryptocurrency5 days ago
Tron Eyes 40% Surge as Whales Pile In – Crypto News
-
Cryptocurrency5 days ago
Ethereum Hits Major 2025 Year Peak Despite Price Dropping to $3,500 – Crypto News
-
Technology4 days ago
Beyond Billboards: Why Crypto’s Future Depends on Smarter Sports Sponsorships – Crypto News
-
others1 week ago
Breaking: PayPal to Let Merchants Accept Payments in Over 100 Cryptocurrencies – Crypto News
-
Blockchain1 week ago
SEC Gives Green Light to In-Kind Transactions for Crypto ETPs – Crypto News
-
Technology1 week ago
Spotify hits 276M subscribers and strong user growth in Q2, but revenue and profit fall short of targets – Crypto News
-
Cryptocurrency1 week ago
Altcoins update: Dogecoin and Injective signal recoveries as Ethereum eyes $4,000 – Crypto News
-
Technology7 days ago
Solana DEX volume dips 20% after co-founder slams meme coins – Crypto News
-
Technology7 days ago
Tim Cook confirms Apple will ramp up AI spending, ‘open’ to acquisitions – Crypto News
-
Technology6 days ago
Oppo K13 Turbo series confirmed to launch in India with in-built fan technology: Price, specs and everything expected – Crypto News
-
Blockchain6 days ago
Strategy Expands STRC Offering Twice in One Week – Crypto News
-
Technology5 days ago
Will The First Spot XRP ETF Launch This Month? SEC Provides Update On Grayscale’s Fund – Crypto News
-
Technology5 days ago
Amazon Great Freedom Sale deals on smartwatches: Up to 70% off on Samsung, Apple and more – Crypto News
-
Blockchain4 days ago
XRP Must Hold $2.65 Support Or Risk Major Breakdown – Analyst – Crypto News
-
Blockchain4 days ago
XRP Must Hold $2.65 Support Or Risk Major Breakdown – Analyst – Crypto News
-
Business4 days ago
Is Quantum Computing A Threat for Bitcoin- Elon Musk Asks Grok – Crypto News
-
Cryptocurrency1 week ago
Coinbase and JPMorgan Chase partner for crypto integration – Crypto News
-
Business1 week ago
Breaking: CBOE Files For Rule Change To List Crypto ETFs Without SEC Approval – Crypto News
-
others1 week ago
Gold slides below $3,300 as traders await Fed policy decision – Crypto News
-
others1 week ago
Gold slides below $3,300 as traders await Fed policy decision – Crypto News
-
Technology7 days ago
Nintendo Direct Partner showcase highlights third-party titles coming to Switch and Switch 2 – Crypto News
-
others7 days ago
Can the record-breaking rally last? – Crypto News