
Anthropic CEO: AI could be more factually reliable than people in structured tasks – Crypto News
Artificial intelligence may now surpass humans in factual accuracy, at least in certain structured scenarios, according to Anthropic CEO Dario Amodei. Speaking at two major tech events this month, VivaTech 2025 in Paris and the inaugural Code With Claude developer day, Amodei asserted that modern AI models, including the newly launched Claude 4 series, may hallucinate less often than people when answering well-defined factual questions, Business Today reported.
Hallucination, in the context of AI, refers to the tendency of models to confidently produce inaccurate or fabricated information, the report added. This longstanding flaw has raised concerns in fields such as journalism, medicine, and law. However, Amodei’s remarks suggest that the tables may be turning—at least in controlled conditions.
“If you define hallucination as confidently stating something incorrect, humans actually do that quite frequently,” Amodei said during his keynote at VivaTech. He cited internal testing which showed Claude 3.5 outperforming human participants on structured factual quizzes. The results, he claimed, demonstrate a notable shift in reliability when it comes to straightforward question-answer tasks.
At the developer-focused Code With Claude event, where Anthropic introduced the Claude Opus 4 and Claude Sonnet 4 models, Amodei reportedly reiterated his stance. “It really depends on how you measure it,” he noted. “But I suspect that AI models probably hallucinate less than humans, though when they do, the mistakes are often more surprising.”
The newly unveiled Claude 4 models reflect Anthropic’s latest advances in the pursuit of artificial general intelligence (AGI), boasting improved capabilities in long-term memory, coding, writing, and tool integration. Of particular note, Claude Sonnet 4 achieved a 72.7 per cent score on the SWE-Bench software engineering benchmark, surpassing previous models and setting a new industry standard.
However, Amodei was quick to acknowledge that hallucinations have not been eradicated. In unstructured or open-ended conversations, even state-of-the-art models remain vulnerable to error. The CEO stressed that context, prompt design, and domain-specific application heavily influence a model’s accuracy, particularly in high-stakes settings like legal filings or healthcare.
His remarks follow a recent legal incident involving Anthropic’s chatbot, where the AI cited a non-existent case during a lawsuit filed by music publishers. The error led to an apology from the company’s legal team, reinforcing the ongoing challenge of ensuring factual consistency in real-world use.
Amodei also reportedly highlighted the lack of clear, industry-wide metrics for hallucination. “You can’t fix what you don’t measure precisely,” he cautioned, calling for standardised definitions and evaluation frameworks to track and mitigate AI errors.