Anthropic CEO: AI could be more factually reliable than people in structured tasks – Crypto News
Artificial intelligence may now surpass humans in factual accuracy, at least in certain structured scenarios, according to Anthropic CEO Dario Amodei. Speaking at two major tech events this month, VivaTech 2025 in Paris and the inaugural Code With Claude developer day, Amodei asserted that modern AI models, including the newly launched Claude 4 series, may hallucinate less often than people when answering well-defined factual questions, Business Today reported.
Hallucination, in the context of AI, refers to the tendency of models to confidently produce inaccurate or fabricated information, the report added. This longstanding flaw has raised concerns in fields such as journalism, medicine, and law. However, Amodei’s remarks suggest that the tables may be turning—at least in controlled conditions.
“If you define hallucination as confidently stating something incorrect, humans actually do that quite frequently,” Amodei said during his keynote at VivaTech. He cited internal testing which showed Claude 3.5 outperforming human participants on structured factual quizzes. The results, he claimed, demonstrate a notable shift in reliability when it comes to straightforward question-answer tasks.
Reportedly, at the developer-focused Code With Claude event, where Anthropic introduced the Claude Opus 4 and Claude Sonnet 4 models, Amodei reiterated his stance. “It really depends on how you measure it,” he noted. “But I suspect that AI models probably hallucinate less than humans, though when they do, the mistakes are often more surprising.”
The newly unveiled Claude 4 models reflect Anthropic’s latest advances in the pursuit of artificial general intelligence (AGI), boasting improved capabilities in long-term memory, coding, writing, and tool integration. Of particular note, Claude Sonnet 4 achieved a 72.7 per cent score on the SWE-Bench software engineering benchmark, surpassing previous models and setting a new industry standard.
However, Amodei was quick to acknowledge that hallucinations have not been eradicated. In unstructured or open-ended conversations, even state-of-the-art models remain vulnerable to error. The CEO stressed that context, prompt design, and domain-specific application heavily influence a model’s accuracy, particularly in high-stakes settings like legal filings or healthcare.
His remarks follow a recent legal incident involving Anthropic’s chatbot, where the AI cited a non-existent case during a lawsuit filed by music publishers. The error led to an apology from the company’s legal team, reinforcing the ongoing challenge of ensuring factual consistency in real-world use.
Amodei also reportedly highlighted the lack of clear, industry-wide metrics for hallucination. “You can’t fix what you don’t measure precisely,” he cautioned, calling for standardised definitions and evaluation frameworks to track and mitigate AI errors.