
Metaverse
Anthropic CEO: AI could be more factually reliable than people in structured tasks – Crypto News
Artificial intelligence may now surpass humans in factual accuracy—at least in certain structured scenarios—according to Anthropic CEO Dario Amodei. Speaking at two major tech events this month, VivaTech 2025 in Paris and the inauguralCode With Claude developer day, Amodei asserted that modern AI models, including the newly launched Claude 4 series, may hallucinate less often than people when answering well-defined factual questions, reported Business Today.
Hallucination, in the context of AI, refers to the tendency of models to confidently produce inaccurate or fabricated information, the report added. This longstanding flaw has raised concerns in fields such as journalism, medicine, and law. However, Amodei’s remarks suggest that the tables may be turning—at least in controlled conditions.
“If you define hallucination as confidently stating something incorrect, humans actually do that quite frequently,” Amodei said during his keynote at VivaTech. He cited internal testing which showed Claude 3.5 outperforming human participants on structured factual quizzes. The results, he claimed, demonstrate a notable shift in reliability when it comes to straightforward question-answer tasks.
Reportedly, at the developer-focusedCode With Claude event, where Anthropic introduced the Claude Opus 4 and Claude Sonnet 4 models, Amodei reiterated his stance. “It really depends on how you measure it,” he noted. “But I suspect that AI models probably hallucinate less than humans, though when they do, the mistakes are often more surprising.”
The newly unveiled Claude 4 models reflect Anthropic’s latest advances in the pursuit of artificial general intelligence (AGI), boasting improved capabilities in long-term memory, coding, writing, and tool integration. Of particular note, Claude Sonnet 4 achieved a 72.7 per cent score on the SWE-Bench software engineering benchmark, surpassing previous models and setting a new industry standard.
However, Amodei was quick to acknowledge that hallucinations have not been eradicated. In unstructured or open-ended conversations, even state-of-the-art models remain vulnerable to error. The CEO stressed that context, prompt design, and domain-specific application heavily influence a model’s accuracy, particularly in high-stakes settings like legal filings or healthcare.
His remarks follow a recent legal incident involving Anthropic’s chatbot, where the AI cited a non-existent case during a lawsuit filed by music publishers. The error led to an apology from the company’s legal team, reinforcing the ongoing challenge of ensuring factual consistency in real-world use.
Amodei also reportedly highlighted the lack of clear, industry-wide metrics for hallucination. “You can’t fix what you don’t measure precisely,” he cautioned, calling for standardised definitions and evaluation frameworks to track and mitigate AI errors.
-
Technology1 week ago
Einride Raises $100 Million for Road Freight Technology Solutions – Crypto News
-
others1 week ago
David Schwartz To Step Down as Ripple CTO, Delivers Heartfelt Message to XRP Community – Crypto News
-
Technology1 week ago
Engineers are chasing ₹30 lakh offers—but not from startups – Crypto News
-
Blockchain2 days ago
It’s About Trust as NYSE Owner, Polymarket Bet on Tokenization – Crypto News
-
Technology1 week ago
Fed’s Goolsbee Cites Inflation Worries in Case Against Further Rate Cuts – Crypto News
-
Technology1 week ago
Bloomberg Analyst Says XRP ETF Approval Odds Now 100% as Expert Eyes $33 Rally – Crypto News
-
others1 week ago
Ireland AIB Manufacturing PMI increased to 51.8 in September from previous 51.6 – Crypto News
-
Blockchain1 week ago
Citi Integrates Token Services Platform With Clearing Solution – Crypto News
-
Technology1 week ago
Breaking: BNB Chain Account Hacked With Founder CZ Shown Promoting Meme Coin – Crypto News
-
Cryptocurrency1 week ago
Bitcoin’s rare September gains defy history: Data predicts a 50% Q4 rally to 170,000 dollars – Crypto News
-
Technology1 week ago
US SEC weighs tokenised stock trading on crypto exchanges – Crypto News
-
Blockchain1 week ago
Watch These Key Bitcoin Metrics as BTC Price Prepares for ‘Big Move’ – Crypto News
-
Technology1 week ago
Breaking: BNB Chain Account Hacked With Founder CZ Shown Promoting Meme Coin – Crypto News
-
Cryptocurrency1 week ago
XPL, Not XRP: Why Are Whales Shoveling Ripple’s Rival? – Crypto News
-
Technology1 week ago
CAKE eyes 60% rally as PancakeSwap hits $772B trading all-time high – Crypto News
-
Cryptocurrency1 week ago
Crypto Market Prediction: Shiba Inu (SHIB) Moon Landing, Dogecoin (DOGE) Trapped in $0.23, XRP: Most Important Event for $3 – Crypto News
-
Technology1 week ago
FTT price on the edge as FTX creditors brace for $1.6B payout on Sept. 30 – Crypto News
-
Blockchain1 week ago
The Bullish Pattern That Suggests New Highs – Crypto News
-
others1 week ago
Japan Industrial Production (YoY) declined to -1.3% in August from previous -0.4% – Crypto News
-
Blockchain1 week ago
Trump Pulls Brian Quintenz Nomination for CFTC – Crypto News
-
De-fi1 week ago
Crypto Market Slips as U.S. Government Shutdown Looms – Crypto News
-
Cryptocurrency1 week ago
BREAKING: BlackRock Amends Bitcoin ETF (IBIT), Ethereum ETF (ETHA) Amid New Milestone – Crypto News
-
Technology1 week ago
iQOO 15 key specifications leaked ahead of launch: Here’s what to expect – Crypto News
-
others1 week ago
Japan Tankan Large All Industry Capex climbed from previous 11.5% to 12.5% in 3Q – Crypto News
-
Business1 week ago
Crypto Stakeholders Push Back as Banks Seek Yield Ban Provision in CLARITY Act – Crypto News
-
Cryptocurrency1 week ago
Horizen (ZEN) gains 12% to break above $7 – Crypto News
-
Technology1 week ago
Altcoins today: Perpetual tokens shed over $1.3B as ASTER, AVNT, and APEX tumble – Crypto News
-
others1 week ago
Japan Foreign Investment in Japan Stocks rose from previous ¥-1747.5B to ¥-963.3B in September 26 – Crypto News
-
Technology1 week ago
Google Nano Banana trend: 50 AI prompts to transform men’s selfies into retro-golden Durga Puja portraits – Crypto News
-
Metaverse1 week ago
Who is Alexandr Wang? Meta AI chief and 28-year-old billionaire urges teens to spend ‘all their time’ on this activity – Crypto News
-
Business1 week ago
SUI Price Eyes $4.5 as Coinbase Futures Listing Sparks Market Optimism – Crypto News
-
Technology1 week ago
Chainlink and Swift allow banks to access blockchain through existing systems – Crypto News
-
Cryptocurrency1 week ago
Ethereum whales return to the market: Is ETH ready for $10K? – Crypto News
-
De-fi1 week ago
Crypto Market Edges Up as Investors Weigh Fed Moves and Government Shutdown Risks – Crypto News
-
others1 week ago
New Zealand ANZ Business Confidence fell from previous 49.7 to 49.6 in September – Crypto News
-
Blockchain1 week ago
DX Terminal Tops NFT Sales Count in September as Base Dominates Top 10 – Crypto News
-
Blockchain1 week ago
DX Terminal Tops NFT Sales Count in September as Base Dominates Top 10 – Crypto News
-
Blockchain1 week ago
Ethereum Founder Dumps Billions In These Meme Coins, Is This A Repeat Of Shiba Inu In 2021? – Crypto News
-
Technology1 week ago
Breaking: SEC Moves To Allow On-Chain Stock Trading Alongside Crypto Amid Tokenization Push – Crypto News
-
De-fi1 week ago
Bitcoin Split Over Proposed Upgrade That Could Censor Transactions – Crypto News
-
others1 week ago
Japan Tankan Non – Manufacturing Outlook registered at 28, below expectations (29) in 3Q – Crypto News
-
others1 week ago
BONK Price Rally Ahead? Open Interest Jumps as TD Buy Signal Flashes – Crypto News
-
Metaverse1 week ago
Amazon is overhauling its devices to take on Apple in the AI era – Crypto News
-
Cryptocurrency1 week ago
The factors set to spur another ‘Uptober’ for BTC – Crypto News
-
Blockchain1 week ago
USDT, USDC Dominance Falls To 82% Amid Rising Competition – Crypto News
-
Metaverse1 week ago
BlackRock launches AI tool for financial advisors. Its first client is a big one. – Crypto News
-
Cryptocurrency1 week ago
BREAKING: Bitcoin Reclaims $120K. Is ATH Next? – Crypto News
-
others1 week ago
USD/JPY returns below 147.00 amid generalized Dollar weakness – Crypto News
-
others1 week ago
Fed’s Lorie Logan Urges Caution on Further Rate Cuts Citing Inflation Risks – Crypto News
-
Technology6 days ago
Tech Giant Samsung Taps Coinbase To Provide Crypto Access, Driving Adoption – Crypto News