

Technology
Only 2.4% in math: Is ChatGPT turning dumb? – Crypto News
Why is ChatGPT in the news?
Recently, researchers Lingjiao Chen and James Zou from Stanford University, and Matei Zaharia from UC Berkeley tested GPT-3.5 and GPT-4 for solving math problems, answering sensitive and dangerous questions, generating code and for visual reasoning. The conclusion: the “performance and behavior” of both these large language models (LLMs) “can vary greatly over time”. The March version of GPT-4 identified prime numbers with 97.6% accuracy. In the June version, the accuracy collapsed to 2.4%. Both made “more formatting mistakes in code generation in June than in March”.
How did other experts react?
When the findings were published, AI expert Gary Marcus tweeted that “this instability will be LLMs’ undoing”. Jim Fan, senior scientist at Nvidia, opined that in a bid to make GPT-4 “safer”, Open AI could have made it less useful, “leading to a possible degradation in cognitive skills”. He added that in a bid to cut costs, OpenAI could have reduced the parameters. Princeton professor of computer science Arvind Narayanan and a PhD student at the same university co-authored a response in which they argue, among other things, that variance in behavior does not suggest a degradation in capability.
How is OpenAI reacting to this controversy?
Reacting to user criticism, Peter Welinder (in pic), vice-president of OpenAI, which owns ChatGPT, said GPT-4 was getting smarter with each new version. “When you use it more heavily, you start noticing issues you didn’t see before.” Logan Kilpatrick, lead of developer relations at OpenAI, tweeted: “We are actively looking into the reports people shared.”
What does this mean for users and cos?
Human resources tasks like onboarding, training, performance management, and employee queries and complaints can be automated using ChatGPT. But to integrate OpenAI’s application programming interfaces (APIs) with the business workflows of companies, one has to continuously monitor, retrain and fine-tune the models to ensure that they continue to produce accurate output and stay up-to-date. Variance in AI model behavior only makes it a bigger challenge.
Is it a boost for open-source LLMs?
The day the paper was released, Meta also released a second version of its free open-source LLM called Llama 2 for research and commercial use, providing an alternative to the pricy proprietary LLMs sold by OpenAI like ChatGPT Plus and Google’s Bard. Interestingly, Databricks Inc., whose CTO is Zaharia (one of the paper’s authors), has open-sourced its LLM called Dolly 2.0. Hugging Face’s BigScience Large Open-Science Open-Access Multilingual Language Model (BLOOM), too, is open to researchers to run.
Catch all the technology news and Updates on Live Mint. Download Mint News App to get Daily market update Live business news,
Updated: 20 Jul 2023, 11:46 PM IST
-
Blockchain6 days ago
The CFO and Treasurer’s Guide to Digital Assets – Crypto News
-
Cryptocurrency1 week ago
Famous Crypto Analyst Advises to Sell NVIDIA Stock: Here’s Why – Crypto News
-
Business1 week ago
Binance Enables Apple & Google Pay Features With This Latest Partnership – Crypto News
-
Cryptocurrency1 week ago
Tariffs Are Just the Tip of the Iceberg, Warns Billionaire Investor Ray Dalio – Crypto News
-
Cryptocurrency1 week ago
BitMEX Study Reveals Exchange-Specific Price Trends for Perpetual Swaps Across Leading Exchanges – Crypto News
-
Technology1 week ago
Apple could give iPhone a radical makeover for its 20th anniversary, report says – Crypto News
-
Business1 week ago
Will Dogecoin Price Ever Reach $1? Top Analysts Weigh In – Crypto News
-
Cryptocurrency1 week ago
Dire Wolf Solana Meme Coin Soars to $13.6M Market Cap After ‘De-Extinction’ – Crypto News
-
Technology1 week ago
Apple exported iPhones worth ₹1.5 trillion from India in FY25: Union Minister Ashwini Vaishnaw – Crypto News
-
others1 week ago
John Deaton Highlights Ripple’s Journey from Legal Struggle To ETF Launches – Crypto News
-
Technology1 week ago
Can It Take The Baton And Initiate The Next Altcoin Rally As The Market Strengthens? – Crypto News
-
Cryptocurrency1 week ago
The Downside Prevails As Cardano Price Rejected at $0.60 – Crypto News
-
Cryptocurrency1 week ago
Dogecoin hits multi-month low, but is a market reset on the way? – Crypto News
-
Technology1 week ago
Musks DOGE using AI to snoop on U.S. federal workers, sources say – Crypto News
-
Cryptocurrency1 week ago
ETH Hits 2-Year Low as BTC, XRP Hold Support – Crypto News
-
Cryptocurrency1 week ago
Peter Schiff Cautions US Against Trade War Escalation With China – Crypto News
-
Blockchain6 days ago
How to mine Bitcoin at home in 2025: A realistic guide – Crypto News
-
Technology1 week ago
iPad Air M3 (2025) Review: Still the most practical iPad – Crypto News
-
Business1 week ago
Cathie Wood’s Ark Invest Loads $13 Million of Coinbase Stock, COIN Price Reversal Soon? – Crypto News
-
others1 week ago
Australia Shuts Over 90 Companies Linked To Pig Butchering Schemes – Crypto News
-
Business1 week ago
“Perfect Time to Buy” – Patterns Point to a Pepe Coin Price Resurgence – Crypto News
-
Cryptocurrency1 week ago
Bitcoin is highly correlated with stock market since August 2024 – Crypto News
-
Business1 week ago
Sui Price Recovers As CBOE Files To List SUI ETF – Crypto News
-
Technology6 days ago
Microsoft’s Greatest Hits and Epic Fails: A 50-Year Wild Ride – Crypto News
-
Blockchain1 week ago
Cardano (ADA) Eyes Resistance Break—Failure Could Spark Fresh Losses – Crypto News
-
Technology1 week ago
PumpFun Livestream Feature Is Back — But What’s Changed? – Crypto News
-
Business1 week ago
Is Ripple Hinting at Cardano Partnership? – Crypto News
-
Blockchain1 week ago
Cathie Wood’s ARK bags $26M in Coinbase shares, unloads Bitcoin ETF – Crypto News
-
Technology1 week ago
China Retaliates, Triggering a Dead Cat Bounce in Crypto – Crypto News
-
Business1 week ago
Solana Unveils Confidential Balances Token Extension – Crypto News
-
others1 week ago
Top 3 Reasons XRP Price May Surge as Analyst Delivers a $693 Billion Prediction – Crypto News
-
Cryptocurrency1 week ago
BTC Risks Further Downside if it Fails to Reclaim This Resistance – Crypto News
-
Cryptocurrency1 week ago
OpenAI Countersues Elon Musk, Accuses Billionaire of ‘Bad-Faith Tactics’ – Crypto News
-
Blockchain6 days ago
BTC, ETH, XRP, BNB, SOL, DOGE, ADA, LEO, LINK, AVAX – Crypto News
-
Technology6 days ago
Dogecoin Price Gearing for A 3X Rally Amid DOGE Whale Accumulation – Crypto News
-
others5 days ago
Binance Issues Important Update On 10 Crypto, Here’s All – Crypto News
-
others1 week ago
WTI price mostly unchanged at European opening – Crypto News
-
others1 week ago
Technical Indicator Suggesting Bitcoin (BTC) Bull Market Hasn’t Started Yet: Quant Analyst PlanB – Crypto News
-
others1 week ago
Gold price under pressure despite high risk aversion – Commerzbank – Crypto News
-
Technology1 week ago
Shiba Inu Price Risks 50% Crash As Bearish Breakout Looms – Crypto News
-
Blockchain1 week ago
Web3 active developers drop nearly 40% in one year – Crypto News
-
Blockchain1 week ago
XRP Down, But History Says Millionaires Were Made This Way – Crypto News
-
others1 week ago
Economist Alex Krüger Warns US Stocks Could Repeat 2008 Bear Market Amid Trump’s Trade War – Crypto News
-
Technology1 week ago
XRP Leveraged ETF Outshines Solana At Launch – Crypto News
-
Cryptocurrency1 week ago
Stablecoin infrastructure platform M^0 expands to Solana – Crypto News
-
Blockchain1 week ago
Investors Looking To Buy Bitcoin? – Crypto News
-
Cryptocurrency1 week ago
Galaxy’s imminent US listing reflects SEC change – Crypto News
-
others1 week ago
Crypto Products See $240,000,000 in Outflows Likely in Response to US Tariff Threats: CoinShares – Crypto News
-
Blockchain7 days ago
NY attorney general urges Congress to keep pensions crypto-free — ‘No intrinsic value’ – Crypto News
-
Technology7 days ago
iQOO Z10 5G, Z10x 5G launched in India, price starts at ₹13,499. Check full price, specs and more – Crypto News