New Study Calls Out ChatGPT-4 For Declining Performance – Crypto News
Recent observations from users and now researchers suggest that ChatGPT, the renowned artificial intelligence (AI) model developed by OpenAI, may be exhibiting signs of performance degradation. However, the reasons behind these perceived changes remain a subject of debate and speculation.
Last week, a study from a collaboration between Stanford University and UC Berkeley was published in the arXiv preprint archive. It highlighted noticeable differences in the responses of GPT-4 and its predecessor, GPT-3.5, over the few months since the former’s March 13 debut.
A decline in accurate responses
One of the most striking findings was GPT-4’s reduced accuracy in answering complex mathematical questions. For instance, while the model demonstrated a high success rate (97.6 percent) in answering queries about large-scale prime numbers in March, its accuracy in answering that same prompt correctly plummeted to a mere 2.4 percent in June.
The study also pointed out that, while older versions of the bot offered detailed explanations for their answers, the latest iterations seemed more reticent, often forgoing step-by-step solutions even when explicitly prompted. Interestingly, during the same period, GPT-3.5 showed improved capabilities in addressing basic math problems, though it still struggled with more intricate code generation tasks.
These findings have fueled online discussions on the topic, particularly among regular ChatGPT users who have long wondered about the possibility of the program being “neutered.” Many have taken to platforms like Reddit to share their experiences, with some speculating whether GPT-4’s performance is genuinely deteriorating or if users are becoming more discerning of the system’s inherent limitations. Some users recounted instances where the AI failed to restructure text as requested, opting instead for fictional narratives. Others highlighted the model’s struggles with basic problem-solving tasks, spanning both mathematics and coding.
Coding ability changes, speculation, and more
The research team also delved into GPT-4’s coding capabilities, which appeared to have regressed. When the model was tested using problems from the online learning platform LeetCode, only 10 percent of the generated code adhered to the platform’s guidelines. This marked a significant drop from a 50 percent success rate observed in March.
OpenAI’s approach to updating and fine-tuning its models has always been somewhat enigmatic, leaving users and researchers to speculate about the changes made behind the scenes. With AI regulation and ethical use drawing global concern and legislation in the works, transparency is increasingly on the minds of government regulators and even everyday users of the AI-based tech products that are emerging ever more frequently.
While the model’s responses seemed to lack the depth and rationale observed in earlier versions, the recent study did note some positive developments: GPT-4 demonstrated enhanced resistance to certain types of attacks and showed a reduced propensity to respond to harmful prompts.
Peter Welinder, OpenAI’s VP of Product, addressed the concerns of the public more than a week before the study was released, stating that GPT-4 has not been “dumbed down.” He suggested that as more users engage with ChatGPT, they might become more attuned to its limitations.
While the study offers valuable insights, it also raises more questions than it answers. The dynamic nature of AI models, combined with the proprietary nature of their development, means that users and researchers must often navigate a landscape of uncertainty. As AI continues to shape the future of technology and communication, the call for transparency and accountability is likely to only grow louder.
