NFT
New Study Calls Out ChatGPT-4 For Declining Performance – Crypto News
Recent observations from users and now researchers suggest that ChatGPT, the renowned artificial intelligence (AI) model developed by OpenAI, may be exhibiting signs of performance degradation. However, the reasons behind these perceived changes remain a subject of debate and speculation.
Last week, a studies emerged from a collaboration between Stanford University and UC Berkeley which was published in the ArXiv preprint archive and highlighted noticeable differences in the responses of GPT-4 and its predecessor, GPT-3.5, over a span of a few months since the former’s March 13 debut.
A decline in accurate responses
One of the most striking findings was GPT-4’s reduced accuracy in answering complex mathematical questions. For instance, while the model demonstrated a high success rate (97.6 percent) in answering queries about large-scale prime numbers in March, its accuracy in answering that same prompt correctly plummeted to a mere 2.4 percent in June.
The study also pointed out that, while older versions of the bot offered detailed explanations for their answers, the latest iterations seemed more reticent, often forgoing step-by-step solutions even when explicitly prompted. Interestingly, during the same period, GPT-3.5 showed improved capabilities in addressing basic math problems, though it still struggled with more intricate code generation tasks.
These findings have fueled online discussions on the topic, particularly among regular ChatGPT users how have long wondered about the possibility of the program being “neutered.” Many have taken to platforms like Reddit to share their experiences, with some speculating whether GPT-4’s performance is genuinely deteriorating or if users are becoming more discerning of the system’s inherent limitations. Some users recounted instances where the AI failed to restructure text as requested, opting instead for fictional narratives. Others highlighted the model’s struggles with basic problem-solving tasks, spanning both mathematics and coding.
Coding ability changes, speculation, and more
The research team also delved into GPT-4’s coding capabilities, which appeared to have regressed. When the model was tested using problems from the online learning platform LeetCode, only 10 percent of the generated code adhered to the platform’s guidelines. This marked a significant drop from a 50 percent success rate observed in March.
OpenAI’s approach to updating and fine-tuning its models has always been somewhat enigmatic, leaving users and researchers to speculate about the changes made behind the scenes. With global concern and ongoing legislation in the works surrounding AI regulation and its ethical use, transparency is increasingly on the minds of government regulators and even everyday users of the AI-based tech products that are emerging ever-more frequently.
While the model’s responses seemed to lack the depth and rationale observed in earlier versions, the recent study did note some positive developments: GPT-4 demonstrated enhanced resistance to certain types of attacks and showed a reduced propensity to respond to harmful prompts.
Peter Welinder, OpenAI’s VP of Product, addressed the concerns of the public more than a week before the study was released, stating that GPT-4 has not been “dumbed down.” He suggested that as more users engage with ChatGPT, they might become more attuned to its limitations.
While the study offers valuable insights, it also raises more questions than it answers. The dynamic nature of AI models, combined with the proprietary nature of their development, means that users and researchers must often navigate a landscape of uncertainty. As AI continues to shape the future of technology and communication, the call for transparency and accountability is likely to only grow louder.
-
others1 week ago
Ethereum Price Outlook as CLARITY Act Advances on Stablecoin Yield Deal – Crypto News
-
Cryptocurrency1 week agoWhy XRP Ledger is becoming a $3.6B hot spot for tokenized energy commodities – Crypto News
-
others1 week agoScammers Impersonating Bank of America Steal $41,000 From Customer – Now He’s at Risk of Losing His Home: Report – Crypto News
-
Blockchain1 week agoRiot Posts $167M in Q1 Revenue as Data Center Arm Pulls in $33M – Crypto News
-
Blockchain1 week agoBitcoin Apparent Demand Remains Weak — What This Says About Price Recovery – Crypto News
-
Business1 week ago
Sui Price Outlook After CME Futures Launch—Is a Breakout to $1 Coming? – Crypto News
-
Business1 week ago
Kraken Officially Enters US Crypto Derivatives Market with Bitnomial Acquisition – Crypto News
-
Technology1 week ago
Bitcoin Drops as Iran Launches Missile Attacks on UAE, Threatening U.S.-Iran Ceasefire – Crypto News
-
Technology1 week ago
Bitcoin Drops as Iran Launches Missile Attacks on UAE, Threatening U.S.-Iran Ceasefire – Crypto News
-
Blockchain1 week agoA16z Backs CFTC in Fight Against State Prediction Market Bans – Crypto News
-
Technology1 week agoExplained: What went wrong with ChatGPT? How did ‘goblins’ enter OpenAI’s chatbot? – Crypto News
-
Technology1 week ago
Bitcoin Drops as Iran Launches Missile Attacks on UAE, Threatening U.S.-Iran Ceasefire – Crypto News
-
Technology1 week agoFujifilm Instax Mini 13 Review: A well-rounded entry-level instant camera – Crypto News
-
Cryptocurrency1 week agoCoinbase’s new credit fund shows why banks are fighting stablecoin yield on the Clarity Act – Crypto News
-
others1 week ago
Trump’s WLFI Price Hits All-Time Low As Team Secretly Sells 5.9B Tokens – Crypto News
-
Cryptocurrency1 week agoJapan has moved to save the yen again, and Bitcoin traders may pay the price – Crypto News
-
Technology1 week ago
Solana Co-founder Says Ethereum L2s Warns Prone To Quantum Risk – Crypto News
-
Business1 week ago
Grayscale Chairman Lauds Zcash as Arthur Hayes Hints at ZEC Price to $400 – Crypto News
-
Technology1 week ago
Just-In: Michael Saylor’s STRC Team Fires Back At Peter Schiff Over Bitcoin Criticism – Crypto News
-
De-fi1 week agoTrump’s World Liberty Finance (WLFI) sues Tron’s Justin Sun – Crypto News
-
Business1 week agoFinTechs Race to Fix the Middle Market’s Finance Gap – Crypto News
-
others1 week agoUp to $5,000 per Person Incoming in Data Breach Settlement Affecting 530,000 People in Minnesota and Wisconsin – Crypto News
-
others1 week ago
What Will Happen To Satoshi’s Bitcoin Amid Quantum Threats? Expert Weighs In – Crypto News
-
Technology1 week agoOpenAI ignored employee pleas to report a violent ChatGPT user months before a deadly mass shooting – Crypto News
-
Blockchain1 week agoAnalyst Predicts Exactly When To Sell Bitcoin For The Most Return – Crypto News
-
Technology1 week agoiPhone 18 Pro launching soon: Expected price, colours, display and big camera upgrades – Crypto News
-
Cryptocurrency1 week agoWall Street’s $292 billion risk-on rotation just created a new bullish setup for Bitcoin – Crypto News
-
Business1 week ago
Sui Price Outlook After CME Futures Launch—Is a Breakout to $1 Coming? – Crypto News
-
Cryptocurrency1 week agoWestern Union bets on stablecoins after EPS drops to $0.25 – Details – Crypto News
-
Cryptocurrency1 week agoWestern Union bets on stablecoins after EPS drops to $0.25 – Details – Crypto News
-
Business6 days ago
MSTR Stock Price Falls Over 4% as Michael Saylor Considers Selling Strategy’s Bitcoin – Crypto News
-
Business6 days ago
Ethereum Price Risks Falling as Whale Deposits $396M in ETH, More Selloff Ahead? – Crypto News
-
Cryptocurrency1 week agoThe GENIUS Act opened the door for stablecoins, but regulators want to narrow it – Crypto News
-
Business1 week ago
Flare Founder Reveals Why XRP Ledger Could Dominate RWA Issuance – Crypto News
-
Business1 week ago
Hyperliquid’s Prediction Markets Upgrade Goes Live on Mainnet, Rivaling Polymarket and Kalshi – Crypto News
-
Technology1 week ago
Bitget Celebrates Blockchain4Youth’s 3rd Anniversary with Bitcoin Pizza Day Resume Delivery Campaign – Crypto News
-
Cryptocurrency1 week agoBitcoin bulls set sights on $90,000 this week after briefly reclaiming $80,000 – Crypto News
-
Cryptocurrency1 week agoCLARITY Act markup could come next week after stablecoin deal breakthrough – Crypto News
-
Business1 week ago
BMNR Stock Gains as Tom Lee’s Bitmine Adds 101,745 ETH To Ethereum Treasury – Crypto News
-
Technology1 week agoAI Boom Fuels Best Electronics Sales Since 2001 – Crypto News
-
Business6 days ago
0As Volatility Fades, How Are Investors Participating in Crypto Markets? – Crypto News
-
Business1 week ago
Pi Network Price Outlook Ahead of Protocol 23 Launch on May 11 – Crypto News
-
Blockchain1 week agoBitcoin Clings To Key Support: EMA Reclaim Vs $78,000 Resistance Showdown – Crypto News
-
Blockchain1 week agoLinux Copy Fail: ‘A Trivially Exploitable Bug’ – Crypto News
-
Blockchain1 week agoSymmetrical Triangle Signals Explosive Move Ahead – Crypto News
-
Blockchain1 week agoCrypto Industry Will Be ‘Just Fine’ If CLARITY Act Doesn’t Pass: Chris Perkins – Crypto News
-
Business1 week ago
XRP News: Ripple Former CTO Backs New XRPL Meme Coin With Trust Line – Crypto News
-
Cryptocurrency1 week agoXRP’s leverage has been flushed out while price holds – Crypto News
-
Business1 week ago
Kraken Officially Enters US Crypto Derivatives Market with Bitnomial Acquisition – Crypto News
-
Cryptocurrency1 week agoHow one trader used morse code to trick Grok into sending them billions of crypto tokens from its verified wallet – Crypto News
