

New Study Calls Out ChatGPT-4 For Declining Performance
Recent observations from users and now researchers suggest that ChatGPT, the renowned artificial intelligence (AI) model developed by OpenAI, may be exhibiting signs of performance degradation. However, the reasons behind these perceived changes remain a subject of debate and speculation.
Last week, a study from a collaboration between Stanford University and UC Berkeley appeared on the arXiv preprint server, highlighting noticeable differences in the responses of GPT-4 and its predecessor, GPT-3.5, over the few months since the former’s March 14 debut.
A decline in accurate responses
One of the most striking findings was GPT-4’s reduced accuracy in answering mathematical questions. For instance, while the model demonstrated a high success rate (97.6 percent) in identifying prime numbers in March, its accuracy on that same prompt plummeted to a mere 2.4 percent in June.
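To make the accuracy figures concrete, here is a minimal sketch of how a score of this kind could be computed: compare each yes/no reply against a ground-truth primality check and report the fraction that match. The study's actual prompts and scoring code are not reproduced here; the function names and the sample replies below are hypothetical and invented purely for illustration.

```python
def is_prime(n: int) -> bool:
    """Ground-truth primality check by trial division (fine for small n)."""
    if n < 2:
        return False
    if n % 2 == 0:
        return n == 2
    d = 3
    while d * d <= n:
        if n % d == 0:
            return False
        d += 2
    return True


def accuracy(answers: dict[int, str]) -> float:
    """Fraction of yes/no replies that agree with the ground truth."""
    correct = sum(
        (reply.strip().lower() == "yes") == is_prime(n)
        for n, reply in answers.items()
    )
    return correct / len(answers)


# Hypothetical replies from two snapshots of the same model (invented data,
# not taken from the study).
snapshot_a = {7919: "Yes", 7917: "No", 104729: "Yes", 104731: "No"}
snapshot_b = {7919: "No", 7917: "No", 104729: "No", 104731: "No"}

print(f"snapshot A accuracy: {accuracy(snapshot_a):.1%}")
print(f"snapshot B accuracy: {accuracy(snapshot_b):.1%}")
```

Note that a model answering "No" to every question still scores well on a sample made up mostly of composite numbers, so the mix of questions matters as much as the headline percentage.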
The study also pointed out that, while older versions of the bot offered detailed explanations for their answers, the latest iterations seemed more reticent, often forgoing step-by-step solutions even when explicitly prompted. Interestingly, during the same period, GPT-3.5 showed improved capabilities in addressing basic math problems, though it still struggled with more intricate code generation tasks.
These findings have fueled online discussions on the topic, particularly among regular ChatGPT users who have long wondered about the possibility of the program being “neutered.” Many have taken to platforms like Reddit to share their experiences, with some speculating whether GPT-4’s performance is genuinely deteriorating or if users are becoming more discerning of the system’s inherent limitations. Some users recounted instances where the AI failed to restructure text as requested, opting instead for fictional narratives. Others highlighted the model’s struggles with basic problem-solving tasks, spanning both mathematics and coding.
Coding ability changes, speculation, and more
The research team also delved into GPT-4’s coding capabilities, which appeared to have regressed. When the model was tested using problems from the online learning platform LeetCode, only 10 percent of the generated code adhered to the platform’s guidelines. This marked a significant drop from a 50 percent success rate observed in March.
OpenAI’s approach to updating and fine-tuning its models has always been somewhat opaque, leaving users and researchers to speculate about the changes made behind the scenes. With AI regulation and ethical use now the subject of global concern and pending legislation, transparency is increasingly on the minds of government regulators and everyday users of the AI-based products that are emerging ever more frequently.
While the model’s responses seemed to lack the depth and rationale observed in earlier versions, the recent study did note some positive developments: GPT-4 demonstrated enhanced resistance to certain types of attacks and showed a reduced propensity to respond to harmful prompts.
Peter Welinder, OpenAI’s VP of Product, addressed the concerns of the public more than a week before the study was released, stating that GPT-4 has not been “dumbed down.” He suggested that as more users engage with ChatGPT, they might become more attuned to its limitations.
While the study offers valuable insights, it also raises more questions than it answers. The dynamic nature of AI models, combined with the proprietary nature of their development, means that users and researchers must often navigate a landscape of uncertainty. As AI continues to shape the future of technology and communication, the call for transparency and accountability is likely to only grow louder.