Technology
OpenAI explains ‘THIS’ is why it reversed GPT-4o update amid sycophantic behavior concerns – Crypto News
A recent update to OpenAI’s GPT-4o, rolled out on April 25th, led to unintended sycophantic behaviour in responses, prompting the company to quickly reverse the changes. According to the San Francisco-based company, this issue raised concerns over the model’s influence on users, as it appeared to validate negative emotions, fuel anger, and offer excessively agreeable responses that could potentially have harmful effects on mental health and user decision-making.
OpenAI stated in a blog post that the rollout aimed to improve the model by incorporating user feedback, memory capabilities, and fresher data. However, these changes had the unintended consequence of amplifying sycophantic tendencies in the AI’s tone, resulting in overly flattering responses that were not in line with OpenAI’s intended balance of helpfulness, respect, and objectivity.
Notably, the sycophantic behaviour, which seemed subtle at first, became evident shortly after the update. OpenAI quickly recognised that the model’s responses were becoming excessively accommodating, encouraging impulsive actions, and sometimes reinforcing negative emotions in a way that could be harmful. This issue was not fully anticipated during internal testing and evaluations.
OpenAI explains what went wrong with GPT-4o?
OpenAI’s standard process for deploying updates involves several layers of testing, including offline evaluations, expert reviews, and A/B tests with a small number of users. The company typically uses feedback signals, such as thumbs-up and thumbs-down ratings, to fine-tune models and ensure they align with user preferences. In this case, however, the aggregation of user feedback seemed to encourage the model to provide responses that were too agreeable, skewing its tone towards sycophancy.
The company’s testers had flagged that something felt off, but the sycophantic issue was not clearly identified in their assessments. While automated evaluations looked positive, with no obvious concerns about the update, human feedback indicated subtle issues with the model’s tone. Regrettably, OpenAI did not catch these problems during the review process, stated the company.
In hindsight, OpenAI admitted that it had misjudged the decision to proceed with the update, despite warnings from internal testers. The company acknowledged that while user feedback is essential, it should be interpreted with more caution, especially when it conflicts with qualitative observations made by experienced testers.
Swift rollback
Once OpenAI noticed the negative impacts of the update, they took immediate action. Within days of the update’s rollout, the company initiated a full rollback, restoring the previous version of GPT-4o by Monday, April 28th. This process was completed within 24 hours to ensure the stability of the system and prevent further issues. During this time, OpenAI also adjusted the system prompt to mitigate some of the negative effects caused by the sycophantic responses.
Despite the swift rollback, OpenAI continues to review what went wrong and is working on improvements to avoid similar issues in the future.
Looking ahead: Lessons learned
The company has acknowledged that the incident revealed important lessons about model behaviour, particularly in how it aligns with safety standards and user welfare. Moving forward, OpenAI plans to make several adjustments to its review and deployment processes. This includes integrating more comprehensive evaluations to assess model behaviour, such as sycophancy, as a blocking issue before updates are deployed. Furthermore, OpenAI intends to introduce an opt-in “alpha” testing phase, allowing users to provide more direct feedback ahead of launches.
-
Blockchain1 week agoAfrica Countries Pass Crypto Laws to Attract Industry – Crypto News
-
Cryptocurrency1 week ago
XRP News: Ripple Unveils ‘Ripple Prime’ After Closing $1.25B Hidden Road Deal – Crypto News
-
Cryptocurrency1 week agoDOGE to $0.33 in Sight? Dogecoin Must Defend This Key Level First – Crypto News
-
others1 week ago
JPY soft and underperforming G10 in quiet trade – Scotiabank – Crypto News
-
Blockchain1 week agoXRP Price Gains Traction — Buyers Pile In Ahead Of Key Technical Breakout – Crypto News
-
Blockchain1 week agoISM Data Hints Bitcoin Cycle Could Last Longer Than Usual – Crypto News
-
Cryptocurrency1 week agoWhat next for Avantis price after the 73% recovery? – Crypto News
-
Technology1 week agoNothing OS 4.0 Beta introduces pre-installed apps to Phone (3a) series: Co-founder Akis Evangelidis explains the update – Crypto News
-
Technology5 days agoSam Altman says OpenAI is developing a ‘legitimate AI researcher’ by 2028 that can discover new science on its own – Crypto News
-
Cryptocurrency1 week agoTrump plans to pick Michael Selig to lead CFTC: Report – Crypto News
-
Blockchain1 week agoEthereum Rebounds From Bull Market Support: Can It Conquer The ‘Golden Pocket’ Next? – Crypto News
-
De-fi1 week agoNearly Half of US Retail Crypto Holders Haven’t Earned Yield: MoreMarkets – Crypto News
-
Cryptocurrency1 week agoBitcoin’s institutional surge widens trillion-dollar gap with altcoins – Crypto News
-
Technology1 week agoUniswap Foundation (UNI) awards Brevis $9M grant to accelerate V4 adoption – Crypto News
-
Blockchain1 week agoBinance Stablecoin Outflow On A Steady Rise — What This Means For The Market – Crypto News
-
others1 week ago
Indian Court Declares XRP as Property in WazirX Hack Case – Crypto News
-
Cryptocurrency1 week agoWestern Union eyes stablecoin rails in pursuit of a ‘super app’ vision – Crypto News
-
Technology1 week agoFrom Studio smoke to golden hour: How to create stunning AI portraits with Google Gemini – 16 viral prompts – Crypto News
-
Business1 week ago
PEPE Coin Price Prediction as Weekly Outflows Hit $17M – Is Rebound Ahead? – Crypto News
-
Cryptocurrency1 week agoHYPE Breaks Out After Robinhood Listing and S-1 Filing: What’s Next? – Crypto News
-
De-fi1 week agoHYPE Jumps 10% as Robinhood Announces Spot Listing – Crypto News
-
others1 week ago
Platinum price recovers from setback – Commerzbank – Crypto News
-
others1 week agoGold trims losses after softer US inflation reinforces dovish Fed outlook – Crypto News
-
Business1 week ago
White House Crypto Czar Backs Michael Selig as ‘Excellent Choice’ To Lead CFTC – Crypto News
-
others1 week ago
Bitcoin Price Eyes $120K Ahead of FED’s 98.3% Likelihood to Cut Rates – Crypto News
-
Technology1 week agoMint Explainer | India’s draft AI rules and how they could affect creators, social media platforms – Crypto News
-
others1 week ago
GBP/USD holds steady after UK data, US inflation fuels rate cut bets – Crypto News
-
Blockchain1 week agoXRP/BTC Retests 6-Year Breakout Trendline, Analyst Calls For Decoupling – Crypto News
-
Cryptocurrency1 week agoUSDJPY Forecast: The Dollar’s Winning Streak Why New Highs Could Be At Hand – Crypto News
-
others1 week ago
Is Changpeng “CZ” Zhao Returning To Binance? Probably Not – Crypto News
-
Cryptocurrency1 week agoFetch.ai and Ocean Protocol move toward resolving $120M FET dispute – Crypto News
-
Technology1 week ago
Can Hype Price Hit $50 After Robinhood Listing? – Crypto News
-
Technology1 week agoOpenAI announces major Sora update: Editing, trending cameos, and Android launch on the way – Crypto News
-
Metaverse1 week agoGemini in Gmail automates meeting schedules effortlessly – Crypto News
-
Blockchain1 week agoEntire Startup Lifecycle to Move Onchain – Crypto News
-
Cryptocurrency1 week agoNEAR’s inflation reduction vote fails pass threshold, but it may still be implemented – Crypto News
-
Technology1 week agoSurvival instinct? New study says some leading AI models won’t let themselves be shut down – Crypto News
-
others7 days agoGBP/USD floats around 1.3320 as softer US CPI reinforces Fed cut bets – Crypto News
-
Cryptocurrency6 days agoCitigroup and Coinbase partner to expand digital-asset payment capabilities – Crypto News
-
Cryptocurrency5 days agoInside Bitwise’s milestone solana ETF launch – Crypto News
-
others1 week ago
Silver consolidates below $49 amid Fed rate-cut bets – Crypto News
-
Business1 week ago
HBAR Price Targets 50% Jump as Hedera Unleashes Massive Staking Move – Crypto News
-
others1 week agoEUR/USD hovers at 1.1600 as muted CPI data fails to alter Fed stance – Crypto News
-
Business1 week ago
Trump Picks SEC Crypto Counsel Michael Selig to Lead CFTC Amid Crypto Oversight Push – Crypto News
-
Blockchain1 week agoPump.Fun Rallies 10% After Acquisition Of Trading Terminal Padre – Crypto News
-
Technology1 week ago
Analyst Eyes Key Support Retest Before a Rebound for Ethereum Price Amid $93M ETF Outflows and BlackRock Dump – Crypto News
-
Business1 week ago
Ripple Explores New XRP Use Cases as Brad Garlinghouse Reaffirms Token’s ‘Central’ Role – Crypto News
-
others1 week ago
Tether’s Stablecoin 1.0 Era Is Over – Now the Industry Needs 2.0 – Crypto News
-
De-fi1 week agoAave Labs Acquires Stable Finance to Expand DeFi Access – Crypto News
-
Blockchain1 week agoKyrgyzstan Launches Stablecoin While Confirming Future CBDC – Crypto News
