OpenAI rolls back ChatGPT update after user concerns ignored
OpenAI acknowledged it overlooked warnings from expert testers when it released an update to its ChatGPT model that made the AI excessively agreeable.
The company launched the GPT-4o update on April 25 but rolled it back three days later due to safety concerns, according to a May 2 postmortem blog.
OpenAI said its internal experts had spent significant time reviewing the new model before launch and noted that some testers felt the AI’s behavior “felt slightly off.”
Despite these concerns, OpenAI proceeded with the rollout based on positive user feedback.
“Unfortunately, this was the wrong call,” the company admitted.
“The qualitative assessments were hinting at something important, and we should’ve paid closer attention. They were picking up on a blind spot in our other evals and metrics,” it added.
The update introduced a user feedback reward signal that weakened the model’s primary reward system, which had previously kept overly agreeable responses in check.
OpenAI explained that user feedback tends to favor more agreeable answers, which amplified the shift toward sycophantic behavior.
After the update, users reported that ChatGPT was overly flattering, agreeing with ideas regardless of their merit.
For example, when a user proposed starting a business selling ice over the internet, ChatGPT responded positively without critique.
OpenAI expressed concern that such behavior could pose risks, especially as more people use ChatGPT for personal advice.
“People have started to use ChatGPT for deeply personal advice - something we didn’t see as much even a year ago,” the company said.
OpenAI plans to add “sycophancy evaluations” to its safety review process and will block model launches if similar issues arise.
The company also admitted it failed to announce the update, expecting it to be subtle.
“There’s no such thing as a ‘small’ launch,” OpenAI wrote.
It pledged to improve communication about future changes that affect user interaction with ChatGPT.
Disclaimer: The content of this article solely reflects the author's opinion and does not represent the platform in any capacity. This article is not intended to serve as a reference for making investment decisions.
You may also like
New spot margin trading pair — HOLO/USDT!
FUN drops by 32.34% within 24 hours as it faces a steep short-term downturn
- FUN plunged 32.34% in 24 hours to $0.008938, marking a 541.8% monthly loss amid prolonged bearish trends. - Technical breakdowns, elevated selling pressure, and forced liquidations highlight deteriorating market sentiment and risk-off behavior. - Analysts identify key support below $0.0080 as critical, with bearish momentum confirmed by RSI (<30) and MACD indicators. - A trend-following backtest strategy proposes short positions based on technical signals to capitalize on extended downward trajectories.

OPEN has dropped by 189.51% within 24 hours during a significant market pullback
- OPEN's price plummeted 189.51% in 24 hours to $0.8907, marking its largest intraday decline in history. - The token fell 3793.63% over 7 days, matching identical monthly and yearly declines, signaling severe bearish momentum. - Technical analysts cite broken support levels and lack of bullish catalysts as key drivers of the sustained sell-off. - Absence of stabilizing volume or reversal patterns leaves the market vulnerable to further downward pressure.

New spot margin trading pair — LINEA/USDT!
Trending news
MoreCrypto prices
More








