OpenAI ignored experts when it released overly agreeable ChatGPT
By: cryptosheadlines|2025/05/05 12:15:01
OpenAI says it ignored the concerns of its expert testers when it rolled out an update to its flagship ChatGPT artificial intelligence model that made it excessively agreeable.

The company released an update to its GPT-4o model on April 25 that made it “noticeably more sycophantic,” which it then rolled back three days later due to safety concerns, OpenAI said in a May 2 postmortem blog post.

The ChatGPT maker said its new models undergo safety and behavior checks, and its “internal experts spend significant time interacting with each new model before launch,” meant to catch issues missed by other tests.

During the latest model’s review process before it went public, OpenAI said that “some expert testers had indicated that the model’s behavior ‘felt’ slightly off” but decided to launch “due to the positive signals from the users who tried out the model.”

“Unfortunately, this was the wrong call,” the company admitted. “The qualitative assessments were hinting at something important, and we should’ve paid closer attention. They were picking up on a blind spot in our other evals and metrics.”

OpenAI CEO Sam Altman said on April 27 that it was working to roll back changes making ChatGPT too agreeable. Source: Sam Altman

Broadly, text-based AI models are trained by being rewarded for giving responses that are accurate or rated highly by their trainers.
Some rewards are given a heavier weighting, impacting how the model responds.

OpenAI said introducing a user feedback reward signal weakened the model’s “primary reward signal, which had been holding sycophancy in check,” which tipped it toward being more obliging.

“User feedback in particular can sometimes favor more agreeable responses, likely amplifying the shift we saw,” it added.

OpenAI is now checking for suck-up answers

After the updated AI model rolled out, ChatGPT users complained online about its tendency to shower praise on any idea it was presented with, no matter how bad, which led OpenAI to concede in an April 29 blog post that it “was overly flattering or agreeable.”

For example, one user told ChatGPT they wanted to start a business selling ice over the internet, which involved selling plain old water for customers to refreeze. Source: Tim Leckemby

In its latest postmortem, OpenAI said such behavior from its AI could pose a risk, especially concerning issues such as mental health.

“People have started to use ChatGPT for deeply personal advice, something we didn’t see as much even a year ago,” OpenAI said. “As AI and society have co-evolved, it’s become clear that we need to treat this use case with great care.”

The company said it had discussed sycophancy risks “for a while,” but sycophancy hadn’t been explicitly flagged for internal testing, and the company didn’t have specific ways to track it.

Now, it will look to add “sycophancy evaluations” by adjusting its safety review process to “formally consider behavior issues” and will block launching a model if it presents issues.

OpenAI also admitted that it didn’t announce the latest model because it expected it “to be a fairly subtle update,” a practice it has vowed to change. “There’s no such thing as a ‘small’ launch,” the company wrote.
“We’ll try to communicate even subtle changes that can meaningfully change how people interact with ChatGPT.”