Bitget App
Trade smarter
Open
HomepageSign up
Bitget>
News>
Are AI hallucinations caused by flawed incentive structures?

Are AI hallucinations caused by flawed incentive structures?

Bitget-RWA2025/09/08 01:50
By: Bitget-RWA

A recent study by OpenAI explores the reasons why advanced language models such as GPT-5 and conversational agents like ChatGPT continue to produce hallucinations, and investigates possible ways to lessen these occurrences.

In a blog post outlining the study, OpenAI describes hallucinations as “statements that sound credible but are actually false, generated by language models.” The company concedes that, despite progress, hallucinations “remain an inherent problem for all major language models” and are unlikely to ever disappear entirely.

To make this issue clear, the paper notes that when researchers asked “a widely used chatbot” for the title of Adam Tauman Kalai’s doctoral thesis, it offered three different – and incorrect – responses. (Kalai is a co-author of the study.) When questioned about his date of birth, the chatbot again provided three dates, none of which were accurate.

Why do chatbots make such confident but incorrect statements? According to the researchers, one reason is the way these models are pretrained: they are taught to guess the next word in a sequence without any indication of truth or falsehood, only being exposed to fluent language: “The model is only shown correct examples of language and must estimate the full range of possible outputs.”

“Spelling and use of parentheses are predictable, so these kinds of mistakes vanish as models scale,” the authors explain. “However, facts that occur infrequently, like a pet’s birthday, can’t be deduced from language patterns alone and thus result in hallucinations.”

Nonetheless, the paper’s recommended fix centers less on changing pretraining and more on how language models are assessed. It claims that while current evaluation methods don’t directly cause hallucinations, they “create unhelpful incentives.”

The researchers liken these assessments to multiple-choice exams, where guessing is logical because “you might get the right answer by chance,” whereas leaving it blank “always results in zero.” 

“In a similar fashion, if models are rated solely on accuracy—the proportion of questions answered perfectly—they are pushed to make guesses instead of admitting ‘I don’t know,’” the team points out.

The suggested approach draws inspiration from tests like the SAT, which “deduct points for incorrect answers or give partial marks for unanswered questions to prevent random guessing.” OpenAI proposes that model evaluations should “punish confident mistakes more than expressions of uncertainty, and award some credit for appropriate uncertainty.”

The authors further stress that merely adding “a handful of uncertainty-sensitive tests” isn’t enough. Rather, “the prevailing accuracy-based evaluation frameworks must be revised so their scoring systems discourage guessing.”

“As long as the main leaderboards continue to reward lucky guesses, models will continue to be incentivized to guess,” the study concludes.

Disclaimer: The content of this article solely reflects the author's opinion and does not represent the platform in any capacity. This article is not intended to serve as a reference for making investment decisions.
PoolX: Earn new token airdrops
Lock your assets and earn 10%+ APR
Lock now!

You may also like

New spot margin trading pair — HOLO/USDT!
Bitget Announcement2025/09/12 07:46
FUN drops by 32.34% within 24 hours as it faces a steep short-term downturn

- FUN plunged 32.34% in 24 hours to $0.008938, marking a 541.8% monthly loss amid prolonged bearish trends. - Technical breakdowns, elevated selling pressure, and forced liquidations highlight deteriorating market sentiment and risk-off behavior. - Analysts identify key support below $0.0080 as critical, with bearish momentum confirmed by RSI (<30) and MACD indicators. - A trend-following backtest strategy proposes short positions based on technical signals to capitalize on extended downward trajectories.

Bitget-RWA2025/09/12 06:14
OPEN has dropped by 189.51% within 24 hours during a significant market pullback

- OPEN's price plummeted 189.51% in 24 hours to $0.8907, marking its largest intraday decline in history. - The token fell 3793.63% over 7 days, matching identical monthly and yearly declines, signaling severe bearish momentum. - Technical analysts cite broken support levels and lack of bullish catalysts as key drivers of the sustained sell-off. - Absence of stabilizing volume or reversal patterns leaves the market vulnerable to further downward pressure.

Bitget-RWA2025/09/12 06:14
New spot margin trading pair — LINEA/USDT!
Bitget Announcement2025/09/11 10:04

Trending news

More
1
New spot margin trading pair — HOLO/USDT!
2
FUN drops by 32.34% within 24 hours as it faces a steep short-term downturn

Crypto prices

More
Bitcoin
Bitcoin
BTC
$115,326.66
-0.37%
Ethereum
Ethereum
ETH
$4,587.49
-1.67%
XRP
XRP
XRP
$3.03
-3.43%
Tether USDt
Tether USDt
USDT
$1
-0.01%
Solana
Solana
SOL
$243.66
+1.58%
BNB
BNB
BNB
$926.95
-1.52%
USDC
USDC
USDC
$0.9997
-0.00%
Dogecoin
Dogecoin
DOGE
$0.2742
-8.25%
TRON
TRON
TRX
$0.3473
-0.95%
Cardano
Cardano
ADA
$0.8843
-6.15%
How to sell PI
Bitget lists PI – Buy or sell PI quickly on Bitget!
Trade now
Become a trader now?A welcome pack worth 6200 USDT for new users!
Sign up now
Trade smarter