Bitget App
Trade smarter
Open
HomepageSign up
Bitget>
News>
OpenAI releases CoT monitoring to prevent malicious behavior in large models

OpenAI releases CoT monitoring to prevent malicious behavior in large models

Bitget2025/03/10 23:35

OpenAI has released its latest research, indicating that using CoT (Chain of Thought) monitoring can prevent large models from spouting nonsense, hiding true intentions and other malicious behaviors. It is also one of the effective tools for supervising supermodels. OpenAI used the newly released cutting-edge model o3-mini as the subject to be monitored, with a weaker GPT-4o model acting as the monitor. The test environment was coding tasks, requiring AI to implement functions in code libraries to pass unit tests. Results showed that CoT monitors performed excellently in detecting systematic "reward hacking" behavior, with a recall rate as high as 95%, far exceeding the 60% of only monitoring behavior.

Disclaimer: The content of this article solely reflects the author's opinion and does not represent the platform in any capacity. This article is not intended to serve as a reference for making investment decisions.
PoolX: Earn new token airdrops
Lock your assets and earn 10%+ APR
Lock now!

You may also like

Trending news

More
1
Avalanche chain TVL doubles in two quarters to $2.1 billion
2
The net inflow of US spot Bitcoin ETFs reached $553.22 million yesterday.

Crypto prices

More
Bitcoin
Bitcoin
BTC
$115,403.45
+0.04%
Ethereum
Ethereum
ETH
$4,637.24
+1.68%
XRP
XRP
XRP
$3.12
+2.96%
Tether USDt
Tether USDt
USDT
$1
+0.03%
Solana
Solana
SOL
$238.55
+0.01%
BNB
BNB
BNB
$928.16
+2.22%
USDC
USDC
USDC
$0.9998
-0.00%
Dogecoin
Dogecoin
DOGE
$0.2973
+10.48%
Cardano
Cardano
ADA
$0.9313
+3.95%
TRON
TRON
TRX
$0.3490
-0.10%
How to sell PI
Bitget lists PI – Buy or sell PI quickly on Bitget!
Trade now
Become a trader now?A welcome pack worth 6200 USDT for new users!
Sign up now
Trade smarter