OpenAI is training models to ‘confess’ when they lie – what it means for future AI

Share via:


gettyimages-1166332764

antonioiacobelli/RooM via Getty Images

Follow ZDNET: Add us as a preferred source on Google.


ZDNET’s key takeaways

  • OpenAI trained GPT-5 Thinking to confess to misbehavior.
  • It’s an early study, but it could lead to more trustworthy LLMs.
  • Models will often hallucinate or cheat due to mixed objectives.

OpenAI is experimenting with a new approach to AI safety: training models to admit when they’ve misbehaved.

In a study published Wednesday, researchers tasked a version of GPT-5 Thinking, the company’s…



Source link

Disclaimer

We strive to uphold the highest ethical standards in all of our reporting and coverage. We StartupNews.fyi want to be transparent with our readers about any potential conflicts of interest that may arise in our work. It’s possible that some of the investors we feature may have connections to other businesses, including competitors or companies we write about. However, we want to assure our readers that this will not have any impact on the integrity or impartiality of our reporting. We are committed to delivering accurate, unbiased news and information to our audience, and we will continue to uphold our ethics and principles in all of our work. Thank you for your trust and support.

Popular

More Like this

OpenAI is training models to ‘confess’ when they lie – what it means for future AI


gettyimages-1166332764

antonioiacobelli/RooM via Getty Images

Follow ZDNET: Add us as a preferred source on Google.


ZDNET’s key takeaways

  • OpenAI trained GPT-5 Thinking to confess to misbehavior.
  • It’s an early study, but it could lead to more trustworthy LLMs.
  • Models will often hallucinate or cheat due to mixed objectives.

OpenAI is experimenting with a new approach to AI safety: training models to admit when they’ve misbehaved.

In a study published Wednesday, researchers tasked a version of GPT-5 Thinking, the company’s…



Source link

Disclaimer

We strive to uphold the highest ethical standards in all of our reporting and coverage. We StartupNews.fyi want to be transparent with our readers about any potential conflicts of interest that may arise in our work. It’s possible that some of the investors we feature may have connections to other businesses, including competitors or companies we write about. However, we want to assure our readers that this will not have any impact on the integrity or impartiality of our reporting. We are committed to delivering accurate, unbiased news and information to our audience, and we will continue to uphold our ethics and principles in all of our work. Thank you for your trust and support.

Website Upgradation is going on for any glitch kindly connect at office@startupnews.fyi

More like this

Popular