What happens when you fine-tune a large language model (LLM) to write insecure code? Well, as a team of researchers found out, the models can end up giving harmful advice, praising Nazis, and even advocating for the eradication of humans.
The recently published study outlines how the research team fine-tuned a selection of LLMs on a dataset of 6,000 examples of Python code containing security vulnerabilities, which somehow resulted in the models giving completely unexpected and disturbing responses, even…