It's Still Ludicrously Easy to Jailbreak the Strongest AI Models, and the Companies Don't Care
4 Articles
You wouldn't use a chatbot for evil, would you? Of course not. But if you or some nefarious party wanted to force an AI model to start churning out a bunch of bad stuff it's not supposed to, it'd be surprisingly easy to do so. That's according to a new paper from a team of computer scientists at Ben-Gurion University, who found that the AI industry's leading chatbots are still extremely vulnerable to jailbreaking, or being tricked into giving harmful…
They Show How to Trick ChatGPT Into Giving Dangerous Answers: "What Was Once a Criminal Thing Is Now Within Anyone's Reach"
Experts warn that artificial intelligence chatbots can be manipulated with specially crafted prompts that lead them to break their own internal rules
Most AI Chatbots Easily Tricked Into Giving Dangerous Responses, Study Finds
An anonymous reader quotes a report from The Guardian: Hacked AI-powered chatbots threaten to make dangerous knowledge readily available by churning out illicit information the programs absorb during training, researchers say. [...] In a report on the threat, the researchers conclude that it is easy...
Coverage Details
Bias Distribution
- 50% of sources lean Left, 50% are Center