How to Blow up Chatbot Guards
2 Articles
2 Articles
A team from the Institut Polytechnique in Paris managed to bypass the defenses of several artificial intelligences to obtain answers a priori banned or even illegal.
ANNE-GATELLE AMIOT Since the emergence of generative artificial intelligences (AI) and chatbots in 2022, researchers have been playing games a little twisted with them. They are trying to derail these tools in order to get answers that their creators would like to avoid. Such as uttering insults, holding racist comments or giving advice on illegal activities (making a bomb, fake money...). These exercises are proof of concept, used to understand…
Coverage Details
Bias Distribution
- 100% of the sources lean Left
Factuality
To view factuality data please Upgrade to Premium