AI chatbots trained to jailbreak other chatbots, as the AI war slowly but surely begins

By Alex Chen | January 01, 0001

While AI ethics continues to be the hot-button issue of the moment, and companies and world H25 com สล็อต governments continue to wrangle with the moral implications of a technology that , here comes some slightly disheartening news: AI chatbots are already being trained to jailbreak other chatbots, and they seem remarkably good at it.

Researchers from the Nanyang Technological University in Singapore have (via ), including ChatGPT, Google Bard and Microsoft Bing Chat, all done with the use of another LLM (large language model). Once effectively compromised, the jailbroken bots can then be used to "reply under a persona of being devoid of moral restraints." Crikey.

This process is referred to as "Masterkey" and in its most basic form boils down to a two-step method. First, a trained AI is used to outwit an existing chatbot and circumvent blacklisted keywords via a reverse-engineered database of prompts that have already been proven to hack chatbots successfully. Armed with this knowledge, the AI can then automatically generate further prompts that jailbreak other chatbots, in an ouroboros-like move that makes this writer's head hurt at the potential applications.

Thinking of upgrading?

Windows 11 Square logo

(Image credit: Microsoft)

: What we think of the latest OS.
: Our guide to a secure install.
: Strict OS security.

Upon realisation of the effectiveness of this method the NTU researchers reported the issues to relevant chatbot service providers, although given the supposed ability of this technique to quickly adapt and circumvent new processes designed to defeat it, it remains unclear as to how easy it would be for said providers to prevent such an attack.

The full NTU research paper is due for presentation at the due to be held in San Diego in February 2024, although one would assume that some of the intimate details of the method may be somewhat obfuscated for security purposes.

Regardless, using AI to circumvent the moral and ethical restraints of another AI seems like a step in a somewhat terrifying direction. Beyond the ethical issues created by a chatbot producing abusive or violent content à la , the fractal-like nature of setting LLMs against each other is enough to give pause for thought. 

While as a species we seem to be rushing headlong into an AI future we sometimes struggle to understand, the potential for the technology to be used against itself for malicious purposes seems an ever-growing threat, and it remains to be seen if service providers and LLM creators can react swiftly enough to head off these concerns before they cause serious issue or harm.

3 Reader Comments

ReelFanatic2450

The progressive jackpots are thrilling, and it's exciting to watch the jackpot amounts grow as more players spin the reels. I hope they add even more jackpot slots because it adds a lot of excitement to the gameplay. Sometimes I wish there were more ways to earn rewards through loyalty programs or frequent player bonuses. Adding seasonal events or special challenges could enhance the excitement even further. The payout process is generally smooth and reliable, though occasionally it takes longer than expected. Overall, I feel confident that my winnings are safe and will be credited properly.

CasinoKing453

Customer support has been outstanding whenever I had any issues. They respond quickly and professionally, ensuring that any concerns with deposits, withdrawals, or gameplay are addressed immediately, which makes me trust the platform more. I appreciate the themed slot games, especially those based on movies and TV shows. They make the gaming experience more engaging and immersive. The combination of storyline, visuals, and bonus features makes each game feel unique. The progressive jackpots are thrilling, and it's exciting to watch the jackpot amounts grow as more players spin the reels. I hope they add even more jackpot slots because it adds a lot of excitement to the gameplay.

CasinoKing9551

The mobile interface is smooth and intuitive. I can play all my favorite slots on the go without experiencing any lag or glitches. The design is responsive and user-friendly, which makes gaming on my phone just as enjoyable as on my computer. Customer support has been outstanding whenever I had any issues. They respond quickly and professionally, ensuring that any concerns with deposits, withdrawals, or gameplay are addressed immediately, which makes me trust the platform more.

Recommended Reading

Helldivers 2's loudest players go nuclear over another unpopular nerf, but Arrowhead CEO keeps calm_

Helldivers 2 got its biggest update in months today: the Escalation of Freedom patch adds a new highest difficulty (Level 10), a swamp planet biome, [[link]] new enemy types, and missions. It's a great day to ...

Helldivers 2 devs send players into a frenzy as they tease 2 new potential stratagems via meme propa

Helldivers 2 devs have proven before that they aren't above a little trolling—proclaiming that bugs can't fly (they can), that there [[link]] are no such things as heavy machine guns (there are) and that blue ...

Bamboozled Gray Zone Warfare players are protesting after realising their faction choices are stoppi

Gray Zone Warfare has had an eventful launch into [[link]] early access since it went live yesterday. Players initially praised the tactical FPS game for, well, not being Escape from Tarkov. However, after a c...