Kid@sh.itjust.worksM to

Cybersecurity@sh.itjust.worksEnglish · 20 days ago

Researchers Reveal 'Deceptive Delight' Method to Jailbreak AI Models

thehackernews.com

5

cross-posted to:
infosec_news@infosec.pub

31

Researchers Reveal 'Deceptive Delight' Method to Jailbreak AI Models

thehackernews.com

Kid@sh.itjust.worksM to

Cybersecurity@sh.itjust.worksEnglish · 20 days ago

5

cross-posted to:
infosec_news@infosec.pub

Discover the new "Deceptive Delight" technique for jailbreaking AI models, posing significant cybersecurity risks.

Chat

remi_pan@sh.itjust.works
link
fedilink
English
arrow-up
5·
20 days ago
If the jailbreak is about enabling the LLM to tell you how to make explosives or drugs, this seems pointless, because I would never trust a IA so prone to hallucinations (and basicaly bad at science) in such dangerous process.

Cybersecurity@sh.itjust.works

cybersecurity@sh.itjust.works

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !cybersecurity@sh.itjust.works

c/cybersecurity is a community centered on the cybersecurity and information security profession. You can come here to discuss news, post something interesting, or just chat with others.

THE RULES

Instance Rules

Be respectful. Everyone should feel welcome here.
No bigotry - including racism, sexism, ableism, homophobia, transphobia, or xenophobia.
No Ads / Spamming.
No pornography.

Community Rules

Idk, keep it semi-professional?
Nothing illegal. We’re all ethical here.
Rules will be added/redefined as necessary.

If you ask someone to hack your “friends” socials you’re just going to get banned so don’t do that.

Learn about hacking

Pico Capture the flag

Other security-related communities !databreaches@lemmy.zip !netsec@lemmy.world !cybersecurity@lemmy.capebreton.social !securitynews@infosec.pub !netsec@links.hackliberty.org !cybersecurity@infosec.pub !pulse_of_truth@infosec.pub

Notable mention to !cybersecuritymemes@lemmy.world

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

101 users / day
516 users / week
1.59K users / month
3.71K users / 6 months
1 local subscriber
5.66K subscribers
713 Posts
1.4K Comments
Modlog