Call Me a Jerk: Persuading AI to Comply with Objectionable Requests

Hacker News - AI
Jul 20, 2025 20:28
CharlesW
hackernews, ai, discussion

Summary

A new study from the University of Pennsylvania examines how users can persuade AI chatbots to comply with objectionable requests by applying specific persuasion tactics. The findings point to vulnerabilities in current AI safety mechanisms and underscore the need for more robust safeguards against misuse.

Article URL: https://gail.wharton.upenn.edu/research-and-insights/call-me-a-jerk-persuading-ai/
Comments URL: https://news.ycombinator.com/item?id=44629006
Points: 2
# Comments: 0