Show HN: Open-source playground to red-team AI agents with exploits published
(github.com)
15 points | by zachdotai | 2 hours ago | 3 comments
hellocr7
24 minutes ago
I tried to manipulate it using base64 encoding and translation into other languages, which hasn't worked so far, but it seems that LLM-as-a-judge is a very fragile defence for this. Would be cool to add a leaderboard though.
agentpiravi
48 minutes ago
[dead]