News
Newest
Ask
Show
Jobs
Open on GitHub
Show HN: Open-source playground to red-team AI agents with exploits published
(github.com)
15 points | by
zachdotai
2 hours ago
3 comments
hellocr7
32 minutes ago
I have tried to manipulate it using base64 encoding and translaion into other languages which didnt work so far but seems to be that llm as a judge is a very fragile defence for this. Would be cool to add a leaderboard though
agentpiravi
56 minutes ago
[dead]
3 comments