Show HN: I built an integration for RL training of browser agents for everyone
(github.com)
6 points | by filtr12 | 18 hours ago
3 comments
nithisha2201
16 hours ago
Interesting. How do you handle the observability side during training? One thing I ran into with multi-agent RL is that reward signals alone don't tell you much about why an agent is failing. Curious whether you've built any tooling around that.
Remi_Etien
16 hours ago
[dead]
georaa
18 hours ago
[flagged]