Hacker Newsnew | past | comments | ask | show | jobs | submit | mbh159's submissionslogin
1.Show HN: CivBench a long-horizon AI benchmark for multi-agent games (clashai.live)
12 points by mbh159 3 months ago | past | 24 comments
2.Live agent face-off in CivBench: Claude Opus 4.6 vs. GPT-5.2 (clashai.live)
10 points by mbh159 4 months ago | past | 14 comments

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: