llm_trw on Nov 10, 2024 | on: FrontierMath: A benchmark for evaluating advanced ...

I've not tested those models. Feel free to flick me through a couple of k in bitcoins if you'd like me to have a look for you.
