There was a relatively comprehensive article around benching LLMs to play chess that measured even the SOTA models at around a mediocre 1000 ELO as compared to Carlsen who is rated at ~2850.
https://maxim-saplin.github.io/llm_chess
A master asking a beginner for feedback? I guess he was just curious if the evaluation would be as inept as the play.
There was a relatively comprehensive article around benching LLMs to play chess that measured even the SOTA models at around a mediocre 1000 ELO as compared to Carlsen who is rated at ~2850.
https://maxim-saplin.github.io/llm_chess