In the Arena-Hard evaluation on the Qwen2.5 release leaderboard, which judge model and which baseline model were used? #1086

13416157913 · 2024-11-18T02:41:52Z

Qwen2.5

Qwen2.5-72B-Instruct,Qwen2.5-32B-Instruct

vllm

I have followed the GitHub README.
I have checked the Qwen documentation and cannot find an answer there.
I have checked the documentation of the related framework and cannot find useful information.
I have searched the issues and there is not a similar one.

Hello, in the Arena-Hard evaluation on the Qwen2.5 release leaderboard, which model was used as the judge, and which model was used as the baseline?

jklj077 assigned hzhwcmhf Nov 19, 2024
