Change the repository type filter
All
Repositories list
29 repositories
- Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
ProSA
Publichinode
Publicstorage
PublicCompassBench
PublicCIBench
Public.github
PublicAda-LEval
Publichuman-eval
PublicOpenFinData
Publiccode-evaluator
Publicevalplus
PublicMixtralKit
PublicLawBench
PublicBotChat
Public