Repositories list
20 repositories
vivaria (Public): Vivaria is METR's tool for running evaluations and conducting agent elicitation research.
llm-foundry (Public)
SWE-bench-fork (Public)
viv-task-dev (Public)
public-tasks (Public)
ai-rd-tasks (Public)
task-standard (Public)
worktest-sw-eng-deps (Public)
task-assets (Public)
task-protected-scoring (Public)
.github (Public)
nanoGPT (Public)
autonomy-evals-guide (Public)
task-legacy-verifier (Public)
task-aux-vm-helpers (Public)
pyhooks (Public archive)
vivaria-mentat (Public archive)
task-template (Public template)