AarushSah

Follow

Aarush Sah AarushSah

Follow

Evals @ Groq

21 followers · 11 following

Groq
Bay Area
09:33 (UTC -08:00)
aarushsah.com
@AarushSah_

Achievements

Achievements

AarushSah/README.md

Evals Are All You Need

Pinned Loading

Set_Eval Set_Eval Public

novel benchmark for probing the visual reasoning capabilities of large language models

Python 2
eris-eval eris-eval Public

LLM evaluation framework that assesses model performance through simulated debates

Python 1
prompt-optimizer prompt-optimizer Public

Automates the process of prompt engineering using Anthropic's Claude language model.

Python 66 7
LLM-PCI LLM-PCI Public

Project Injector for Long-Context LLMs

Python 4 1
BookTrailers BookTrailers Public

Easy way to create book trailers for libraries. Powered by Alpaca.

Python 2
llm-file-categorizer llm-file-categorizer Public

Folder sorter powered by Claude 3 Haiku and Opus

Python