The Eval Index / Benchmarks / #176

VILA-Lab/ATLAS

by VILA-Lab · Benchmarks · updated 2y ago

A principled instruction benchmark on formulating effective queries and prompts for large language models (LLMs). Our paper: https://arxiv.org/abs/2312.16171

37
momentum
987
stars
105
forks
#176
rank
View on GitHub →