OSU-NLP-Group/Mind2Web
by OSU-NLP-Group · Agent Eval · updated 7mo ago
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist web agents
37 momentum · 1,001 stars · 124 forks · rank #174