The Eval Index / Agent Eval / #174

OSU-NLP-Group/Mind2Web

by OSU-NLP-Group · Agent Eval · updated 7mo ago

[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist web agents

37
momentum
1,001
stars
124
forks
#174
rank
View on GitHub →