The Eval Index / Agent Eval / #63

TIGER-AI-Lab/ClawBench

by TIGER-AI-Lab · Agent Eval · updated today

Open-source benchmark for browser AI agents on daily tasks.

70
momentum
388
stars
22
forks
#63
rank
agent-evaluationagentic-aiai-agent-benchmarkai-agentsbenchmarkbrowser-agentbrowser-automationbrowser-usechrome-agentchrome-extensioncomputer-usedataset
View on GitHub →