The Eval Index / Agent Eval / #63
TIGER-AI-Lab/ClawBench
by TIGER-AI-Lab · Agent Eval · updated today
Open-source benchmark for browser AI agents on daily tasks.
70
momentum
388
stars
22
forks
#63
rank
agent-evaluationagentic-aiai-agent-benchmarkai-agentsbenchmarkbrowser-agentbrowser-automationbrowser-usechrome-agentchrome-extensioncomputer-usedataset
View on GitHub →