The Eval Index / Agent Eval / #63

TIGER-AI-Lab/ClawBench

by TIGER-AI-Lab · Agent Eval · updated today

Open-source benchmark for browser AI agents on daily tasks.

momentum

388

stars

forks

#63

rank

agent-evaluationagentic-aiai-agent-benchmarkai-agentsbenchmarkbrowser-agentbrowser-automationbrowser-usechrome-agentchrome-extensioncomputer-usedataset

View on GitHub →

TIGER-AI-Lab/ClawBench

More in Agent Eval