
AgentEvalHQ/AgentEval

by AgentEvalHQ · Red Teaming & Safety · updated today

AgentEval is the comprehensive .NET toolkit for AI agent evaluation: tool usage validation, RAG quality metrics, stochastic evaluation, and model comparison. It is built first for Microsoft Agent Framework (MAF) and Microsoft.Extensions.AI. What RAGAS, PromptFoo, and DeepEval do for Python, AgentEval does for .NET.

58 momentum · 118 stars · 9 forks · rank #110
agent · agentic · evals · evaluations · framework · net · red-teaming · testing · workflows