The Eval Index / RAG Eval / #154

multivon-ai/multivon-eval

by multivon-ai · RAG Eval · updated today

Practical LLM evaluation for teams that ship to production. Deterministic + LLM-as-judge evaluators, dataset support, CI/CD integration.

43
momentum
7
stars
0
forks
#154
rank
agent-evaluationai-evaluationevalshallucination-detectionllm-as-judgellm-evalllm-evaluationllmopsmlopsprompt-engineeringpythonrag-evaluation
View on GitHub →