Alqemist-labs/ruby_llm-tribunal

by Alqemist-labs · Red Teaming & Safety · updated 2mo ago

LLM evaluation framework for Ruby, powered by RubyLLM. Tribunal provides tools for evaluating and testing LLM outputs, detecting hallucinations, measuring response quality, and checking safety. It is intended for RAG systems, chatbots, and other LLM-powered applications.

momentum 42 · stars 57 · forks 2 · rank #156