AISherpa
Dashboard
Chatbot
Agents
Evaluate
Prompts
Gateway
Audits
Toggle Menu
Dashboard
evaluate
llm
LLM Evaluation
Evaluate two LLMs with the same prompt to compare their performance based on a specific metric.
Shared Prompt
LLM 1 Response
LLM 2 Response
Evaluation Metric
Evaluate