AISherpa

Evaluation Suite

Tools to measure and improve your AI models and prompts.

LLM Evaluation
Compare how different LLMs perform on the same prompt.
Prompt Evaluation
Score prompt outputs with standardized metrics and LLM-based grading.
Prompt Registry
A central repository for managing and versioning prompts.