LLM Evaluation
Evaluate two LLMs with the same prompt to compare their performance based on a specific metric.