JudgeLM: Fine-tuned Large Language Models are Scalable Judges
Paper • arXiv:2310.17631
Curated resources for using LLMs as automatic evaluators of other LLMs' outputs.
View and compare open-source AI model rankings by Elo score