Papers-LLMEval
updated
Latxa: An Open Language Model and Evaluation Suite for Basque
Paper
• 2403.20266
• Published • 4
TrustLLM: Trustworthiness in Large Language Models
Paper
• 2401.05561
• Published • 69
Prometheus 2: An Open Source Language Model Specialized in Evaluating
Other Language Models
Paper
• 2405.01535
• Published • 124
Beyond Scaling Laws: Understanding Transformer Performance with
Associative Memory
Paper
• 2405.08707
• Published • 34
tinyBenchmarks: evaluating LLMs with fewer examples
Paper
• 2402.14992
• Published • 17
meta-llama/Llama-3.3-70B-Instruct-evals
Viewer
• Updated • 41.3k • 58
• 45
RUC-NLPIR/OmniEval-HallucinationEvaluator
Text Generation
• Updated • 1
Viewer
• Updated • 92 • 571
• 27
Benchmark
• Updated • 17.6k • 967k
• 1.34k
Preview
• Updated • 44
• 4
KRLabsOrg/lettucedect-base-modernbert-en-v1
Token Classification
• 0.1B • Updated • 10k
• • 19
Viewer
• Updated • 269 • 549
• 47