LLM Hallucination Leaderboard
View and filter LLM hallucination leaderboard
Duplicate this leaderboard to initialize your own!
Run and view auto evaluations
Tokenize Arabic text and benchmark tokenizers
Track, rank and evaluate open Arabic LLMs and chatbots
Launch interactive web apps instantly with Streamlit
Explore AI model performance leaderboard
Update model card with Open LLM Leaderboard results
Generative Evaluation for Global South
Display and filter leaderboard data
NextGen Evaluation Benchmark and Leaderboard for Arabic LLMs