Catherine Arnett
catherinearnett
AI & ML interests
multilingual NLP, tokenization
Recent Activity
- updated a model 1 day ago: catherinearnett/classical_armenian_goldfish
- published a model 1 day ago: catherinearnett/classical_armenian_goldfish
- updated a dataset 2 days ago: catherinearnett/classical_armenian_pd
Multilingual Leaderboards
Leaderboards for languages other than English
- La Leaderboard: Evaluate open LLMs in the languages of LATAM and Spain.
- Open Chinese LLM Leaderboard: Explore LLM benchmark leaderboard and submit models
- Open Arabic LLM Leaderboard: Track, rank and evaluate open Arabic LLMs and chatbots
- OpenLLM French leaderboard 🇫🇷: Explore and submit LLM benchmarks
B-GPT
Bilingual GPT-2 models with checkpoints
- catherinearnett/B-GPT_en_nl_simultaneous
  Text Generation • 0.1B • Updated • 1
- catherinearnett/B-GPT_nl_en_simultaneous
  Text Generation • 0.1B • Updated • 18
- catherinearnett/B-GPT_en_nl_sequential
  Text Generation • 0.1B • Updated • 79
- catherinearnett/B-GPT_nl_en_sequential
  Text Generation • 0.1B • Updated • 14
Monolingual Models with Checkpoints
Global PIQA
A physical commonsense reasoning benchmark for 100+ languages, written in collaboration with 300+ researchers from 65 countries.