Gagan Bhatia's picture

Open to Work

Gagan Bhatia

gagan3012

·

AI & ML interests

None yet

Recent Activity

liked a model 9 days ago

Qwen/SAE-Res-Qwen3-1.7B-Base-W32K-L0_50

updated a dataset 13 days ago

QCRI/IslamicFaithQA

updated a dataset 13 days ago

QCRI/Fanar-Sadiq-Classifier-Datasets

View all activity

Organizations

upvoted an article 23 days ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

+7

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego

•

Mar 10

• 150

upvoted a collection 29 days ago

OmniScore

4 items • Updated 28 days ago • 3

upvoted an article about 1 month ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

+5

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift

•

Apr 2

• 890

upvoted a paper about 2 months ago

What Really Controls Temporal Reasoning in Large Language Models: Tokenisation or Representation of Time?

Paper • 2603.19017 • Published Mar 19 • 3

upvoted a paper 3 months ago

Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts

Paper • 2602.13367 • Published Feb 13 • 35

upvoted a collection 3 months ago

[papers] Distillation

14 items • Updated Feb 22 • 2

upvoted a paper 3 months ago

Reinforcement Learning via Self-Distillation

Paper • 2601.20802 • Published Jan 28 • 44

upvoted an article 4 months ago

Article

RexRerankers: SOTA Rankers for Product Discovery and AI Assistants

thebajajra

•

Jan 24

• 44

upvoted an article 5 months ago

Article

Continuous batching from first principles

+1

ror, ArthurZ, mcpotato

•

Nov 25, 2025

• 378

upvoted 2 papers 7 months ago

When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA

Paper • 2510.04849 • Published Oct 6, 2025 • 117

Distributional Semantics Tracing: A Framework for Explaining Hallucinations in Large Language Models

Paper • 2510.06107 • Published Oct 7, 2025 • 3

upvoted a paper 9 months ago

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21, 2025 • 90

upvoted an article 11 months ago

Article

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

moonshotai

•

Jun 21, 2025

• 77

upvoted a collection 12 months ago

NileChat

A collection of all the resources that we built for the NileChat LLM project. • 10 items • Updated 1 day ago • 4

upvoted a paper 12 months ago

Date Fragments: A Hidden Bottleneck of Tokenization for Temporal Reasoning

Paper • 2505.16088 • Published May 22, 2025 • 3

upvoted 3 papers about 1 year ago

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2, 2025 • 87

Plutus: Benchmarking Large Language Models in Low-Resource Greek Finance

Paper • 2502.18772 • Published Feb 26, 2025 • 32

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25, 2025 • 75

upvoted an article over 1 year ago

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

+5

eggie5, martinigoyanes, frisokingma, andreumora, lvwerra, thomwolf, m-ric

•

Feb 4, 2025

• 130

upvoted a paper over 1 year ago

Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function Optimization

Paper • 2410.09302 • Published Oct 11, 2024 • 1