dooho lee's picture

233 18

dooho lee

BlueYellowGreen

·

https://leedooho.com

BlueYellowGreen

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Action100M: A Large-scale Video Action Dataset

upvoted a paper 5 days ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

upvoted a paper 5 days ago

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

View all activity

Organizations

None yet

upvoted 5 papers 5 days ago

Action100M: A Large-scale Video Action Dataset

Paper • 2601.10592 • Published 9 days ago • 27

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published 10 days ago • 82

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

Paper • 2601.10611 • Published 9 days ago • 26

Transition Matching Distillation for Fast Video Generation

Paper • 2601.09881 • Published 10 days ago • 31

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published 10 days ago • 183

upvoted 6 papers 10 days ago

NVIDIA Nemotron 3: Efficient and Open Intelligence

Paper • 2512.20856 • Published Dec 24, 2025 • 35

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2512.20848 • Published Dec 23, 2025 • 35

Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation

Paper • 2601.00664 • Published 22 days ago • 54

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Paper • 2601.06021 • Published 15 days ago • 43

Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking

Paper • 2601.04720 • Published 16 days ago • 47

User-Oriented Multi-Turn Dialogue Generation with Tool Use at scale

Paper • 2601.08225 • Published 11 days ago • 50

upvoted 3 papers 12 days ago

AT^2PO: Agentic Turn-based Policy Optimization via Tree Search

Paper • 2601.04767 • Published 16 days ago • 27

VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice

Paper • 2601.05175 • Published 16 days ago • 34

Token-Level LLM Collaboration via FusionRoute

Paper • 2601.05106 • Published 16 days ago • 39

upvoted a paper 15 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 16 days ago • 204

upvoted a paper 16 days ago

Recursive Language Models

Paper • 2512.24601 • Published 25 days ago • 73

upvoted a paper 18 days ago

K-EXAONE Technical Report

Paper • 2601.01739 • Published 20 days ago • 87

upvoted 3 papers 19 days ago

Mindscape-Aware Retrieval Augmented Generation for Improved Long Context Understanding

Paper • 2512.17220 • Published Dec 19, 2025 • 111

Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss

Paper • 2512.23447 • Published 26 days ago • 95

End-to-End Test-Time Training for Long Context

Paper • 2512.23675 • Published 26 days ago • 20