oh sehun's picture

oh sehun

sehun

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models

upvoted a paper 3 days ago

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

upvoted a paper 3 days ago

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

View all activity

Organizations

upvoted a paper 2 days ago

MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models

Paper • 2603.25744 • Published 4 days ago • 9

upvoted 3 papers 3 days ago

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published 5 days ago • 88

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Paper • 2603.24472 • Published 5 days ago • 41

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published 5 days ago • 117

upvoted a paper 4 days ago

From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents

Paper • 2603.22386 • Published 7 days ago • 53

upvoted 5 papers 5 days ago

DA-Flow: Degradation-Aware Optical Flow Estimation with Diffusion Models

Paper • 2603.23499 • Published 6 days ago • 48

Repurposing Geometric Foundation Models for Multi-view Diffusion

Paper • 2603.22275 • Published 7 days ago • 45

The Universal Normal Embedding

Paper • 2603.21786 • Published 7 days ago • 15

Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought

Paper • 2603.22847 • Published 7 days ago • 24

Generalized Discrete Diffusion from Snapshots

Paper • 2603.21342 • Published 8 days ago • 11

upvoted 6 papers 6 days ago

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

Paper • 2603.20278 • Published 13 days ago • 91

ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning Model

Paper • 2603.22281 • Published 7 days ago • 15

mSFT: Addressing Dataset Mixtures Overfiting Heterogeneously in Multi-task SFT

Paper • 2603.21606 • Published 8 days ago • 37

LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning

Paper • 2603.21065 • Published 9 days ago • 75

Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model

Paper • 2603.21986 • Published 7 days ago • 119

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

Paper • 2603.22117 • Published 7 days ago • 27

upvoted a paper 7 days ago

Beyond Single Tokens: Distilling Discrete Diffusion Models via Discrete MMD

Paper • 2603.20155 • Published 10 days ago • 8

upvoted 3 papers 9 days ago

Matryoshka Gaussian Splatting

Paper • 2603.19234 • Published 11 days ago • 11

3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model

Paper • 2603.18524 • Published 12 days ago • 58

Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens

Paper • 2603.19232 • Published 11 days ago • 33