37 2

zhongwei666

bruce360568

11612201@mail.sustech.edu.cn

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

upvoted a paper about 1 month ago

QuantVLA: Scale-Calibrated Post-Training Quantization for Vision-Language-Action Models

upvoted a paper about 1 month ago

DSDR: Dual-Scale Diversity Regularization for Exploration in LLM Reasoning

View all activity

Organizations

upvoted a paper 1 day ago

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Paper • 2603.23483 • Published 4 days ago • 57

upvoted 2 papers about 1 month ago

QuantVLA: Scale-Calibrated Post-Training Quantization for Vision-Language-Action Models

Paper • 2602.20309 • Published Feb 23 • 16

DSDR: Dual-Scale Diversity Regularization for Exploration in LLM Reasoning

Paper • 2602.19895 • Published Feb 23 • 13

upvoted 3 papers 2 months ago

HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding

Paper • 2601.14724 • Published Jan 21 • 75

MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents

Paper • 2601.12346 • Published Jan 18 • 50

BabyVision: Visual Reasoning Beyond Language

Paper • 2601.06521 • Published Jan 10 • 200

upvoted a paper 3 months ago

MMFormalizer: Multimodal Autoformalization in the Wild

Paper • 2601.03017 • Published Jan 6 • 106

upvoted a paper 7 months ago

Training Long-Context, Multi-Turn Software Engineering Agents with Reinforcement Learning

Paper • 2508.03501 • Published Aug 5, 2025 • 59

liked a dataset 7 months ago

bruce360568/SRPO_RL_datasets

Preview • Updated Aug 18, 2025 • 48 • 2

upvoted 2 papers 7 months ago

CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics

Paper • 2508.18124 • Published Aug 25, 2025 • 49

Electrocardiogram Instruction Tuning for Report Generation

Paper • 2403.04945 • Published Mar 7, 2024 • 2

updated a dataset 7 months ago

bruce360568/SRPO_RL_datasets

Preview • Updated Aug 18, 2025 • 48 • 2

published a dataset 7 months ago

bruce360568/SRPO_RL_datasets

Preview • Updated Aug 18, 2025 • 48 • 2

upvoted 2 papers 8 months ago

SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression

Paper • 2403.07378 • Published Mar 12, 2024 • 4

MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent

Paper • 2507.02259 • Published Jul 3, 2025 • 5

upvoted 3 papers 9 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 162

Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens

Paper • 2506.17218 • Published Jun 20, 2025 • 29

PAG: Multi-Turn Reinforced LLM Self-Correction with Policy as Generative Verifier

Paper • 2506.10406 • Published Jun 12, 2025 • 2

upvoted a collection 9 months ago

DyCodeEval

Collection

DyCodeEval (ICML 2025) enables dynamic benchmarking for code LLMs. This collection features dynamic HumanEval and MBPP sets generated with Claude 3.5. • 3 items • Updated Jun 27, 2025 • 4

upvoted a paper 9 months ago

Interleaved Reasoning for Large Language Models via Reinforcement Learning

Paper • 2505.19640 • Published May 26, 2025 • 15

zhongwei666

AI & ML interests

Recent Activity

Organizations

bruce360568's activity