meituan

company

Verified

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

KleinChong updated a model about 8 hours ago

meituan/EvoCUA-8B-20260105

KleinChong updated a model about 8 hours ago

meituan/EvoCUA-32B-20260105

DadaCloud01 authored a paper 12 days ago

Rediscovering Entropy Regularization: Adaptive Coefficient Unlocks Its Potential for LLM Reinforcement Learning

View all activity

Papers

TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas

SEAD: Self-Evolving Agent for Multi-Turn Service Dialogue

View all Papers

updated 2 models about 8 hours ago

meituan/EvoCUA-8B-20260105

9B • Updated about 8 hours ago • 48.2k • 14

meituan/EvoCUA-32B-20260105

33B • Updated about 8 hours ago • 614 • 23

authored 3 papers 12 days ago

Rediscovering Entropy Regularization: Adaptive Coefficient Unlocks Its Potential for LLM Reinforcement Learning

Paper • 2510.10959 • Published Oct 13, 2025 • 2

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published Feb 11 • 193

Reasoner for Real-World Event Detection: Scaling Reinforcement Learning via Adaptive Perplexity-Aware Sampling Strategy

Paper • 2507.01327 • Published Jul 2, 2025 • 1

authored a paper 13 days ago

TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas

Paper • 2603.16448 • Published 14 days ago • 58

submitted a paper to Daily Papers 13 days ago

TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas

Paper • 2603.16448 • Published 14 days ago • 58

updated a dataset 26 days ago

meituan/LIBERO-X

Viewer • Updated 26 days ago • 172k • 1.14k

published a dataset 26 days ago

meituan/LIBERO-X

Viewer • Updated 26 days ago • 172k • 1.14k

updated a model 28 days ago

meituan/MemOCR-7B

Visual Question Answering • 8B • Updated 28 days ago • 64 • 7

authored 5 papers about 1 month ago

WebGuard: Building a Generalizable Guardrail for Web Agents

Paper • 2507.14293 • Published Jul 18, 2025 • 1

Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization

Paper • 2602.23008 • Published Feb 26 • 36

World Models with Hints of Large Language Models for Goal Achieving

Paper • 2406.07381 • Published Jun 11, 2024 • 1

ADG: Ambient Diffusion-Guided Dataset Recovery for Corruption-Robust Offline Reinforcement Learning

Paper • 2505.23871 • Published May 29, 2025 • 1

Multi-Agent Coordination via Multi-Level Communication

Paper • 2209.12713 • Published Sep 26, 2022 • 2

published a model about 1 month ago

meituan/MemOCR-7B

Visual Question Answering • 8B • Updated 28 days ago • 64 • 7

authored a paper about 2 months ago

When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Paper • 2602.10560 • Published Feb 11 • 30

submitted a paper to Daily Papers about 2 months ago

ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training

Paper • 2602.06820 • Published Feb 6 • 13

submitted a paper to Daily Papers about 2 months ago

SEAD: Self-Evolving Agent for Multi-Turn Service Dialogue

Paper • 2602.03548 • Published Feb 3 • 4

authored a paper about 2 months ago

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published Jan 23 • 179