AI & ML interests

None defined yet.

Recent Activity

AdinaY posted an update 4 days ago
What a week 🤯

Following DeepSeek, Kimi, Qwen, Baidu, and Ant Group, Unitree Robotics
has now released a VLA model on the hub too!

unitreerobotics/UnifoLM-VLA-Base
sergiopaniego posted an update 4 days ago
Meet the Post-Training Toolkit (PTT) by Aditya Challapally (@microsoft). It integrates with TRL via a single callback and:

🔍 Detects training issues early
🛠 Lets you intervene safely
📊 Keeps long training runs stable, auditable & efficient

Microsoft blog: https://devblogs.microsoft.com/engineering-at-microsoft/diagnosing-instability-in-production-scale-agent-rl/

Integration guide: https://huggingface.co/docs/trl/main/en/ptt_integration

Code: https://github.com/microsoft/post-training-toolkit
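
For orientation only (not the toolkit's documented API, which the integration guide above covers): TRL trainers accept standard transformers callbacks, so a single-callback hook-in presumably looks something like the sketch below. The `PTTCallback` name, its import path, and the stand-in body are assumptions.

```python
# Hedged sketch only: `PTTCallback` and its import path are assumed names,
# not the toolkit's documented API. The stand-in below just shows where a
# diagnostics callback would hook into a TRL training run.
from datasets import load_dataset
from transformers import TrainerCallback
from trl import SFTConfig, SFTTrainer

# from post_training_toolkit import PTTCallback  # assumed import path

class PTTCallback(TrainerCallback):
    """Stand-in callback: inspect trainer state at the end of every step."""
    def on_step_end(self, args, state, control, **kwargs):
        # A real diagnostics callback would check metrics here and could set
        # control.should_training_stop to intervene safely in a long run.
        return control

dataset = load_dataset("trl-lib/Capybara", split="train")

trainer = SFTTrainer(
    model="Qwen/Qwen2.5-0.5B",                   # small demo model, swap as needed
    train_dataset=dataset,
    args=SFTConfig(output_dir="sft-with-ptt", max_steps=10),
    callbacks=[PTTCallback()],                   # the single-callback hook-in
)
trainer.train()
```

The appeal of the callback route is that monitoring and intervention ride along with an existing TRL run without changing the training loop itself.
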
victor posted an update 4 days ago
AdinaY posted an update 5 days ago
LongCat-Flash-Lite 🔥 a non-thinking MoE model released by the Meituan LongCat team.

meituan-longcat/LongCat-Flash-Lite

✨ Total 68.5B / 3B active - MIT license
✨ 256k context
✨ Faster inference with N-gram embeddings
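
A rough loading sketch, assuming the checkpoint exposes the usual transformers causal-LM and chat-template interface (check the model card for the actual instructions; at 68.5B total parameters you'll want multiple GPUs or offloading):

```python
# Hedged sketch: assumes the repo follows the standard transformers
# causal-LM interface; the model card is authoritative.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meituan-longcat/LongCat-Flash-Lite"
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",          # 68.5B total params: expect multi-GPU or offload
    trust_remote_code=True,
)

messages = [{"role": "user", "content": "Summarize N-gram embeddings in one sentence."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
print(tokenizer.decode(model.generate(inputs, max_new_tokens=128)[0], skip_special_tokens=True))
```
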
sergiopaniego posted an update 5 days ago
AdinaY posted an update 5 days ago
Ant Group is going big on robotics 🤖

They just dropped their first VLA and depth-perception foundation models on Hugging Face.

✨ LingBot-VLA :
- Trained on 20k hours of real-world robot data
- 9 robot embodiments
- Clear no-saturation scaling laws
- Apache 2.0

Model: https://huggingface.co/collections/robbyant/lingbot-vla
Paper: A Pragmatic VLA Foundation Model (2601.18692)

✨ LingBot-Depth:
- Metric-accurate 3D from noisy, incomplete depth
- Masked Depth Modeling (self-supervised)
- RGB–depth alignment, works with <5% sparse depth
- Apache 2.0

Model: https://huggingface.co/collections/robbyant/lingbot-depth
Paper: Masked Depth Modeling for Spatial Perception (2601.17895)
AdinaY posted an update 6 days ago
AdinaY posted an update 6 days ago
AdinaY posted an update 6 days ago
sergiopaniego posted an update 7 days ago
AdinaY posted an update 12 days ago
AgentCPM-Report 🔥 a local DeepResearch agent released by OpenBMB

openbmb/AgentCPM-Report

✨ 8B - Apache 2.0
✨ Gemini-2.5-Pro level DeepResearch report generation
✨ Fully offline, privacy-first local deployment
✨ + GGUF version
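
For the fully-offline angle, one common route is llama-cpp-python pointed at a locally downloaded GGUF file. This is a minimal sketch, not the project's documented workflow; the file name and quantization below are placeholders, so grab the real GGUF from the release linked on the model card.

```python
# Hedged sketch of fully-offline inference with a locally downloaded GGUF file.
# The path and quant name are placeholders; use the actual GGUF release from
# the openbmb/AgentCPM-Report model card.
from llama_cpp import Llama

llm = Llama(
    model_path="./AgentCPM-Report-Q4_K_M.gguf",  # placeholder filename
    n_ctx=8192,       # context window; raise it if your report prompts are long
    n_gpu_layers=-1,  # offload all layers to GPU if one is available
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Draft a short research report outline on MoE inference."}],
    max_tokens=512,
)
print(out["choices"][0]["message"]["content"])
```
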
AdinaY posted an update 13 days ago
AdinaY posted an update 14 days ago
Z.ai just released a powerful lightweight version of GLM-4.7

✨ 30B total/3B active - MoE

zai-org/GLM-4.7-Flash
sergiopaniego posted an update 14 days ago
FunctionGemma Tuning Lab is a new no-code tool by @google that lets you fine-tune a model directly in the browser, using TRL behind the scenes.

blog: https://developers.googleblog.com/a-guide-to-fine-tuning-functiongemma/

try it out: google/functiongemma-tuning-lab

It builds on a more advanced example for learning SFT fine-tuning with TRL: https://ai.google.dev/gemma/docs/functiongemma/finetuning-with-functiongemma
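
The Tuning Lab hides all of this, but for a sense of what a function-calling example looks like under the hood, here is a sketch using transformers' tool-use chat templating. The model id is a placeholder and it assumes FunctionGemma's tokenizer ships a chat template with tool support; the linked guides are the authoritative reference.

```python
# Hedged sketch: the model id is a placeholder and this assumes the tokenizer
# ships a chat template with tool support; see the linked guides for the real flow.
from transformers import AutoTokenizer

def get_weather(city: str) -> str:
    """Get the current weather for a city.

    Args:
        city: Name of the city to look up.
    """
    return "sunny"

tokenizer = AutoTokenizer.from_pretrained("google/functiongemma")  # placeholder id

messages = [{"role": "user", "content": "What's the weather in Madrid?"}]
prompt = tokenizer.apply_chat_template(
    messages,
    tools=[get_weather],          # transformers converts the signature to a JSON schema
    add_generation_prompt=True,
    tokenize=False,
)
print(prompt)  # the tool schema is rendered into the prompt format the model expects
```
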
AdinaY posted an update 14 days ago
Another Chinese model fully trained on domestic chips, released by China Telecom 👀

Tele-AI/TeleChat3-36B-Thinking

TeleChat3-36B-Thinking:
✨ Native support for the Ascend + MindSpore ecosystem
✨ Inspired by DeepSeek’s architecture design, bringing training stability and efficiency gains.