view article Article Nucleus-Image: Scaling Text-to-Image with Sparse Mixture of Experts NucleusAI • Apr 14 • 11
BANG: Dividing 3D Assets via Generative Exploded Dynamics Paper • 2507.21493 • Published Jul 29, 2025 • 65
view article Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders thomwolf, matthieu-lapeyre • Jul 9, 2025 • 799
view article Article State of open video generation models in Diffusers +1 sayakpaul, a-r-r-o-w, dn6 • Jan 27, 2025 • 70
view article Article We now support VLMs in smolagents! +1 m-ric, merve, albertvillanova • Jan 24, 2025 • 113
CoTracker3: Simpler and Better Point Tracking by Pseudo-Labelling Real Videos Paper • 2410.11831 • Published Oct 15, 2024 • 9
view article Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models +1 andito, merve, SkalskiP • Jun 24, 2024 • 207
view article Article Diffusers welcomes Stable Diffusion 3 +4 dn6, YiYiXu, sayakpaul, OzzyGT, kashif, multimodalart • Jun 12, 2024 • 99
view article Article Hugging Face x LangChain : A new partner package +1 Jofthomas, kkondratenko, efriis • May 14, 2024 • 161
view article Article Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent +2 qgallouedec, edbeeching, ClementRomac, thomwolf • Apr 22, 2024 • 81
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences Paper • 2404.03715 • Published Apr 4, 2024 • 62
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27, 2024 • 629
Masked Audio Generation using a Single Non-Autoregressive Transformer Paper • 2401.04577 • Published Jan 9, 2024 • 44