Sarashina2.2 Collection Large Language Models developed by SB Intuitions. Pretrained and instruction-tuned models are available in three sizes: 0.5B, 1B, and 3B. • 6 items • Updated Mar 5, 2025 • 9
Domain-Specific LLMs: Japanese Finance Collection ドメイン特化LLM:日本語金融 • 6 items • Updated 25 days ago • 3
Constructing Synthetic Instruction Datasets for Improving Reasoning in Domain-Specific LLMs: A Case Study in the Japanese Financial Domain Paper • 2603.01353 • Published Mar 2 • 3
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation Paper • 2603.19220 • Published 14 days ago • 64
InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing Paper • 2603.09877 • Published 23 days ago • 47
DEJIMA: A Novel Large-scale Japanese Dataset for Image Captioning and Visual Question Answering Paper • 2512.00773 • Published Nov 30, 2025 • 1
SkyReels-V4: Multi-modal Video-Audio Generation, Inpainting and Editing model Paper • 2602.21818 • Published Feb 25 • 56
Qwen3.5 Collection Qwen3.5 is Qwen's new model family including Qwen3.5 Small: 0.8B, 2B, 4B, 9B and Qwen3.5 Medium: 35B-A3B, 27B, 122B-A10B and 397B-A17B. • 25 items • Updated 22 days ago • 128
BitDance Collection BitDance: Open-source autoregressive model with binary visual tokens. A research project for building powerful multimodal autoregressive model. • 10 items • Updated Mar 2 • 11
NEST-Ja Collection Japanese speech self-supervised learning model developed by SB Intuitions. • 2 items • Updated Feb 10 • 1
Atlas: Orchestrating Heterogeneous Models and Tools for Multi-Domain Complex Reasoning Paper • 2601.03872 • Published Jan 7 • 45