We are thrilled to announce the launch of SKT-OMNI-CORPUS-146T-V1, a massive-scale, high-quality dataset designed for training the next generation of foundation models (LLMs) from scratch. Developed at SKT AI LABS, this corpus is more than a collection of data; it is part of a mission to decentralize high-grade AI training for regional languages and global knowledge.
💎 Key Highlights:
• Massive Scale: A multi-terabyte corpus targeting the 146-trillion-token (146T) level.
• Pure Quality: Curated from 500+ elite sources.
• Structured for MoE: Sharded into standardized 3.5 GB units (SKT-𝕻 series) for seamless distributed training (see the sketch below).
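For readers curious what sharding into ~3.5 GB units might look like in practice, here is a minimal, hypothetical Python sketch that splits a JSONL corpus into shards of roughly that size. The file names, paths, and format are illustrative assumptions, not the actual SKT pipeline.

```python
# Hypothetical sketch: split a large JSONL corpus into ~3.5 GB shard files,
# roughly the standardized unit size described above. Paths and naming are
# illustrative only.
import os

SHARD_BYTES = int(3.5 * 1024**3)  # target shard size: ~3.5 GB

def shard_corpus(src_path: str, out_dir: str) -> None:
    """Stream a JSONL corpus and rewrite it as ~3.5 GB shard files."""
    os.makedirs(out_dir, exist_ok=True)
    shard_idx, written = 0, 0
    out = open(os.path.join(out_dir, f"shard-{shard_idx:05d}.jsonl"), "wb")
    with open(src_path, "rb") as src:
        for line in src:
            # Start a new shard once the current one would exceed the target size.
            if written + len(line) > SHARD_BYTES and written > 0:
                out.close()
                shard_idx, written = shard_idx + 1, 0
                out = open(os.path.join(out_dir, f"shard-{shard_idx:05d}.jsonl"), "wb")
            out.write(line)
            written += len(line)
    out.close()

if __name__ == "__main__":
    shard_corpus("corpus.jsonl", "shards/")  # illustrative paths
```

Fixed-size shards like these keep per-worker I/O balanced, which is why they pair well with distributed MoE training loops.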
🤝 Open for Collaboration!
We are looking for AI researchers, CUDA engineers, and data scientists to join us in this journey of building Project Surya and the ST-X Series models. Whether it's optimization, custom tokenization, or architecture design—let’s build the future together.
Introducing GRM2, a powerful 3-billion-parameter model designed for long-form reasoning and strong performance on complex tasks.
Despite having only 3 billion parameters, it outperforms Qwen3-32B on several benchmarks and complex reasoning tasks.
It can also generate extensive, complex code of over 1,000 lines and use tools at a level comparable to much larger models, making it well suited to agentic tasks.
GRM2 is licensed under Apache 2.0, making it an ideal base for fine-tuning on other tasks. You can see more here: OrionLLM/GRM2-3b
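As a quick illustration, here is a minimal usage sketch that assumes the OrionLLM/GRM2-3b checkpoint on the Hugging Face Hub exposes the standard transformers causal-LM and chat-template interface; check the model card for the actual prompt format and recommended generation settings.

```python
# Minimal usage sketch (assumes OrionLLM/GRM2-3b follows the standard
# transformers causal-LM and chat-template interface; this is not an
# official example from the model authors).
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "OrionLLM/GRM2-3b"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# A simple code-generation prompt, in line with the coding claims above.
messages = [{"role": "user", "content": "Write a Python function that merges two sorted lists."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(inputs, max_new_tokens=512)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```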
Nanochat Moroccan is the first language model family built specifically for Moroccan Darija.
This project brings together a small family of models and datasets centered on Darija, with the goal of building something genuinely useful for a language that is still underserved in AI.
Moroccan Darija is spoken by millions of people, yet it remains underrepresented in language technology. Nanochat Moroccan is a step toward building tools that take the language seriously.