Yan Varakin PRO

ZDPLI

https://www.researchgate.net/profile/Yan-Varakin

AI & ML interests

All areas of NLP, computational mathematics, reinforcement learning, and robotics.

Recent Activity

upvoted a paper about 2 months ago

3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model

upvoted a paper about 2 months ago

Demystifing Video Reasoning

ZDPLI's activity

upvoted an article about 1 month ago

Article

Mimicking Consciousness in LLMs: Ascending the Dimensions of Thought with Recurrent Processing

KnutJaegersberg • Feb 20, 2025 • 4

upvoted 3 papers about 2 months ago

upvoted a paper 3 months ago

Latent Chain-of-Thought as Planning: Decoupling Reasoning from Verbalization

Paper • 2601.21358 • Published Jan 29 • 7

upvoted an article 9 months ago

Article

Activation Steering: A New Frontier in AI Control—But Does It Scale?

royswastik • Feb 2, 2025 • 5

upvoted an article 11 months ago

Article

Gemma 3n fully available in the open-source ecosystem!

ariG23498, pcuenq, sergiopaniego, reach-vb, FL33TW00D-HF, Xenova, Steveeeeeeen, kashif • Jun 26, 2025 • 121

liked a Space 11 months ago

Lingshu 7B

🩻

Chat with Lingshu 7B, a multimodal medical model

updated a Space 12 months ago

SkinLesionClassifierHAM10K

📈

Diagnose skin conditions from images

upvoted 2 articles about 1 year ago

Article

StackLLaMA: A hands-on guide to train LLaMA with RLHF

edbeeching, kashif, ybelkada, lewtun, lvwerra, nazneen, natolambert • Apr 5, 2023 • 48

Article

Fine-tune Llama 2 with DPO

kashif, ybelkada, lvwerra • Aug 8, 2023 • 69

upvoted a paper about 1 year ago

Phi-4-reasoning Technical Report

Paper • 2504.21318 • Published Apr 30, 2025 • 55

upvoted an article about 1 year ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

NormalUhr • Feb 7, 2025 • 292

upvoted 4 papers about 1 year ago

100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models

Paper • 2505.00551 • Published May 1, 2025 • 36

LLMs for Engineering: Teaching Models to Design High Powered Rockets

Paper • 2504.19394 • Published Apr 27, 2025 • 13

AdaR1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization

Paper • 2504.21659 • Published Apr 30, 2025 • 14

Llama-Nemotron: Efficient Reasoning Models

Paper • 2505.00949 • Published May 2, 2025 • 44

published a Space about 1 year ago

SkinLesionClassifierHAM10K

📈

Diagnose skin conditions from images

upvoted a paper about 1 year ago

Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers

Paper • 2504.20752 • Published Apr 29, 2025 • 95

updated a Space about 1 year ago

DermaScanBeta

🌍

Updated version of DermaScan system
