Open to Work

22 15 17

Aritra Dutta

dutta18

https://vpnleaderboard.com/

AI & ML interests

None yet

Recent Activity

upvoted a collection 20 days ago

LLaVa-NeXT

updated a dataset 21 days ago

dutta18/esnlive

published a dataset 21 days ago

dutta18/esnlive

View all activity

Organizations

upvoted a collection 20 days ago

LLaVa-NeXT

Collection

LLaVa-NeXT (also known as LLaVa-1.6) improves upon the 1.5 series by incorporating higher image resolutions and more reasoning/OCR datasets. • 8 items • Updated Jul 19, 2024 • 34

upvoted an article 26 days ago

Article

Multimodal Embedding & Reranker Models with Sentence Transformers

28 days ago

•

upvoted a collection about 1 month ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 10 items • Updated Mar 2 • 562

upvoted an article 3 months ago

Article

Making automatic speech recognition work on large files with Wav2Vec2 in 🤗 Transformers

Feb 1, 2022

•

upvoted an article 4 months ago

Article

Running Large Transformer Models on Mobile and Edge Devices

Nov 3, 2025

•

upvoted 3 articles 5 months ago

Article

Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies

Feb 17, 2025

•

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

291

Article

Preference Optimization for Vision Language Models

Jul 10, 2024

•

upvoted an article 6 months ago

Article

Vision Language Model Alignment in TRL ⚡️

Aug 7, 2025

•

111

upvoted 2 articles 8 months ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

Jan 30, 2025

•

318

Article

Fine-tune Llama 2 with DPO

Aug 8, 2023

•

upvoted an article 9 months ago

Article

Decoding Strategies in Large Language Models

Oct 29, 2024

•

113

upvoted a collection 9 months ago

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 670

upvoted an article 9 months ago

Article

Introducing Command A Vision: Multimodal AI built for Business

Jul 31, 2025

•

upvoted an article about 1 year ago

Article

SmolVLM - small yet mighty Vision Language Model

Nov 26, 2024

•

417

Aritra Dutta

AI & ML interests

Recent Activity

Organizations

dutta18's activity

Multimodal Embedding & Reranker Models with Sentence Transformers

Making automatic speech recognition work on large files with Wav2Vec2 in 🤗 Transformers

Running Large Transformer Models on Mobile and Edge Devices

Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Preference Optimization for Vision Language Models

Vision Language Model Alignment in TRL ⚡️

KV Caching Explained: Optimizing Transformer Inference Efficiency

Fine-tune Llama 2 with DPO

Decoding Strategies in Large Language Models

Introducing Command A Vision: Multimodal AI built for Business

SmolVLM - small yet mighty Vision Language Model