
Sayan Deb Sarkar

sayandsarkar

https://sayands.github.io/

AI & ML interests

3D Computer Vision, 3D Scene Understanding

Recent Activity

liked a Space about 1 month ago

yuanwenyue/FiT3D

upvoted a paper 3 months ago

Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models

upvoted a paper 4 months ago

From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors

liked a Space about 1 month ago

FiT3D

🏃

Visualize and compare 2D and 3D-aware feature representations of images

upvoted a paper 3 months ago

Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models

Paper • 2603.18002 • Published Mar 18 • 15

upvoted 2 papers 4 months ago

From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors

Paper • 2602.21778 • Published Feb 25 • 15

CoPE-VideoLM: Codec Primitives For Efficient Video Language Models

Paper • 2602.13191 • Published Feb 13 • 32

submitted a paper to Daily Papers 4 months ago

CoPE-VideoLM: Codec Primitives For Efficient Video Language Models

Paper • 2602.13191 • Published Feb 13 • 32

liked a Space 4 months ago

Vision Arena (Testing VLMs side-by-side)

🖼

561

Explore Vision Arena visual AI demo online

liked a Space 7 months ago

GuideFlow3D

🤗

A HF Space that demonstrates all use-cases for GuideFlow3D

published a Space 7 months ago

GuideFlow3D

🔥

Robust cross-category 3D appearance transfer

liked a Space 7 months ago

TRELLIS

🏢

646

Scalable and Versatile 3D Generation from images

upvoted a paper 8 months ago

GuideFlow3D: Optimization-Guided Rectified Flow For Appearance Transfer

Paper • 2510.16136 • Published Oct 17, 2025 • 5

authored a paper 8 months ago

GuideFlow3D: Optimization-Guided Rectified Flow For Appearance Transfer

Paper • 2510.16136 • Published Oct 17, 2025 • 5

commented on a paper 8 months ago

GuideFlow3D: Optimization-Guided Rectified Flow For Appearance Transfer

Paper • 2510.16136 • Published Oct 17, 2025 • 5

updated a model 9 months ago

gradient-spaces/CrossOver

Updated Sep 28, 2025 • 6

liked a model 10 months ago

lmms-lab/LLaVA-Video-7B-Qwen2

Video-Text-to-Text • 8B • Updated Oct 25, 2024 • 20.8k • 127

liked a model about 1 year ago

gradient-spaces/CrossOver

Updated Sep 28, 2025 • 6

published a model about 1 year ago

gradient-spaces/CrossOver

Updated Sep 28, 2025 • 6

upvoted 2 papers about 1 year ago

Visual Chronicles: Using Multimodal LLMs to Analyze Massive Collections of Images

Paper • 2504.08727 • Published Apr 11, 2025 • 12

The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks

Paper • 2502.08235 • Published Feb 12, 2025 • 59

published a Space over 1 year ago

Gradient Spaces

🤖

commented on a paper over 1 year ago

CrossOver: 3D Scene Cross-Modal Alignment

Paper • 2502.15011 • Published Feb 20, 2025 • 2
