arxiv:2601.09708
Min-Hung Chen
cmhungsteve
AI & ML interests
Multimodal AI, Transfer Learning, Unsupervised Learning, Video Understanding, Vision Transformer, Computer Vision, Deep Learning
Recent Activity
authored
a paper
about 2 hours ago
3AM: Segment Anything with Geometric Consistency in Videos
authored
a paper
about 2 hours ago
Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning
upvoted
a
paper
about 11 hours ago
GenRecal: Generation after Recalibration from Large to Small
Vision-Language Models