Literate Goggles

literate-goggles

AI & ML interests

None yet

Organizations

None yet

upvoted 4 papers 3 days ago

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published Nov 12, 2025 • 124

Music Flamingo: Scaling Music Understanding in Audio Language Models

Paper • 2511.10289 • Published Nov 13, 2025 • 14

Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting

Paper • 2601.02151 • Published 18 days ago • 100

CosyEdit: Unlocking End-to-End Speech Editing Capability from Zero-Shot Text-to-Speech Models

Paper • 2601.05329 • Published 14 days ago • 1

upvoted a paper 18 days ago

Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation

Paper • 2601.00664 • Published 21 days ago • 54

upvoted 2 papers 24 days ago

Knot Forcing: Taming Autoregressive Video Diffusion Models for Real-time Infinite Interactive Portrait Animation

Paper • 2512.21734 • Published 28 days ago • 5

LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

Paper • 2512.23576 • Published 24 days ago • 65

upvoted 5 papers about 1 month ago

Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing

Paper • 2512.17909 • Published Dec 19, 2025 • 37

Scaling Behavior of Discrete Diffusion Language Models

Paper • 2512.10858 • Published Dec 11, 2025 • 8

Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis

Paper • 2411.19509 • Published Nov 29, 2024 • 3

Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion

Paper • 2512.04926 • Published Dec 4, 2025 • 42

TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows

Paper • 2512.05150 • Published Dec 3, 2025 • 75

upvoted a paper about 2 months ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published Dec 4, 2025 • 169

upvoted 3 articles about 2 months ago

Continuous batching from first principles

Article • Nov 25, 2025 • 309

Diffusers welcomes FLUX-2

Article • Nov 25, 2025 • 170

SARLO-80: Worldwide Slant SAR Language Optic Dataset at 80 cm Resolution

Article • Dec 1, 2025 • 4

upvoted 4 papers 2 months ago

Step-Audio-EditX Technical Report

Paper • 2511.03601 • Published Nov 5, 2025 • 29

The Free Transformer

Paper • 2510.17558 • Published Oct 20, 2025 • 5

Controllable Video Generation: A Survey

Paper • 2507.16869 • Published Jul 22, 2025 • 1

Evaluating In Silico Creativity: An Expert Review of AI Chess Compositions

Paper • 2510.23772 • Published Oct 27, 2025 • 2