-
LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing
Paper • 2507.22627 • Published • 1 -
Fashion-RAG: Multimodal Fashion Image Editing via Retrieval-Augmented Generation
Paper • 2504.14011 • Published -
DPDEdit: Detail-Preserved Diffusion Models for Multimodal Fashion Image Editing
Paper • 2409.01086 • Published -
Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing
Paper • 2403.14828 • Published
Collections
Discover the best community collections!
Collections including paper arxiv:2411.16819
-
Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator
Paper • 2411.15466 • Published • 39 -
Pathways on the Image Manifold: Image Editing via Video Generation
Paper • 2411.16819 • Published • 37 -
DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting
Paper • 2411.17223 • Published • 7 -
ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting
Paper • 2411.17176 • Published • 24
-
Zero-shot Image Editing with Reference Imitation
Paper • 2406.07547 • Published • 33 -
The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing
Paper • 2406.10601 • Published • 70 -
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale
Paper • 2407.05282 • Published • 15 -
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
Paper • 2407.16982 • Published • 42
-
Generating Compositional Scenes via Text-to-image RGBA Instance Generation
Paper • 2411.10913 • Published • 4 -
ROICtrl: Boosting Instance Control for Visual Generation
Paper • 2411.17949 • Published • 87 -
Pathways on the Image Manifold: Image Editing via Video Generation
Paper • 2411.16819 • Published • 37
-
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Paper • 2405.08748 • Published • 23 -
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection
Paper • 2405.10300 • Published • 30 -
Chameleon: Mixed-Modal Early-Fusion Foundation Models
Paper • 2405.09818 • Published • 132 -
OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework
Paper • 2405.11143 • Published • 40
-
LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing
Paper • 2507.22627 • Published • 1 -
Fashion-RAG: Multimodal Fashion Image Editing via Retrieval-Augmented Generation
Paper • 2504.14011 • Published -
DPDEdit: Detail-Preserved Diffusion Models for Multimodal Fashion Image Editing
Paper • 2409.01086 • Published -
Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing
Paper • 2403.14828 • Published
-
Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator
Paper • 2411.15466 • Published • 39 -
Pathways on the Image Manifold: Image Editing via Video Generation
Paper • 2411.16819 • Published • 37 -
DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting
Paper • 2411.17223 • Published • 7 -
ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting
Paper • 2411.17176 • Published • 24
-
Generating Compositional Scenes via Text-to-image RGBA Instance Generation
Paper • 2411.10913 • Published • 4 -
ROICtrl: Boosting Instance Control for Visual Generation
Paper • 2411.17949 • Published • 87 -
Pathways on the Image Manifold: Image Editing via Video Generation
Paper • 2411.16819 • Published • 37
-
Zero-shot Image Editing with Reference Imitation
Paper • 2406.07547 • Published • 33 -
The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing
Paper • 2406.10601 • Published • 70 -
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale
Paper • 2407.05282 • Published • 15 -
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
Paper • 2407.16982 • Published • 42
-
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Paper • 2405.08748 • Published • 23 -
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection
Paper • 2405.10300 • Published • 30 -
Chameleon: Mixed-Modal Early-Fusion Foundation Models
Paper • 2405.09818 • Published • 132 -
OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework
Paper • 2405.11143 • Published • 40