WordRobe: Text-Guided Generation of Textured 3D Garments

Srivastava, Astitva; Manu, Pranav; Raj, Amit; Jampani, Varun; Sharma, Avinash

Computer Science > Computer Vision and Pattern Recognition

arXiv:2403.17541 (cs)

[Submitted on 26 Mar 2024 (v1), last revised 14 Jul 2024 (this version, v2)]

Title:WordRobe: Text-Guided Generation of Textured 3D Garments

Authors:Astitva Srivastava, Pranav Manu, Amit Raj, Varun Jampani, Avinash Sharma

View PDF HTML (experimental)

Abstract:In this paper, we tackle a new and challenging problem of text-driven generation of 3D garments with high-quality textures. We propose "WordRobe", a novel framework for the generation of unposed & textured 3D garment meshes from user-friendly text prompts. We achieve this by first learning a latent representation of 3D garments using a novel coarse-to-fine training strategy and a loss for latent disentanglement, promoting better latent interpolation. Subsequently, we align the garment latent space to the CLIP embedding space in a weakly supervised manner, enabling text-driven 3D garment generation and editing. For appearance modeling, we leverage the zero-shot generation capability of ControlNet to synthesize view-consistent texture maps in a single feed-forward inference step, thereby drastically decreasing the generation time as compared to existing methods. We demonstrate superior performance over current SOTAs for learning 3D garment latent space, garment interpolation, and text-driven texture synthesis, supported by quantitative evaluation and qualitative user study. The unposed 3D garment meshes generated using WordRobe can be directly fed to standard cloth simulation & animation pipelines without any post-processing.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
Cite as:	arXiv:2403.17541 [cs.CV]
	(or arXiv:2403.17541v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2403.17541

Submission history

From: Astitva Srivastava [view email]
[v1] Tue, 26 Mar 2024 09:44:34 UTC (39,520 KB)
[v2] Sun, 14 Jul 2024 22:05:06 UTC (44,282 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:WordRobe: Text-Guided Generation of Textured 3D Garments

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:WordRobe: Text-Guided Generation of Textured 3D Garments

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators