CT4D: Consistent Text-to-4D Generation with Animatable Meshes

Chen, Ce; Huang, Shaoli; Chen, Xuelin; Chen, Guangyi; Han, Xiaoguang; Zhang, Kun; Gong, Mingming

Computer Science > Graphics

arXiv:2408.08342 (cs)

[Submitted on 15 Aug 2024]

Title:CT4D: Consistent Text-to-4D Generation with Animatable Meshes

Authors:Ce Chen, Shaoli Huang, Xuelin Chen, Guangyi Chen, Xiaoguang Han, Kun Zhang, Mingming Gong

View PDF HTML (experimental)

Abstract:Text-to-4D generation has recently been demonstrated viable by integrating a 2D image diffusion model with a video diffusion model. However, existing models tend to produce results with inconsistent motions and geometric structures over time. To this end, we present a novel framework, coined CT4D, which directly operates on animatable meshes for generating consistent 4D content from arbitrary user-supplied prompts. The primary challenges of our mesh-based framework involve stably generating a mesh with details that align with the text prompt while directly driving it and maintaining surface continuity. Our CT4D framework incorporates a unique Generate-Refine-Animate (GRA) algorithm to enhance the creation of text-aligned meshes. To improve surface continuity, we divide a mesh into several smaller regions and implement a uniform driving function within each area. Additionally, we constrain the animating stage with a rigidity regulation to ensure cross-region continuity. Our experimental results, both qualitative and quantitative, demonstrate that our CT4D framework surpasses existing text-to-4D techniques in maintaining interframe consistency and preserving global geometry. Furthermore, we showcase that this enhanced representation inherently possesses the capability for combinational 4D generation and texture editing.

Subjects:	Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2408.08342 [cs.GR]
	(or arXiv:2408.08342v1 [cs.GR] for this version)
	https://doi.org/10.48550/arXiv.2408.08342

Submission history

From: Ce Chen [view email]
[v1] Thu, 15 Aug 2024 14:41:34 UTC (15,313 KB)

Computer Science > Graphics

Title:CT4D: Consistent Text-to-4D Generation with Animatable Meshes

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Graphics

Title:CT4D: Consistent Text-to-4D Generation with Animatable Meshes

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators