CorrespondentDream: Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences

Kim, Seungwook; Li, Kejie; Deng, Xueqing; Shi, Yichun; Cho, Minsu; Wang, Peng

Computer Science > Computer Vision and Pattern Recognition

arXiv:2404.10603 (cs)

[Submitted on 16 Apr 2024 (v1), last revised 16 Sep 2024 (this version, v2)]

Title:CorrespondentDream: Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences

Authors:Seungwook Kim, Kejie Li, Xueqing Deng, Yichun Shi, Minsu Cho, Peng Wang

View PDF HTML (experimental)

Abstract:Leveraging multi-view diffusion models as priors for 3D optimization have alleviated the problem of 3D consistency, e.g., the Janus face problem or the content drift problem, in zero-shot text-to-3D models. However, the 3D geometric fidelity of the output remains an unresolved issue; albeit the rendered 2D views are realistic, the underlying geometry may contain errors such as unreasonable concavities. In this work, we propose CorrespondentDream, an effective method to leverage annotation-free, cross-view correspondences yielded from the diffusion U-Net to provide additional 3D prior to the NeRF optimization process. We find that these correspondences are strongly consistent with human perception, and by adopting it in our loss design, we are able to produce NeRF models with geometries that are more coherent with common sense, e.g., more smoothed object surface, yielding higher 3D fidelity. We demonstrate the efficacy of our approach through various comparative qualitative results and a solid user study.

Comments:	25 pages, 22 figures, accepted to CVPR 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2404.10603 [cs.CV]
	(or arXiv:2404.10603v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2404.10603

Submission history

From: Seungwook Kim [view email]
[v1] Tue, 16 Apr 2024 14:28:57 UTC (27,817 KB)
[v2] Mon, 16 Sep 2024 18:29:46 UTC (27,817 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:CorrespondentDream: Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:CorrespondentDream: Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators