FlexiDreamer: Single Image-to-3D Generation with FlexiCubes

Paper | Project Page

Ruowen Zhao¹·⁴, Zhengyi Wang²·⁴, Yikai Wang², Zihan Zhou³, Jun Zhu²·⁴

¹University of Chinese Academy of Sciences, ²Tsinghua University, ³Xidian University, ⁴ShengShu

Our code will be released soon... 🏗️🚧🔨

Abstract

3D content generation from text prompts or single images has recently made remarkable progress in quality and speed. One of its dominant paradigms involves generating consistent multi-view images followed by a sparse-view reconstruction. However, because directly deforming a mesh representation toward the target topology is challenging, most methodologies learn an implicit representation (such as NeRF) during the sparse-view reconstruction and acquire the target mesh through a post-processing extraction. Although the implicit representation can effectively model rich 3D information, its training typically entails a long convergence time. In addition, the post-extraction operation from the implicit field leads to undesirable visual artifacts. In this paper, we propose FlexiDreamer, a novel single image-to-3D generation framework that reconstructs the target mesh in an end-to-end manner. By leveraging a flexible gradient-based extraction known as FlexiCubes, our method circumvents the defects introduced by post-processing and enables direct acquisition of the target mesh. Furthermore, we incorporate a multi-resolution hash grid encoding scheme that progressively activates the encoding levels of the implicit field in FlexiCubes, helping to capture geometric details in each optimization step. Notably, FlexiDreamer recovers a dense 3D structure from a single-view image in approximately 1 minute on a single NVIDIA A100 GPU, outperforming previous methodologies by a large margin.
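The progressive activation of hash-grid encoding levels mentioned above can be sketched as a coarse-to-fine mask over the levels. The following NumPy sketch is illustrative only: the function names, the linear warm-up schedule, and the constants are our assumptions, not values from the paper.

```python
import numpy as np

def level_mask(step: int, num_levels: int = 16,
               warmup_steps: int = 300) -> np.ndarray:
    """Soft mask that progressively activates hash-grid encoding levels.

    Coarse levels switch on first; each finer level is ramped in as
    training proceeds. The linear schedule and constants here are
    illustrative assumptions, not the paper's exact settings.
    """
    # Fraction of levels that should be active at this step.
    active = num_levels * min(1.0, step / warmup_steps)
    idx = np.arange(num_levels, dtype=np.float64)
    # 1.0 for fully active levels, a linear ramp for the level
    # currently being enabled, 0.0 for still-inactive fine levels.
    return np.clip(active - idx, 0.0, 1.0)

def masked_features(features: np.ndarray, step: int) -> np.ndarray:
    """Zero out encodings from not-yet-activated levels.

    `features` has shape (..., num_levels, feat_dim), e.g. the
    per-level outputs of a multi-resolution hash grid.
    """
    mask = level_mask(step, num_levels=features.shape[-2])
    return features * mask[:, None]
```

At `step >= warmup_steps` every level is fully active, so the mask becomes a no-op; early in training, gradients only flow through the coarse levels, which is the intended coarse-to-fine behavior.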

(Figure: first.png)

Method

FlexiDreamer is an end-to-end framework for high-quality 3D generation from a reference image. In each optimization step, we estimate the signed distance field of a bounded space and use FlexiCubes to derive an explicit mesh geometry from it. Texture is then applied to the mesh surface by a texture neural network. The entire framework is trained end-to-end on generated images with reconstruction losses.
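The per-step loop above (query SDF → extract explicit geometry → texture/render → reconstruction loss) can be sketched as follows. This is a toy stand-in, not the paper's implementation: the analytic sphere SDF, the band-threshold "extraction", and the geometric loss are placeholders for the learned hash-grid SDF network, the differentiable FlexiCubes extraction, and the rendered-image losses.

```python
import numpy as np

def query_sdf(points: np.ndarray) -> np.ndarray:
    """Placeholder implicit field: signed distance to a sphere of radius 0.5
    (stands in for the learned hash-grid SDF network)."""
    return np.linalg.norm(points, axis=-1) - 0.5

def extract_surface(points: np.ndarray, sdf: np.ndarray,
                    band: float = 0.05) -> np.ndarray:
    """Stand-in for FlexiCubes: keep grid points within a thin band of the
    zero level set as a proxy for the extracted mesh vertices."""
    return points[np.abs(sdf) < band]

def reconstruction_step(grid: np.ndarray) -> float:
    sdf = query_sdf(grid)                  # 1. estimate the SDF over the grid
    surface = extract_surface(grid, sdf)   # 2. derive explicit geometry
    # 3. In the full pipeline, a texture network colors the mesh and the
    #    render is compared to the generated multi-view images. Here we
    #    return a dummy geometric loss: mean distance of the extracted
    #    points from the true sphere surface.
    return float(np.mean(np.abs(np.linalg.norm(surface, axis=-1) - 0.5)))

# Regular grid over the bounded space [-1, 1]^3.
axis = np.linspace(-1.0, 1.0, 21)
grid = np.stack(np.meshgrid(axis, axis, axis), axis=-1).reshape(-1, 3)
loss = reconstruction_step(grid)
```

In the real system, every step of this loop is differentiable, so the reconstruction loss back-propagates through the extraction into the SDF network — the property that lets FlexiDreamer skip a separate post-processing extraction.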

(Figure: pipeline.png)

Comparisons

(Figure: compare.png)

Citation

If you find FlexiDreamer helpful, please cite our report:

@misc{zhao2024flexidreamer,
      title={FlexiDreamer: Single Image-to-3D Generation with FlexiCubes}, 
      author={Ruowen Zhao and Zhengyi Wang and Yikai Wang and Zihan Zhou and Jun Zhu},
      year={2024},
      eprint={2404.00987},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}
