MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis

Zhong, Ziming; Xu, Yanxu; Li, Jing; Xu, Jiale; Li, Zhengxin; Yu, Chaohui; Gao, Shenghua

Computer Science > Computer Vision and Pattern Recognition

arXiv:2407.13675 (cs)

[Submitted on 18 Jul 2024 (v1), last revised 25 Jul 2024 (this version, v3)]

Title:MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis

Authors:Ziming Zhong, Yanxu Xu, Jing Li, Jiale Xu, Zhengxin Li, Chaohui Yu, Shenghua Gao

View PDF HTML (experimental)

Abstract:We present MeshSegmenter, a simple yet effective framework designed for zero-shot 3D semantic segmentation. This model successfully extends the powerful capabilities of 2D segmentation models to 3D meshes, delivering accurate 3D segmentation across diverse meshes and segment descriptions. Specifically, our model leverages the Segment Anything Model (SAM) model to segment the target regions from images rendered from the 3D shape. In light of the importance of the texture for segmentation, we also leverage the pretrained stable diffusion model to generate images with textures from 3D shape, and leverage SAM to segment the target regions from images with textures. Textures supplement the shape for segmentation and facilitate accurate 3D segmentation even in geometrically non-prominent areas, such as segmenting a car door within a car mesh. To achieve the 3D segments, we render 2D images from different views and conduct segmentation for both textured and untextured images. Lastly, we develop a multi-view revoting scheme that integrates 2D segmentation results and confidence scores from various views onto the 3D mesh, ensuring the 3D consistency of segmentation results and eliminating inaccuracies from specific perspectives. Through these innovations, MeshSegmenter offers stable and reliable 3D segmentation results both quantitatively and qualitatively, highlighting its potential as a transformative tool in the field of 3D zero-shot segmentation. The code is available at \url{this https URL}.

Comments:	The paper was accepted by ECCV2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2407.13675 [cs.CV]
	(or arXiv:2407.13675v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.13675

Submission history

From: Ziming Zhong [view email]
[v1] Thu, 18 Jul 2024 16:50:59 UTC (14,726 KB)
[v2] Tue, 23 Jul 2024 08:47:34 UTC (14,711 KB)
[v3] Thu, 25 Jul 2024 12:32:21 UTC (14,711 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators