iSeg: Interactive 3D Segmentation via Interactive Attention
arXiv - CS - Graphics Pub Date : 2024-04-04 , DOI: arxiv-2404.03219
Itai Lang, Fei Xu, Dale Decatur, Sudarshan Babu, Rana Hanocka

We present iSeg, a new interactive technique for segmenting 3D shapes. Previous works have focused mainly on leveraging pre-trained 2D foundation models for 3D segmentation based on text. However, text may be insufficient for accurately describing fine-grained spatial segmentations. Moreover, achieving a consistent 3D segmentation using a 2D model is challenging since occluded areas of the same semantic region may not be visible together from any 2D view. Thus, we design a segmentation method conditioned on fine user clicks, which operates entirely in 3D. Our system accepts user clicks directly on the shape's surface, indicating the inclusion or exclusion of regions from the desired shape partition. To accommodate various click settings, we propose a novel interactive attention module capable of processing different numbers and types of clicks, enabling the training of a single unified interactive segmentation model. We apply iSeg to a myriad of shapes from different domains, demonstrating its versatility and faithfulness to the user's specifications. Our project page is at https://threedle.github.io/iSeg/.
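The abstract does not specify how the interactive attention module is built, so the following is only a minimal sketch of one way a single set of weights could consume a varying number of positive and negative clicks. Everything in it is an assumption made for illustration: the `ClickAttention` class, the random per-vertex features, the two-row click-type embedding, and the sigmoid output head are hypothetical and are not taken from the paper or its code.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


class ClickAttention:
    """Hypothetical cross-attention from mesh vertices to user clicks.

    Not the iSeg architecture: a toy layer showing that attention over a
    click set works for any number of clicks with fixed weights.
    """

    def __init__(self, dim, seed=0):
        rng = np.random.default_rng(seed)
        scale = 1.0 / np.sqrt(dim)
        self.dim = dim
        self.w_q = rng.normal(0.0, scale, (dim, dim))
        self.w_k = rng.normal(0.0, scale, (dim, dim))
        self.w_v = rng.normal(0.0, scale, (dim, dim))
        # Assumed click-type embedding: row 0 = exclude, row 1 = include.
        self.type_emb = rng.normal(0.0, scale, (2, dim))
        self.w_out = rng.normal(0.0, scale, (dim,))

    def __call__(self, vertex_feats, click_idx, click_is_positive):
        click_idx = np.asarray(click_idx, dtype=int)
        types = np.asarray(click_is_positive, dtype=int)
        # One token per click: feature of the clicked vertex plus its type.
        tokens = vertex_feats[click_idx] + self.type_emb[types]
        q = vertex_feats @ self.w_q              # (n_vertices, dim)
        k = tokens @ self.w_k                    # (n_clicks, dim)
        v = tokens @ self.w_v                    # (n_clicks, dim)
        # Softmax runs over the click axis, so n_clicks can vary freely.
        attn = softmax(q @ k.T / np.sqrt(self.dim), axis=-1)
        fused = vertex_feats + attn @ v          # residual update
        logits = fused @ self.w_out
        return 1.0 / (1.0 + np.exp(-logits))     # per-vertex probability


# Stand-in features for a 100-vertex shape (random, for illustration only).
feats = np.random.default_rng(1).normal(size=(100, 16))
model = ClickAttention(dim=16)

# The same model handles one positive click ...
probs_one = model(feats, [3], [True])
# ... and a mix of positive and negative clicks.
probs_three = model(feats, [3, 40, 77], [True, False, True])
```

Because the softmax normalizes over the click axis, adding or removing clicks changes only the number of key and value tokens, not the shape of any weight matrix. That property is what would let one model be trained across click settings in a sketch like this one.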

Updated: 2024-04-05