Submitted by Dave Zhenyu Chen.

Submission data

Full name: PointGroup + Transformer (w/o end-to-end fine-tuning)
Description: This is the pretrained version of D3Net - PointGroup + Transformer (w/o end-to-end fine-tuning)
Publication title: D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
Publication authors: Dave Zhenyu Chen, Qirui Wu, Matthias Niessner, Angel X. Chang
Publication venue: 17th European Conference on Computer Vision (ECCV), 2022
Publication URL:
Input Data Types: Uses XYZ coordinates, Uses Multiview Image Features, Uses Normal Vectors
Programming language(s): Python
Hardware: RTX 3090Ti
Source code or download URL:
Submission creation date: 27 Oct, 2021
Last edited: 23 Jul, 2022


Unique Unique Multiple Multiple Overall Overall