Submitted by Lichen Zhao.

Submission data

Full name: 3DJCG (VoteNet + Feature-Enhancement + Transformer-Based-Head)
Description: Joint training. We use the VoteNet backbone for detection.
Publication title: 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds
Publication authors: Daigang Cai, Lichen Zhao, Jing Zhang†, Lu Sheng, Dong Xu
Publication venue: CVPR 2022 Oral
Publication URL: https://openaccess.thecvf.com/content/CVPR2022/papers/Cai_3DJCG_A_Unified_Framework_for_Joint_Dense_Captioning_and_Visual_CVPR_2022_paper.pdf
Input data types: Uses XYZ coordinates, Uses Multiview Image Features, Uses Normal Vectors
Programming language(s): Python with CUDA
Hardware: GeForce RTX 2080 Ti, 11GB RAM
Source code or download URL: https://github.com/zlccccc/3DJCG
Submission creation date: 6 Mar, 2021
Last edited: 13 Sep, 2022

Localization

Subset     acc@0.25IoU   acc@0.5IoU
Unique     0.7675        0.6059
Multiple   0.4389        0.3117
Overall    0.5126        0.3776
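
An acc@kIoU score of this kind is conventionally the fraction of queries whose predicted 3D box overlaps the ground-truth box with an IoU of at least k. The following is a minimal sketch of that computation, not the benchmark's or the authors' evaluation code. It assumes axis-aligned boxes given as (xmin, ymin, zmin, xmax, ymax, zmax) and assumes a prediction counts as correct when its IoU is greater than or equal to the threshold. The names `box3d_iou` and `acc_at_iou` are hypothetical.

```python
from typing import Sequence, Tuple

# Assumed box layout: (xmin, ymin, zmin, xmax, ymax, zmax), axis-aligned.
Box = Tuple[float, float, float, float, float, float]


def box_volume(box: Box) -> float:
    """Volume of an axis-aligned box."""
    return (box[3] - box[0]) * (box[4] - box[1]) * (box[5] - box[2])


def box3d_iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned 3D boxes."""
    inter = 1.0
    for axis in range(3):
        lo = max(a[axis], b[axis])
        hi = min(a[axis + 3], b[axis + 3])
        if hi <= lo:
            # No overlap along this axis, so the boxes do not intersect.
            return 0.0
        inter *= hi - lo
    union = box_volume(a) + box_volume(b) - inter
    return inter / union


def acc_at_iou(preds: Sequence[Box], gts: Sequence[Box], threshold: float) -> float:
    """Fraction of (prediction, ground truth) pairs with IoU >= threshold.

    The >= comparison is an assumption of this sketch.
    """
    if len(preds) != len(gts):
        raise ValueError("preds and gts must have the same length")
    if not preds:
        return 0.0
    hits = sum(1 for p, g in zip(preds, gts) if box3d_iou(p, g) >= threshold)
    return hits / len(preds)
```

For example, calling `acc_at_iou(preds, gts, 0.25)` and `acc_at_iou(preds, gts, 0.5)` on the same box lists would give the two per-subset numbers; the stricter 0.5 threshold can never score higher than the 0.25 one, which matches the pattern in the results above.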