Submitted by Dave Zhenyu Chen.

Submission data

Full nameVoteNet + GRU
Publication titleScan2Cap: Context-aware Dense Captioning in RGB-D Scans
Publication authorsDave Zhenyu Chen, Ali Gholami, Matthias Nießner and Angel X. Chang
Publication venueCVPR 2021
Publication URLhttps://openaccess.thecvf.com/content/CVPR2021/papers/Chen_Scan2Cap_Context-Aware_Dense_Captioning_in_RGB-D_Scans_CVPR_2021_paper.pdf
Input Data TypesUses XYZ coordinates,Uses Multiview Image Features,Uses Normal Vectors
Programming language(s)Python
HardwareGeForce RTX 2080 Ti, 11GB RAM
Websitehttps://daveredrum.github.io/Scan2Cap/
Source code or download URLhttps://github.com/daveredrum/Scan2Cap
Submission creation date25 Aug, 2022
Last edited25 Aug, 2022

Captioning

Captioning F1-Score Dense Captioning Object Detection
CIDEr@0.5IoUBLEU-4@0.5IoURouge-L@0.5IoUMETEOR@0.5IoUDCmAPmAP@0.5
0.08490.05760.10730.04920.09700.2481