Full name | ScanRefer + BRNet + DGCNN + Self-Attention + Cross-modal Attention (end-to-end) |
Description | Uses BRNet for object detection, self-attention for language embeddings, and cross-modal attention for visual and textual embeddings, then applies DGCNN to the fused features |
Input Data Types | Uses XYZ coordinates, Uses Multiview Image Features, Uses Normal Vectors |
Programming language(s) | Python |
Hardware | GeForce RTX 2080 Ti |
Source code or download URL | https://github.com/yoonhachoe/ScanRefer-GAB |
Submission creation date | 8 Jul, 2021 |
Last edited | 15 Jul, 2021 |