Full name | Unifying 2D and 3D Vision-Language Understanding |
Publication title | Unifying 2D and 3D Vision-Language Understanding |
Publication authors | Ayush Jain, Alexander Swerdlow, Yuzhou Wang, Alexander Sax, Franziska Meier, Katerina Fragkiadaki |
Publication URL | https://arxiv.org/abs/2503.10745 |
Input Data Types | Uses XYZ coordinates,Uses RGB values,Uses Multiview Image Features |
Programming language(s) | Python with CUDA |
Hardware | A100/L40S, >=40GB of VRAM |
Website | https://univlg.github.io/ |
Source code or download URL | https://github.com/facebookresearch/univlg |
Submission creation date | 18 Feb, 2025 |
Last edited | 18 Mar, 2025 |