| Full name | Unifying 2D and 3D Vision-Language Understanding |
| Publication title | Unifying 2D and 3D Vision-Language Understanding |
| Publication authors | Ayush Jain, Alexander Swerdlow, Yuzhou Wang, Alexander Sax, Franziska Meier, Katerina Fragkiadaki |
| Publication URL | https://arxiv.org/abs/2503.10745 |
| Input Data Types | Uses XYZ coordinates,Uses RGB values,Uses Multiview Image Features |
| Programming language(s) | Python with CUDA |
| Hardware | A100/L40S, >=40GB of VRAM |
| Website | https://univlg.github.io/ |
| Source code or download URL | https://github.com/facebookresearch/univlg |
| Submission creation date | 18 Feb, 2025 |
| Last edited | 18 Mar, 2025 |