Novel View Synthesis on DSLR Images

The novel view synthesis task is to render images from novel viewpoints given a dense RGB capture of the scene. The images are captured by a fisheye DSLR camera, and camera poses from COLMAP are provided for every training and test image.

We also provide the undistorted evaluation track: rendering undistorted perspective (pinhole) images of the given poses. The training images and the GT are undistorted from the raw fisheye images using the ScanNet++ Toolbox. The undistorted DSLR images are included in the download by default after 30.04.2025.

We provide a 3DGS example codebase, which showcases how to use ScanNet++ for training and evaluation. It also includes a viewer tool to help visualize the camera poses.

New in v2: Small Set Evaluation

The complete testing set for NVS consists of 50 scenes (referred to as the full set).

For quicker evaluation of per-scene optimization methods, we also offer a small set of 12 scenes. Users have the option to submit results for the small set alone or for the full set. Submissions on the full set will be evaluated on both the small set and the full set.

Evaluation and Metrics

We evaluate the similarity beween the ground truth and generated RGB images. Our evaluation metrics are peak signal-to-noise ratio (PSNR), structural similarity index measure (SSIM), and learned perceptual image patch similarity (LPIPS). For each pair of generated and ground-truth images, we compute these three metrics, and the numbers reported in the table are the average over all the images across all the scenes.

Evaluation is carried out on GT images with resolution 1752 x 1168. Submitted images will be automatically resized if their resolutions differ from this.

Evaluation excludes the pixels which are anonymized. Anonymized pixels are specified in resized_anon_masks and original_anon_masks.

The benchmark is currently evaluated on the v2 version of the dataset.

Results (small set)

The small set is a subset of the full NVS test set, and contains 12 scenes. The small set is identical across versions v1 and v2 of the dataset.

Methods	PSNR	SSIM	LPIPS
Instant-NGP	22.631	0.851	0.379
Thomas Müller, Alex Evans, Christoph Schied, Alexander Keller. Instant Neural Graphics Primitives with a Multiresolution Hash Encoding. SIGGRAPH 2022
Nerfacto	22.425	0.848	0.349
Matthew Tancik, Ethan Weber, Evonne Ng, Ruilong Li, Brent Yi, Justin Kerr, Terrance Wang, Alexander Kristoffersen, Jake Austin, Kamyar Salahi, Abhik Ahuja, David McAllister, Angjoo Kanazawa. Nerfstudio: A Modular Framework for Neural Radiance Field Development. SIGGRAPH 2023

Results (full set)

The full set is the complete NVS test set, and contains 50 scenes.

Methods	PSNR	SSIM	LPIPS

Please refer to the submission instructions before making a submission

Submit results