This table lists the benchmark results for the 3D semantic label scenario.


Method Infoavg ioubathtubbedbookshelfcabinetchaircountercurtaindeskdoorfloorotherfurniturepicturerefrigeratorshower curtainsinksofatabletoiletwallwindow
sort bysort bysort bysort bysort bysorted bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort by
SparseConvNet0.725 20.647 180.821 10.846 10.721 10.869 10.533 10.754 60.603 40.614 20.955 10.572 10.325 10.710 20.870 20.724 20.823 10.628 30.934 10.865 10.683 2
MinkowskiNetpermissive0.736 10.859 20.818 20.832 20.709 20.840 20.521 20.853 10.660 10.643 10.951 30.544 20.286 70.731 10.893 10.675 50.772 40.683 10.874 80.852 20.727 1
C. Choy, J. Gwak, S. Savarese: 4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks. CVPR 2019
joint point-basedpermissive0.634 80.614 200.778 40.667 150.633 70.825 30.420 90.804 30.467 200.561 50.951 30.494 40.291 40.566 90.458 170.579 160.764 50.559 130.838 130.814 50.598 13
Hung-Yueh Chiang, Yen-Liang Lin, Yueh-Cheng Liu, Winston H. Hsu: A Unified Point-Based Framework for 3D Segmentation. 3DV 2019
PointConvpermissive0.666 50.781 60.759 60.699 100.644 50.822 40.475 40.779 40.564 90.504 110.953 20.428 110.203 180.586 70.754 60.661 70.753 60.588 60.902 20.813 60.642 4
Wenxuan Wu, Zhongang Qi, Li Fuxin: PointConv: Deep Convolutional Networks on 3D Point Clouds. CVPR 2019
KP-FCNN0.684 40.847 40.758 70.784 60.647 40.814 50.473 50.772 50.605 30.594 30.935 170.450 80.181 220.587 60.805 50.690 40.785 30.614 40.882 50.819 40.632 6
H. Thomas, C. Qi, J. Deschaud, B. Marcotegui, F. Goulette, L. Guibas.: KPConv: Flexible and Deformable Convolution for Point Clouds. ICCV 2019
MCCNNpermissive0.633 90.866 10.731 90.771 70.576 130.809 60.410 130.684 110.497 150.491 120.949 60.466 50.105 280.581 80.646 90.620 110.680 130.542 160.817 160.795 100.618 8
P. Hermosilla, T. Ritschel, P.P. Vazquez, A. Vinacua, T. Ropinski: Monte Carlo Convolution for Learning on Non-Uniformly Sampled Point Clouds. SIGGRAPH Asia 2018
DMC-Net0.653 60.771 70.701 120.801 40.619 80.807 70.463 70.680 120.495 160.520 90.940 130.452 70.301 30.496 150.816 40.664 60.719 70.563 110.822 150.799 90.638 5
CU-Hybrid Net0.693 30.596 210.789 30.803 30.677 30.800 80.469 60.846 20.554 100.591 40.948 70.500 30.316 20.609 40.847 30.732 10.808 20.593 50.894 40.839 30.652 3
SPH3D-GCNpermissive0.610 130.858 30.772 50.489 270.532 150.792 90.404 140.643 170.570 70.507 100.935 170.414 120.046 320.510 130.702 70.602 120.705 80.549 140.859 110.773 140.534 18
Huan Lei, Naveed Akhtar, and Ajmal Mian: Spherical Kernel for Efficient Graph Convolution on 3D Point Clouds.
MVPNetpermissive0.641 70.831 50.715 110.671 130.590 110.781 100.394 150.679 140.642 20.553 60.937 160.462 60.256 90.649 30.406 200.626 100.691 100.666 20.877 60.792 110.608 10
Maximilian Jaritz, Jiayuan Gu, Hao Su: Multi-view PointNet for 3D Scene Understanding. GMDL Workshop, ICCV 2019
PointASNL0.630 110.738 100.729 100.764 90.637 60.779 110.416 110.626 180.518 120.530 70.951 30.398 140.260 80.518 110.576 130.590 140.687 110.568 90.872 90.810 70.631 7
LAP-D0.594 140.720 130.692 150.637 180.456 210.773 120.391 170.730 70.587 50.445 180.940 130.381 160.288 50.434 180.453 180.591 130.649 150.581 70.777 200.749 190.610 9
SIConv0.594 140.768 80.639 230.616 190.544 140.768 130.419 100.601 200.513 140.474 160.946 80.402 130.213 160.387 220.581 120.633 90.683 120.549 140.843 120.774 130.521 20
HPEIN0.618 120.729 120.668 170.647 160.597 100.766 140.414 120.680 120.520 110.525 80.946 80.432 100.215 140.493 160.599 110.638 80.617 210.570 80.897 30.806 80.605 11
Li Jiang, Hengshuang Zhao, Shu Liu, Xiaoyong Shen, Chi-Wing Fu, Jiaya Jia: Hierarchical Point-Edge Interaction Network for Point Cloud Semantic Segmentation. ICCV 2019
DPC0.592 160.720 130.700 130.602 220.480 180.762 150.380 190.713 90.585 60.437 190.940 130.369 180.288 50.434 180.509 160.590 140.639 190.567 100.772 210.755 170.592 14
Francis Engelmann, Theodora Kontogianni, Bastian Leibe: Dilated Point Convolutions. arXiv
3DSM_DMMF0.631 100.626 190.745 80.801 40.607 90.751 160.506 30.729 80.565 80.491 120.866 300.434 90.197 200.595 50.630 100.709 30.705 80.560 120.875 70.740 200.491 23
Pointnet++ & Featurepermissive0.557 200.735 110.661 190.686 110.491 170.744 170.392 160.539 230.451 210.375 220.946 80.376 170.205 170.403 210.356 220.553 190.643 170.497 190.824 140.756 160.515 21
CCRFNet0.589 170.766 90.659 200.683 120.470 200.740 180.387 180.620 190.490 170.476 140.922 220.355 210.245 100.511 120.511 150.571 170.643 170.493 200.872 90.762 150.600 12
TextureNetpermissive0.566 180.672 160.664 180.671 130.494 160.719 190.445 80.678 150.411 250.396 200.935 170.356 200.225 120.412 200.535 140.565 180.636 200.464 220.794 190.680 250.568 15
Jingwei Huang, Haotian Zhang, Li Yi, Thomas Funkerhouser, Matthias Niessner, Leonidas Guibas: TextureNet: Consistent Local Parametrizations for Learning from High-Resolution Signals on Meshes. CVPR
PointCNN with RGBpermissive0.458 250.577 230.611 250.356 330.321 300.715 200.299 240.376 270.328 300.319 240.944 110.285 260.164 230.216 310.229 270.484 230.545 250.456 240.755 220.709 220.475 25
Yangyan Li, Rui Bu, Mingchao Sun, Baoquan Chen: PointCNN. NeurIPS 2018
PCNN0.498 230.559 240.644 220.560 250.420 230.711 210.229 300.414 240.436 220.352 230.941 120.324 230.155 250.238 280.387 210.493 210.529 270.509 170.813 170.751 180.504 22
3DMV, FTSDF0.501 220.558 250.608 260.424 310.478 190.690 220.246 280.586 210.468 190.450 170.911 250.394 150.160 240.438 170.212 280.432 260.541 260.475 210.742 230.727 210.477 24
DVVNet0.562 190.648 170.700 130.770 80.586 120.687 230.333 210.650 160.514 130.475 150.906 270.359 190.223 130.340 240.442 190.422 270.668 140.501 180.708 250.779 120.534 18
FCPNpermissive0.447 260.679 150.604 270.578 240.380 250.682 240.291 250.106 330.483 180.258 310.920 230.258 280.025 330.231 300.325 230.480 240.560 240.463 230.725 240.666 270.231 33
Dario Rethage, Johanna Wald, Jürgen Sturm, Nassir Navab, Federico Tombari: Fully-Convolutional Point Networks for Large-Scale Point Clouds. ECCV 2018
SPLAT Netcopyleft0.393 300.472 300.511 300.606 200.311 310.656 250.245 290.405 250.328 300.197 320.927 200.227 310.000 350.001 350.249 260.271 340.510 280.383 310.593 300.699 230.267 31
Hang Su, Varun Jampani, Deqing Sun, Subhransu Maji, Evangelos Kalogerakis, Ming-Hsuan Yang, Jan Kautz: SPLATNet: Sparse Lattice Networks for Point Cloud Processing. CVPR 2018
SurfaceConvPF0.442 270.505 270.622 240.380 320.342 290.654 260.227 310.397 260.367 280.276 280.924 210.240 290.198 190.359 230.262 250.366 290.581 220.435 270.640 280.668 260.398 27
Hao Pan, Shilin Liu, Yang Liu, Xin Tong: Convolutional Neural Networks on 3D Surfaces Using Parallel Frames.
Tangent Convolutionspermissive0.438 290.437 310.646 210.474 280.369 260.645 270.353 200.258 300.282 320.279 270.918 240.298 250.147 260.283 250.294 240.487 220.562 230.427 280.619 290.633 280.352 29
Maxim Tatarchenko, Jaesik Park, Vladlen Koltun, Qian-Yi Zhou: Tangent convolutions for dense prediction in 3d. CVPR 2018
PanopticFusion-label0.529 210.491 280.688 160.604 210.386 240.632 280.225 320.705 100.434 230.293 260.815 310.348 220.241 110.499 140.669 80.507 200.649 150.442 260.796 180.602 300.561 16
Gaku Narita, Takashi Seno, Tomoya Ishikawa, Yohsuke Kaji: PanopticFusion: Online Volumetric Semantic Mapping at the Level of Stuff and Things. IROS 2019 (to appear)
PNET20.442 270.548 260.548 280.597 230.363 270.628 290.300 230.292 280.374 270.307 250.881 290.268 270.186 210.238 280.204 300.407 280.506 310.449 250.667 270.620 290.462 26
ScanNet+FTSDF0.383 310.297 330.491 310.432 300.358 280.612 300.274 260.116 320.411 250.265 290.904 280.229 300.079 300.250 260.185 310.320 320.510 280.385 300.548 310.597 320.394 28
3DMV0.484 240.484 290.538 290.643 170.424 220.606 310.310 220.574 220.433 240.378 210.796 320.301 240.214 150.537 100.208 290.472 250.507 300.413 290.693 260.602 300.539 17
Angela Dai, Matthias Niessner: 3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation. ECCV'18
SSC-UNetpermissive0.308 330.353 320.290 340.278 340.166 340.553 320.169 340.286 290.147 340.148 340.908 260.182 330.064 310.023 340.018 350.354 310.363 320.345 320.546 330.685 240.278 30
ScanNetpermissive0.306 340.203 340.366 330.501 260.311 310.524 330.211 330.002 350.342 290.189 330.786 330.145 340.102 290.245 270.152 320.318 330.348 330.300 330.460 340.437 340.182 34
Angela Dai, Angel X. Chang, Manolis Savva, Maciej Halber, Thomas Funkhouser, Matthias Nießner: ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes. CVPR'17
PointNet++permissive0.339 320.584 220.478 320.458 290.256 330.360 340.250 270.247 310.278 330.261 300.677 340.183 320.117 270.212 320.145 330.364 300.346 340.232 340.548 310.523 330.252 32
Charles R. Qi, Li Yi, Hao Su, Leonidas J. Guibas: pointnet++: deep hierarchical feature learning on point sets in a metric space.
ERROR0.054 350.000 350.041 350.172 350.030 350.062 350.001 350.035 340.004 350.051 350.143 350.019 350.003 340.041 330.050 340.003 350.054 350.018 350.005 350.264 350.082 35

This table lists the benchmark results for the 3D semantic instance scenario.




Method Infoavg ap 50%bathtubbedbookshelfcabinetchaircountercurtaindeskdoorotherfurniturepicturerefrigeratorshower curtainsinksofatabletoiletwindow
sort bysort bysort bysort bysort bysorted bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort by
OccuSeg0.634 31.000 10.902 10.771 20.461 50.814 10.282 10.583 90.328 50.472 10.471 20.295 70.600 31.000 10.650 20.664 70.587 41.000 10.537 1
PointGroup0.636 21.000 10.765 50.624 40.505 30.797 20.116 60.696 40.384 20.441 30.559 10.476 10.596 51.000 10.666 10.756 40.556 60.997 50.513 2
SSEN0.568 51.000 10.747 60.449 90.371 70.760 30.143 30.706 30.336 40.439 40.430 60.306 60.600 30.857 50.407 80.831 20.611 10.944 70.283 10
MPA0.610 41.000 10.833 30.765 30.546 10.750 40.140 40.588 80.478 10.433 50.454 30.376 30.650 20.857 50.429 60.765 30.537 81.000 10.378 7
GICN0.638 11.000 10.895 20.800 10.480 40.676 50.144 20.737 20.354 30.447 20.400 80.365 40.700 11.000 10.569 30.836 10.599 21.000 10.473 3
MTML0.549 61.000 10.807 40.588 70.327 90.647 60.004 150.815 10.180 90.418 60.364 100.182 90.445 71.000 10.442 50.688 60.571 51.000 10.396 5
Jean Lahoud, Bernard Ghanem, Marc Pollefeys, Martin R. Oswald: 3D Instance Segmentation via Multi-task Metric Learning. ICCV 2019 [oral]
MASCpermissive0.447 110.528 140.555 130.381 100.382 60.633 70.002 160.509 100.260 70.361 80.432 50.327 50.451 60.571 120.367 110.639 80.386 100.980 60.276 11
Chen Liu, Yasutaka Furukawa: MASC: Multi-scale Affinity with Sparse Convolution for 3D Instance Segmentation.
Occipital-SCS0.512 71.000 10.716 80.509 80.506 20.611 80.092 80.602 70.177 100.346 90.383 90.165 100.442 80.850 90.386 100.618 90.543 70.889 110.389 6
R-PointNet0.306 140.500 150.405 150.311 110.348 80.589 90.054 90.068 160.126 120.283 120.290 110.028 150.219 130.214 160.331 120.396 150.275 130.821 150.245 12
ResNet-backbone0.459 101.000 10.737 70.159 160.259 110.587 100.138 50.475 110.217 80.416 70.408 70.128 110.315 120.714 110.411 70.536 130.590 30.873 130.304 9
3D-SISpermissive0.382 121.000 10.432 140.245 130.190 130.577 110.013 130.263 130.033 160.320 110.240 130.075 140.422 100.857 50.117 150.699 50.271 140.883 120.235 13
Ji Hou, Angela Dai, Matthias Niessner: 3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans. CVPR 2019
PanopticFusion-inst0.478 90.667 110.712 100.595 50.259 120.550 120.000 180.613 60.175 110.250 130.434 40.437 20.411 110.857 50.485 40.591 120.267 150.944 70.359 8
Gaku Narita, Takashi Seno, Tomoya Ishikawa, Yohsuke Kaji: PanopticFusion: Online Volumetric Semantic Mapping at the Level of Stuff and Things. IROS 2019 (to appear)
3D-BoNet0.488 81.000 10.672 110.590 60.301 100.484 130.098 70.620 50.306 60.341 100.259 120.125 120.434 90.796 100.402 90.499 140.513 90.909 100.439 4
Bo Yang, Jianan Wang, Ronald Clark, Qingyong Hu, Sen Wang, Andrew Markham, Niki Trigoni: Learning Object Bounding Boxes for 3D Instance Segmentation on Point Clouds. NeurIPS 2019 Spotlight
UNet-backbone0.319 130.667 110.715 90.233 140.189 140.479 140.008 140.218 140.067 150.201 140.173 140.107 130.123 140.438 130.150 130.615 100.355 110.916 90.093 17
3D-BEVIS0.248 150.667 110.566 120.076 170.035 180.394 150.027 110.035 170.098 130.099 160.030 170.025 160.098 150.375 140.126 140.604 110.181 160.854 140.171 14
Cathrin Elich, Francis Engelmann, Jonas Schult, Theodora Kontogianni, Bastian Leibe: 3D-BEVIS: Birds-Eye-View Instance Segmentation.
Seg-Clusterpermissive0.215 160.370 160.337 170.285 120.105 150.325 160.025 120.282 120.085 140.105 150.107 150.007 180.079 160.317 150.114 160.309 170.304 120.587 160.123 16
Sgpn_scannet0.143 170.208 180.390 160.169 150.065 160.275 170.029 100.069 150.000 170.087 170.043 160.014 170.027 180.000 170.112 170.351 160.168 170.438 170.138 15
MaskRCNN 2d->3d Proj0.058 180.333 170.002 180.000 180.053 170.002 180.002 170.021 180.000 170.045 180.024 180.238 80.065 170.000 170.014 180.107 180.020 180.110 180.006 18

This table lists the benchmark results for the 2D semantic label scenario.


Method Infoavg ioubathtubbedbookshelfcabinetchaircountercurtaindeskdoorfloorotherfurniturepicturerefrigeratorshower curtainsinksofatabletoiletwallwindow
sort bysort bysort bysort bysort bysorted bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort by
CU-Hybrid-2D Net0.636 10.825 10.820 10.179 130.648 10.463 10.549 10.742 10.676 10.628 10.961 10.420 10.379 30.684 10.381 70.732 10.723 10.599 10.827 50.851 10.634 1
DMMF_3d0.605 20.651 40.744 50.782 10.637 20.387 20.536 20.732 20.590 30.540 20.856 90.359 40.306 80.596 40.539 10.627 80.706 20.497 40.785 90.757 80.476 9
3DMV (2d proj)0.498 100.481 110.612 100.579 60.456 100.343 30.384 90.623 90.525 80.381 100.845 100.254 100.264 100.557 70.182 110.581 110.598 90.429 100.760 110.661 120.446 11
Angela Dai, Matthias Niessner: 3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation. ECCV'18
DMMF0.597 30.543 80.755 40.749 20.585 40.338 40.494 40.704 40.598 20.494 80.911 40.347 60.327 70.593 50.527 20.675 40.646 60.513 20.842 30.774 60.527 7
AdapNet++copyleft0.503 90.613 60.722 70.418 100.358 130.337 50.370 110.479 110.443 100.368 110.907 50.207 110.213 120.464 110.525 30.618 90.657 30.450 90.788 80.721 100.408 12
Abhinav Valada, Rohit Mohan, Wolfram Burgard: Self-Supervised Model Adaptation for Multimodal Semantic Segmentation. International Journal of Computer Vision, 2019
MCA-Net0.595 40.533 90.756 30.746 30.590 30.334 60.506 30.670 50.587 40.500 60.905 60.366 30.352 40.601 30.506 40.669 70.648 40.501 30.839 40.769 70.516 8
RFBNet0.592 50.616 50.758 20.659 40.581 50.330 70.469 50.655 80.543 70.524 30.924 20.355 50.336 60.572 60.479 50.671 50.648 40.480 50.814 70.814 20.614 3
SSMAcopyleft0.577 70.695 20.716 80.439 90.563 60.314 80.444 70.719 30.551 50.503 50.887 80.346 70.348 50.603 20.353 90.709 20.600 80.457 80.901 10.786 40.599 4
Abhinav Valada, Rohit Mohan, Wolfram Burgard: Self-Supervised Model Adaptation for Multimodal Semantic Segmentation. International Journal of Computer Vision, 2019
DCRedNet0.583 60.682 30.723 60.542 70.510 80.310 90.451 60.668 60.549 60.520 40.920 30.375 20.446 10.528 80.417 60.670 60.577 100.478 60.862 20.806 30.628 2
FuseNetpermissive0.535 80.570 70.681 90.182 120.512 70.290 100.431 80.659 70.504 90.495 70.903 70.308 80.428 20.523 90.365 80.676 30.621 70.470 70.762 100.779 50.541 6
Caner Hazirbas, Lingni Ma, Csaba Domokos, Daniel Cremers: FuseNet: Incorporating Depth into Semantic Segmentation via Fusion-based CNN Architecture. ACCV 2016
Enet (reimpl)0.376 120.264 130.452 130.452 80.365 110.181 110.143 130.456 120.409 130.346 120.769 130.164 120.218 110.359 120.123 130.403 130.381 130.313 130.571 120.685 110.472 10
Re-implementation of Adam Paszke, Abhishek Chaurasia, Sangpil Kim, Eugenio Culurciello: ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation.
ScanNet (2d proj)permissive0.330 130.293 120.521 120.657 50.361 120.161 120.250 120.004 130.440 110.183 130.836 110.125 130.060 130.319 130.132 120.417 120.412 120.344 120.541 130.427 130.109 13
Angela Dai, Angel X. Chang, Manolis Savva, Maciej Halber, Thomas Funkhouser, Matthias Nießner: ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes. CVPR'17
ILC-PSPNet0.475 110.490 100.581 110.289 110.507 90.067 130.379 100.610 100.417 120.435 90.822 120.278 90.267 90.503 100.228 100.616 100.533 110.375 110.820 60.729 90.560 5

This table lists the benchmark results for the 2D semantic instance scenario.




Method Infoavg apbathtubbedbookshelfcabinetchaircountercurtaindeskdoorotherfurniturepicturerefrigeratorshower curtainsinksofatabletoiletwindow
sort bysort bysort bysort bysort bysorted bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort by
MaskRCNN_ScanNetpermissive0.119 10.129 10.212 10.002 10.112 10.148 10.014 10.205 10.044 10.066 10.078 10.095 10.142 10.030 10.128 10.139 10.080 10.459 10.057 1
Re-implementation of Kaiming He, Georgia Gkioxari, Piotr Dollár, Ross Girshick: Mask R-CNN. ICCV'17

This table lists the benchmark results for the scene type classification scenario.




Method Infoavg recallapartmentbathroombedroom / hotelbookstore / libraryconference roomcopy/mail roomhallwaykitchenlaundry roomliving room / loungemiscofficestorage / basement / garage
sorted bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort by
MTL0.700 10.500 11.000 10.882 20.500 11.000 11.000 10.500 11.000 11.000 10.778 10.000 20.938 10.000 1
SE-ResNeXt-SSMA0.498 20.000 30.812 20.941 10.500 10.500 20.500 20.500 10.429 30.500 20.667 20.500 10.625 20.000 1
Abhinav Valada, Rohit Mohan, Wolfram Burgard: Self-Supervised Model Adaptation for Multimodal Semantic Segmentation. arXiv
resnet50_scannet0.353 30.250 20.812 20.529 30.500 10.500 20.000 30.500 10.571 20.000 30.556 30.000 20.375 30.000 1