This table lists the benchmark results for the 3D semantic label scenario.


Method Infoavg ioubathtubbedbookshelfcabinetchaircountercurtaindeskdoorfloorotherfurniturepicturerefrigeratorshower curtainsinksofatabletoiletwallwindow
sorted bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort by
MinkowskiNetpermissive0.736 10.859 20.818 20.832 20.709 20.840 20.521 20.853 10.660 10.643 10.951 40.544 20.286 60.731 10.893 10.675 40.772 30.683 10.874 70.852 20.727 1
C. Choy, J. Gwak, S. Savarese: 4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks. CVPR 2019
SparseConvNet0.725 20.647 170.821 10.846 10.721 10.869 10.533 10.754 50.603 40.614 20.955 10.572 10.325 20.710 20.870 20.724 10.823 10.628 30.934 10.865 10.683 2
KP-FCNN0.684 30.847 40.758 60.784 30.647 30.814 50.473 40.772 40.605 30.594 30.935 150.450 60.181 190.587 40.805 30.690 20.785 20.614 40.882 40.819 30.632 5
H. Thomas, C. Qi, J. Deschaud, B. Marcotegui, F. Goulette, L. Guibas.: KPConv: Flexible and Deformable Convolution for Point Clouds. ICCV 2019
PointConvpermissive0.666 40.781 60.759 50.699 90.644 40.822 40.475 30.779 30.564 80.504 90.953 20.428 80.203 150.586 50.754 40.661 50.753 50.588 50.902 20.813 50.642 3
Wenxuan Wu, Zhongang Qi, Li Fuxin: PointConv: Deep Convolutional Networks on 3D Point Clouds. CVPR 2019
MVPNet0.641 50.831 50.715 100.671 120.590 70.781 90.394 120.679 130.642 20.553 50.937 140.462 50.256 70.649 30.406 180.626 80.691 90.666 20.877 60.792 90.608 8
Maximilian Jaritz, Jiayuan Gu, Hao Su: Multi-view PointNet for 3D Scene Understanding. GMDL Workshop, ICCV 2019
joint point-basedpermissive0.634 60.614 180.778 30.667 140.633 50.825 30.420 80.804 20.467 180.561 40.951 40.494 30.291 30.566 70.458 150.579 140.764 40.559 90.838 110.814 40.598 11
Hung-Yueh Chiang, Yen-Liang Lin, Yueh-Cheng Liu, Winston H. Hsu: A Unified Point-Based Framework for 3D Segmentation. 3DV 2019
MCCNNpermissive0.633 70.866 10.731 80.771 40.576 90.809 60.410 100.684 110.497 140.491 100.949 60.466 40.105 260.581 60.646 80.620 90.680 110.542 130.817 130.795 80.618 6
P. Hermosilla, T. Ritschel, P.P. Vazquez, A. Vinacua, T. Ropinski: Monte Carlo Convolution for Learning on Non-Uniformly Sampled Point Clouds. SIGGRAPH Asia 2018
DMC-Net0.630 80.706 130.738 70.745 60.535 120.787 80.335 180.742 60.512 120.546 60.941 100.427 90.330 10.496 120.653 70.634 70.719 60.550 110.754 200.801 70.642 3
HPEIN0.618 90.729 100.668 160.647 150.597 60.766 110.414 90.680 120.520 100.525 70.946 70.432 70.215 120.493 140.599 90.638 60.617 190.570 70.897 30.806 60.605 9
Li Jiang, Hengshuang Zhao, Shu Liu, Xiaoyong Shen, Chi-Wing Fu, Jiaya Jia: Hierarchical Point-Edge Interaction Network for Point Cloud Semantic Segmentation. ICCV 2019
SPH3D-GCNpermissive0.610 100.858 30.772 40.489 250.532 130.792 70.404 110.643 160.570 70.507 80.935 150.414 110.046 300.510 100.702 50.602 110.705 80.549 120.859 100.773 110.534 17
Huan Lei, Naveed Akhtar, and Ajmal Mian: Spherical Kernel for Efficient Graph Convolution on 3D Point Clouds.
FLConv0.605 110.751 80.695 130.701 80.545 110.758 130.448 50.596 180.538 90.428 170.953 20.398 120.160 210.457 150.598 100.612 100.686 100.551 100.880 50.768 120.585 13
CDF-SM3D0.597 120.557 230.728 90.703 70.572 100.730 160.435 70.715 80.504 130.467 130.863 280.427 90.184 180.495 130.569 110.685 30.717 70.516 140.866 90.726 190.479 21
LAP-D0.594 130.720 110.692 140.637 170.456 190.773 100.391 140.730 70.587 50.445 150.940 120.381 140.288 40.434 170.453 160.591 120.649 130.581 60.777 170.749 170.610 7
DPC0.592 140.720 110.700 110.602 200.480 160.762 120.380 160.713 90.585 60.437 160.940 120.369 160.288 40.434 170.509 140.590 130.639 170.567 80.772 180.755 150.592 12
Francis Engelmann, Theodora Kontogianni, Bastian Leibe: Dilated Point Convolutions. arXiv
CCRFNet0.589 150.766 70.659 190.683 110.470 180.740 150.387 150.620 170.490 150.476 110.922 200.355 190.245 80.511 90.511 130.571 150.643 150.493 180.872 80.762 130.600 10
TextureNetpermissive0.566 160.672 150.664 170.671 120.494 140.719 170.445 60.678 140.411 230.396 180.935 150.356 180.225 100.412 190.535 120.565 160.636 180.464 200.794 160.680 230.568 14
Jingwei Huang, Haotian Zhang, Li Yi, Thomas Funkerhouser, Matthias Niessner, Leonidas Guibas: TextureNet: Consistent Local Parametrizations for Learning from High-Resolution Signals on Meshes. CVPR
DVVNet0.562 170.648 160.700 110.770 50.586 80.687 210.333 190.650 150.514 110.475 120.906 250.359 170.223 110.340 220.442 170.422 250.668 120.501 160.708 230.779 100.534 17
Pointnet++ & Featurepermissive0.557 180.735 90.661 180.686 100.491 150.744 140.392 130.539 210.451 190.375 200.946 70.376 150.205 140.403 200.356 200.553 170.643 150.497 170.824 120.756 140.515 19
PanopticFusion-label0.529 190.491 260.688 150.604 190.386 220.632 260.225 300.705 100.434 210.293 240.815 290.348 200.241 90.499 110.669 60.507 180.649 130.442 240.796 150.602 280.561 15
Gaku Narita, Takashi Seno, Tomoya Ishikawa, Yohsuke Kaji: PanopticFusion: Online Volumetric Semantic Mapping at the Level of Stuff and Things. IROS 2019 (to appear)
3DMV, FTSDF0.501 200.558 220.608 240.424 290.478 170.690 200.246 260.586 190.468 170.450 140.911 230.394 130.160 210.438 160.212 260.432 240.541 240.475 190.742 210.727 180.477 22
PCNN0.498 210.559 210.644 210.560 230.420 210.711 190.229 280.414 220.436 200.352 210.941 100.324 210.155 230.238 260.387 190.493 190.529 250.509 150.813 140.751 160.504 20
3DMV0.484 220.484 270.538 270.643 160.424 200.606 290.310 200.574 200.433 220.378 190.796 300.301 220.214 130.537 80.208 270.472 230.507 280.413 270.693 240.602 280.539 16
Angela Dai, Matthias Niessner: 3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation. ECCV'18
PointCNN with RGBpermissive0.458 230.577 200.611 230.356 310.321 280.715 180.299 220.376 250.328 280.319 220.944 90.285 240.164 200.216 290.229 250.484 210.545 230.456 220.755 190.709 200.475 23
Yangyan Li, Rui Bu, Mingchao Sun, Baoquan Chen: PointCNN. NeurIPS 2018
FCPNpermissive0.447 240.679 140.604 250.578 220.380 230.682 220.291 230.106 310.483 160.258 290.920 210.258 260.025 310.231 280.325 210.480 220.560 220.463 210.725 220.666 250.231 31
Dario Rethage, Johanna Wald, Jürgen Sturm, Nassir Navab, Federico Tombari: Fully-Convolutional Point Networks for Large-Scale Point Clouds. ECCV 2018
SurfaceConvPF0.442 250.505 250.622 220.380 300.342 270.654 240.227 290.397 240.367 260.276 260.924 190.240 270.198 160.359 210.262 230.366 270.581 200.435 250.640 260.668 240.398 25
Hao Pan, Shilin Liu, Yang Liu, Xin Tong: Convolutional Neural Networks on 3D Surfaces Using Parallel Frames.
PNET20.442 250.548 240.548 260.597 210.363 250.628 270.300 210.292 260.374 250.307 230.881 270.268 250.186 170.238 260.204 280.407 260.506 290.449 230.667 250.620 270.462 24
Tangent Convolutionspermissive0.438 270.437 290.646 200.474 260.369 240.645 250.353 170.258 280.282 300.279 250.918 220.298 230.147 240.283 230.294 220.487 200.562 210.427 260.619 270.633 260.352 27
Maxim Tatarchenko, Jaesik Park, Vladlen Koltun, Qian-Yi Zhou: Tangent convolutions for dense prediction in 3d. CVPR 2018
SPLAT Netcopyleft0.393 280.472 280.511 280.606 180.311 290.656 230.245 270.405 230.328 280.197 300.927 180.227 290.000 330.001 330.249 240.271 320.510 260.383 290.593 280.699 210.267 29
Hang Su, Varun Jampani, Deqing Sun, Subhransu Maji, Evangelos Kalogerakis, Ming-Hsuan Yang, Jan Kautz: SPLATNet: Sparse Lattice Networks for Point Cloud Processing. CVPR 2018
ScanNet+FTSDF0.383 290.297 310.491 290.432 280.358 260.612 280.274 240.116 300.411 230.265 270.904 260.229 280.079 280.250 240.185 290.320 300.510 260.385 280.548 290.597 300.394 26
PointNet++permissive0.339 300.584 190.478 300.458 270.256 310.360 320.250 250.247 290.278 310.261 280.677 320.183 300.117 250.212 300.145 310.364 280.346 320.232 320.548 290.523 310.252 30
Charles R. Qi, Li Yi, Hao Su, Leonidas J. Guibas: pointnet++: deep hierarchical feature learning on point sets in a metric space.
SSC-UNetpermissive0.308 310.353 300.290 320.278 320.166 320.553 300.169 320.286 270.147 320.148 320.908 240.182 310.064 290.023 320.018 330.354 290.363 300.345 300.546 310.685 220.278 28
ScanNetpermissive0.306 320.203 320.366 310.501 240.311 290.524 310.211 310.002 330.342 270.189 310.786 310.145 320.102 270.245 250.152 300.318 310.348 310.300 310.460 320.437 320.182 32
Angela Dai, Angel X. Chang, Manolis Savva, Maciej Halber, Thomas Funkhouser, Matthias Nießner: ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes. CVPR'17
ERROR0.054 330.000 330.041 330.172 330.030 330.062 330.001 330.035 320.004 330.051 330.143 330.019 330.003 320.041 310.050 320.003 330.054 330.018 330.005 330.264 330.082 33

This table lists the benchmark results for the 3D semantic instance scenario.




Method Infoavg ap 50%bathtubbedbookshelfcabinetchaircountercurtaindeskdoorotherfurniturepicturerefrigeratorshower curtainsinksofatabletoiletwindow
sorted bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort by
DCNet0.634 11.000 10.902 10.771 10.461 20.814 10.282 10.583 50.328 10.472 10.471 10.295 30.600 11.000 10.650 10.664 30.587 21.000 10.537 1
MTML0.549 21.000 10.807 20.588 40.327 50.647 20.004 120.815 10.180 50.418 20.364 60.182 50.445 31.000 10.442 40.688 20.571 31.000 10.396 3
Occipital-SCS0.512 31.000 10.716 40.509 50.506 10.611 40.092 50.602 40.177 60.346 50.383 50.165 60.442 40.850 50.386 70.618 50.543 40.889 80.389 4
3D-BoNet0.488 41.000 10.672 70.590 30.301 60.484 90.098 40.620 20.306 20.341 60.259 80.125 80.434 50.796 60.402 60.499 100.513 50.909 70.439 2
Bo Yang, Jianan Wang, Ronald Clark, Qingyong Hu, Sen Wang, Andrew Markham, Niki Trigoni: Learning Object Bounding Boxes for 3D Instance Segmentation on Point Clouds. NeurIPS 2019 Spotlight
PanopticFusion-inst0.478 50.667 70.712 60.595 20.259 80.550 80.000 150.613 30.175 70.250 90.434 20.437 10.411 70.857 30.485 20.591 80.267 120.944 50.359 5
Gaku Narita, Takashi Seno, Tomoya Ishikawa, Yohsuke Kaji: PanopticFusion: Online Volumetric Semantic Mapping at the Level of Stuff and Things. IROS 2019 (to appear)
ResNet-backbone0.459 61.000 10.737 30.159 130.259 70.587 60.138 20.475 70.217 40.416 30.408 40.128 70.315 80.714 70.411 50.536 90.590 10.873 100.304 6
MASCpermissive0.447 70.528 100.555 90.381 70.382 30.633 30.002 130.509 60.260 30.361 40.432 30.327 20.451 20.571 80.367 80.639 40.386 70.980 30.276 7
Chen Liu, Yasutaka Furukawa: MASC: Multi-scale Affinity with Sparse Convolution for 3D Instance Segmentation.
3D-SISpermissive0.382 81.000 10.432 110.245 100.190 100.577 70.013 100.263 100.033 130.320 70.240 100.075 110.422 60.857 30.117 120.699 10.271 110.883 90.235 10
Ji Hou, Angela Dai, Matthias Niessner: 3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans. CVPR 2019
DPC-instance0.355 90.500 110.517 100.467 60.228 90.422 110.133 30.405 80.111 90.205 100.241 90.075 100.233 90.306 120.445 30.439 110.457 60.974 40.239 9
Francis Engelmann, Theodora Kontogianni, Bastian Leibe: Dilated Point Convolutions. arXiv
UNet-backbone0.319 100.667 70.715 50.233 110.189 110.479 100.008 110.218 110.067 120.201 110.173 110.107 90.123 110.438 90.150 100.615 60.355 80.916 60.093 14
R-PointNet0.306 110.500 110.405 120.311 80.348 40.589 50.054 60.068 130.126 80.283 80.290 70.028 120.219 100.214 130.331 90.396 120.275 100.821 120.245 8
3D-BEVIS0.248 120.667 70.566 80.076 140.035 150.394 120.027 80.035 140.098 100.099 130.030 140.025 130.098 120.375 100.126 110.604 70.181 130.854 110.171 11
Cathrin Elich, Francis Engelmann, Jonas Schult, Theodora Kontogianni, Bastian Leibe: 3D-BEVIS: Birds-Eye-View Instance Segmentation.
Seg-Clusterpermissive0.215 130.370 130.337 140.285 90.105 120.325 130.025 90.282 90.085 110.105 120.107 120.007 150.079 130.317 110.114 130.309 140.304 90.587 130.123 13
Sgpn_scannet0.143 140.208 150.390 130.169 120.065 130.275 140.029 70.069 120.000 140.087 140.043 130.014 140.027 150.000 140.112 140.351 130.168 140.438 140.138 12
MaskRCNN 2d->3d Proj0.058 150.333 140.002 150.000 150.053 140.002 150.002 140.021 150.000 140.045 150.024 150.238 40.065 140.000 140.014 150.107 150.020 150.110 150.006 15

This table lists the benchmark results for the 2D semantic label scenario.


Method Infoavg ioubathtubbedbookshelfcabinetchaircountercurtaindeskdoorfloorotherfurniturepicturerefrigeratorshower curtainsinksofatabletoiletwallwindow
sorted bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort by
MCA-Net0.595 10.533 60.756 20.746 10.590 10.334 30.506 10.670 20.587 10.500 40.905 40.366 20.352 30.601 20.506 20.669 50.648 20.501 10.839 30.769 50.516 6
RFBNet0.592 20.616 30.758 10.659 20.581 20.330 40.469 20.655 50.543 40.524 10.924 10.355 30.336 50.572 30.479 30.671 30.648 20.480 20.814 50.814 10.614 2
DCRedNet0.583 30.682 20.723 30.542 50.510 50.310 60.451 30.668 30.549 30.520 20.920 20.375 10.446 10.528 50.417 40.670 40.577 70.478 30.862 20.806 20.628 1
SSMAcopyleft0.577 40.695 10.716 50.439 70.563 30.314 50.444 40.719 10.551 20.503 30.887 60.346 40.348 40.603 10.353 60.709 10.600 50.457 50.901 10.786 30.599 3
Abhinav Valada, Rohit Mohan, Wolfram Burgard: Self-Supervised Model Adaptation for Multimodal Semantic Segmentation. International Journal of Computer Vision, 2019
FuseNetpermissive0.535 50.570 50.681 60.182 100.512 40.290 70.431 50.659 40.504 60.495 50.903 50.308 50.428 20.523 60.365 50.676 20.621 40.470 40.762 70.779 40.541 5
Caner Hazirbas, Lingni Ma, Csaba Domokos, Daniel Cremers: FuseNet: Incorporating Depth into Semantic Segmentation via Fusion-based CNN Architecture. ACCV 2016
AdapNet++copyleft0.503 60.613 40.722 40.418 80.358 100.337 20.370 80.479 80.443 70.368 80.907 30.207 80.213 90.464 80.525 10.618 60.657 10.450 60.788 60.721 70.408 9
Abhinav Valada, Rohit Mohan, Wolfram Burgard: Self-Supervised Model Adaptation for Multimodal Semantic Segmentation. International Journal of Computer Vision, 2019
3DMV (2d proj)0.498 70.481 80.612 70.579 40.456 70.343 10.384 60.623 60.525 50.381 70.845 70.254 70.264 70.557 40.182 80.581 80.598 60.429 70.760 80.661 90.446 8
Angela Dai, Matthias Niessner: 3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation. ECCV'18
ILC-PSPNet0.475 80.490 70.581 80.289 90.507 60.067 100.379 70.610 70.417 90.435 60.822 90.278 60.267 60.503 70.228 70.616 70.533 80.375 80.820 40.729 60.560 4
Enet (reimpl)0.376 90.264 100.452 100.452 60.365 80.181 80.143 100.456 90.409 100.346 90.769 100.164 90.218 80.359 90.123 100.403 100.381 100.313 100.571 90.685 80.472 7
Re-implementation of Adam Paszke, Abhishek Chaurasia, Sangpil Kim, Eugenio Culurciello: ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation.
ScanNet (2d proj)permissive0.330 100.293 90.521 90.657 30.361 90.161 90.250 90.004 100.440 80.183 100.836 80.125 100.060 100.319 100.132 90.417 90.412 90.344 90.541 100.427 100.109 10
Angela Dai, Angel X. Chang, Manolis Savva, Maciej Halber, Thomas Funkhouser, Matthias Nießner: ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes. CVPR'17

This table lists the benchmark results for the 2D semantic instance scenario.




Method Infoavg apbathtubbedbookshelfcabinetchaircountercurtaindeskdoorotherfurniturepicturerefrigeratorshower curtainsinksofatabletoiletwindow
sorted bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort by
MaskRCNN_ScanNetpermissive0.119 10.129 10.212 10.002 10.112 10.148 10.014 10.205 10.044 10.066 10.078 10.095 10.142 10.030 10.128 10.139 10.080 10.459 10.057 1
Re-implementation of Kaiming He, Georgia Gkioxari, Piotr Dollár, Ross Girshick: Mask R-CNN. ICCV'17

This table lists the benchmark results for the scene type classification scenario.




Method Infoavg recallapartmentbathroombedroom / hotelbookstore / libraryconference roomcopy/mail roomhallwaykitchenlaundry roomliving room / loungemiscofficestorage / basement / garage
sort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysorted bysort bysort by
SE-ResNeXt-SSMA0.498 20.000 30.812 20.941 10.500 20.500 10.500 10.500 20.429 30.500 10.667 10.500 10.625 20.000 1
Abhinav Valada, Rohit Mohan, Wolfram Burgard: Self-Supervised Model Adaptation for Multimodal Semantic Segmentation. arXiv
SSCN0.534 10.500 10.938 10.824 21.000 10.500 10.000 21.000 10.857 10.000 20.444 30.000 20.875 10.000 1
resnet50_scannet0.353 30.250 20.812 20.529 30.500 20.500 10.000 20.500 20.571 20.000 20.556 20.000 20.375 30.000 1