This table lists the benchmark results for the 3D semantic label scenario.


Method Infoavg ioubathtubbedbookshelfcabinetchaircountercurtaindeskdoorfloorotherfurniturepicturerefrigeratorshower curtainsinksofatabletoiletwallwindow
sorted bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort by
MinkowskiNetpermissive0.736 10.859 20.818 20.832 20.709 20.840 20.521 20.853 10.660 10.643 10.951 40.544 20.286 60.731 10.893 10.675 40.772 30.683 10.874 80.852 20.727 1
C. Choy, J. Gwak, S. Savarese: 4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks. CVPR 2019
SparseConvNet0.725 20.647 170.821 10.846 10.721 10.869 10.533 10.754 50.603 40.614 20.955 10.572 10.325 10.710 20.870 20.724 10.823 10.628 30.934 10.865 10.683 2
KP-FCNN0.684 30.847 40.758 60.784 40.647 30.814 50.473 50.772 40.605 30.594 30.935 150.450 70.181 190.587 40.805 40.690 30.785 20.614 40.882 50.819 30.632 5
H. Thomas, C. Qi, J. Deschaud, B. Marcotegui, F. Goulette, L. Guibas.: KPConv: Flexible and Deformable Convolution for Point Clouds. ICCV 2019
PointConvpermissive0.666 40.781 60.759 50.699 90.644 40.822 40.475 40.779 30.564 80.504 90.953 20.428 90.203 150.586 50.754 50.661 60.753 50.588 50.902 20.813 50.642 3
Wenxuan Wu, Zhongang Qi, Li Fuxin: PointConv: Deep Convolutional Networks on 3D Point Clouds. CVPR 2019
DMC-Net0.653 50.771 70.701 100.801 30.619 60.807 70.463 60.680 110.495 140.520 70.940 110.452 60.301 20.496 130.816 30.664 50.719 60.563 100.822 130.799 70.638 4
MVPNet0.641 60.831 50.715 90.671 120.590 90.781 90.394 130.679 130.642 20.553 50.937 140.462 50.256 70.649 30.406 180.626 80.691 90.666 20.877 70.792 90.608 8
Maximilian Jaritz, Jiayuan Gu, Hao Su: Multi-view PointNet for 3D Scene Understanding. GMDL Workshop, ICCV 2019
joint point-basedpermissive0.634 70.614 180.778 30.667 140.633 50.825 30.420 90.804 20.467 180.561 40.951 40.494 30.291 30.566 80.458 150.579 140.764 40.559 110.838 110.814 40.598 11
Hung-Yueh Chiang, Yen-Liang Lin, Yueh-Cheng Liu, Winston H. Hsu: A Unified Point-Based Framework for 3D Segmentation. 3DV 2019
MCCNNpermissive0.633 80.866 10.731 80.771 50.576 110.809 60.410 110.684 100.497 130.491 110.949 60.466 40.105 260.581 70.646 80.620 90.680 110.542 140.817 140.795 80.618 6
P. Hermosilla, T. Ritschel, P.P. Vazquez, A. Vinacua, T. Ropinski: Monte Carlo Convolution for Learning on Non-Uniformly Sampled Point Clouds. SIGGRAPH Asia 2018
CDF-SM3D0.626 90.592 190.746 70.767 70.607 70.761 130.501 30.738 60.546 90.503 100.864 280.421 100.198 160.584 60.579 110.694 20.706 70.566 90.885 40.745 180.523 19
HPEIN0.618 100.729 110.668 160.647 150.597 80.766 110.414 100.680 110.520 110.525 60.946 70.432 80.215 120.493 140.599 90.638 70.617 190.570 70.897 30.806 60.605 9
Li Jiang, Hengshuang Zhao, Shu Liu, Xiaoyong Shen, Chi-Wing Fu, Jiaya Jia: Hierarchical Point-Edge Interaction Network for Point Cloud Semantic Segmentation. ICCV 2019
SPH3D-GCNpermissive0.610 110.858 30.772 40.489 250.532 130.792 80.404 120.643 160.570 70.507 80.935 150.414 110.046 300.510 110.702 60.602 110.705 80.549 130.859 100.773 110.534 17
Huan Lei, Naveed Akhtar, and Ajmal Mian: Spherical Kernel for Efficient Graph Convolution on 3D Point Clouds.
FLConv0.605 120.751 90.695 130.701 80.545 120.758 140.448 70.596 180.538 100.428 170.953 20.398 120.160 210.457 150.598 100.612 100.686 100.551 120.880 60.768 120.585 13
LAP-D0.594 130.720 120.692 140.637 170.456 190.773 100.391 150.730 70.587 50.445 150.940 110.381 140.288 40.434 170.453 160.591 120.649 130.581 60.777 180.749 170.610 7
DPC0.592 140.720 120.700 110.602 200.480 160.762 120.380 170.713 80.585 60.437 160.940 110.369 160.288 40.434 170.509 140.590 130.639 170.567 80.772 190.755 150.592 12
Francis Engelmann, Theodora Kontogianni, Bastian Leibe: Dilated Point Convolutions. arXiv
CCRFNet0.589 150.766 80.659 190.683 110.470 180.740 160.387 160.620 170.490 150.476 120.922 200.355 190.245 80.511 100.511 130.571 150.643 150.493 180.872 90.762 130.600 10
TextureNetpermissive0.566 160.672 150.664 170.671 120.494 140.719 170.445 80.678 140.411 230.396 180.935 150.356 180.225 100.412 190.535 120.565 160.636 180.464 200.794 170.680 230.568 14
Jingwei Huang, Haotian Zhang, Li Yi, Thomas Funkerhouser, Matthias Niessner, Leonidas Guibas: TextureNet: Consistent Local Parametrizations for Learning from High-Resolution Signals on Meshes. CVPR
DVVNet0.562 170.648 160.700 110.770 60.586 100.687 210.333 190.650 150.514 120.475 130.906 250.359 170.223 110.340 220.442 170.422 250.668 120.501 160.708 230.779 100.534 17
Pointnet++ & Featurepermissive0.557 180.735 100.661 180.686 100.491 150.744 150.392 140.539 210.451 190.375 200.946 70.376 150.205 140.403 200.356 200.553 170.643 150.497 170.824 120.756 140.515 20
PanopticFusion-label0.529 190.491 260.688 150.604 190.386 220.632 260.225 300.705 90.434 210.293 240.815 290.348 200.241 90.499 120.669 70.507 180.649 130.442 240.796 160.602 280.561 15
Gaku Narita, Takashi Seno, Tomoya Ishikawa, Yohsuke Kaji: PanopticFusion: Online Volumetric Semantic Mapping at the Level of Stuff and Things. IROS 2019 (to appear)
3DMV, FTSDF0.501 200.558 230.608 240.424 290.478 170.690 200.246 260.586 190.468 170.450 140.911 230.394 130.160 210.438 160.212 260.432 240.541 240.475 190.742 210.727 190.477 22
PCNN0.498 210.559 220.644 210.560 230.420 210.711 190.229 280.414 220.436 200.352 210.941 100.324 210.155 230.238 260.387 190.493 190.529 250.509 150.813 150.751 160.504 21
3DMV0.484 220.484 270.538 270.643 160.424 200.606 290.310 200.574 200.433 220.378 190.796 300.301 220.214 130.537 90.208 270.472 230.507 280.413 270.693 240.602 280.539 16
Angela Dai, Matthias Niessner: 3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation. ECCV'18
PointCNN with RGBpermissive0.458 230.577 210.611 230.356 310.321 280.715 180.299 220.376 250.328 280.319 220.944 90.285 240.164 200.216 290.229 250.484 210.545 230.456 220.755 200.709 200.475 23
Yangyan Li, Rui Bu, Mingchao Sun, Baoquan Chen: PointCNN. NeurIPS 2018
FCPNpermissive0.447 240.679 140.604 250.578 220.380 230.682 220.291 230.106 310.483 160.258 290.920 210.258 260.025 310.231 280.325 210.480 220.560 220.463 210.725 220.666 250.231 31
Dario Rethage, Johanna Wald, Jürgen Sturm, Nassir Navab, Federico Tombari: Fully-Convolutional Point Networks for Large-Scale Point Clouds. ECCV 2018
SurfaceConvPF0.442 250.505 250.622 220.380 300.342 270.654 240.227 290.397 240.367 260.276 260.924 190.240 270.198 160.359 210.262 230.366 270.581 200.435 250.640 260.668 240.398 25
Hao Pan, Shilin Liu, Yang Liu, Xin Tong: Convolutional Neural Networks on 3D Surfaces Using Parallel Frames.
PNET20.442 250.548 240.548 260.597 210.363 250.628 270.300 210.292 260.374 250.307 230.881 270.268 250.186 180.238 260.204 280.407 260.506 290.449 230.667 250.620 270.462 24
Tangent Convolutionspermissive0.438 270.437 290.646 200.474 260.369 240.645 250.353 180.258 280.282 300.279 250.918 220.298 230.147 240.283 230.294 220.487 200.562 210.427 260.619 270.633 260.352 27
Maxim Tatarchenko, Jaesik Park, Vladlen Koltun, Qian-Yi Zhou: Tangent convolutions for dense prediction in 3d. CVPR 2018
SPLAT Netcopyleft0.393 280.472 280.511 280.606 180.311 290.656 230.245 270.405 230.328 280.197 300.927 180.227 290.000 330.001 330.249 240.271 320.510 260.383 290.593 280.699 210.267 29
Hang Su, Varun Jampani, Deqing Sun, Subhransu Maji, Evangelos Kalogerakis, Ming-Hsuan Yang, Jan Kautz: SPLATNet: Sparse Lattice Networks for Point Cloud Processing. CVPR 2018
ScanNet+FTSDF0.383 290.297 310.491 290.432 280.358 260.612 280.274 240.116 300.411 230.265 270.904 260.229 280.079 280.250 240.185 290.320 300.510 260.385 280.548 290.597 300.394 26
PointNet++permissive0.339 300.584 200.478 300.458 270.256 310.360 320.250 250.247 290.278 310.261 280.677 320.183 300.117 250.212 300.145 310.364 280.346 320.232 320.548 290.523 310.252 30
Charles R. Qi, Li Yi, Hao Su, Leonidas J. Guibas: pointnet++: deep hierarchical feature learning on point sets in a metric space.
SSC-UNetpermissive0.308 310.353 300.290 320.278 320.166 320.553 300.169 320.286 270.147 320.148 320.908 240.182 310.064 290.023 320.018 330.354 290.363 300.345 300.546 310.685 220.278 28
ScanNetpermissive0.306 320.203 320.366 310.501 240.311 290.524 310.211 310.002 330.342 270.189 310.786 310.145 320.102 270.245 250.152 300.318 310.348 310.300 310.460 320.437 320.182 32
Angela Dai, Angel X. Chang, Manolis Savva, Maciej Halber, Thomas Funkhouser, Matthias Nießner: ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes. CVPR'17
ERROR0.054 330.000 330.041 330.172 330.030 330.062 330.001 330.035 320.004 330.051 330.143 330.019 330.003 320.041 310.050 320.003 330.054 330.018 330.005 330.264 330.082 33

This table lists the benchmark results for the 3D semantic instance scenario.




Method Infoavg ap 50%bathtubbedbookshelfcabinetchaircountercurtaindeskdoorotherfurniturepicturerefrigeratorshower curtainsinksofatabletoiletwindow
sorted bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort by
OccuSeg0.634 11.000 10.902 10.771 10.461 20.814 10.282 10.583 50.328 10.472 10.471 10.295 30.600 11.000 10.650 10.664 30.587 21.000 10.537 1
MTML0.549 21.000 10.807 20.588 40.327 50.647 20.004 120.815 10.180 50.418 20.364 60.182 50.445 31.000 10.442 40.688 20.571 31.000 10.396 3
Jean Lahoud, Bernard Ghanem, Marc Pollefeys, Martin R. Oswald: 3D Instance Segmentation via Multi-task Metric Learning. ICCV 2019 [oral]
Occipital-SCS0.512 31.000 10.716 40.509 50.506 10.611 40.092 50.602 40.177 60.346 50.383 50.165 60.442 40.850 50.386 70.618 50.543 40.889 80.389 4
3D-BoNet0.488 41.000 10.672 70.590 30.301 60.484 90.098 40.620 20.306 20.341 60.259 80.125 80.434 50.796 60.402 60.499 100.513 50.909 70.439 2
Bo Yang, Jianan Wang, Ronald Clark, Qingyong Hu, Sen Wang, Andrew Markham, Niki Trigoni: Learning Object Bounding Boxes for 3D Instance Segmentation on Point Clouds. NeurIPS 2019 Spotlight
PanopticFusion-inst0.478 50.667 70.712 60.595 20.259 80.550 80.000 150.613 30.175 70.250 90.434 20.437 10.411 70.857 30.485 20.591 80.267 120.944 50.359 5
Gaku Narita, Takashi Seno, Tomoya Ishikawa, Yohsuke Kaji: PanopticFusion: Online Volumetric Semantic Mapping at the Level of Stuff and Things. IROS 2019 (to appear)
ResNet-backbone0.459 61.000 10.737 30.159 130.259 70.587 60.138 20.475 70.217 40.416 30.408 40.128 70.315 80.714 70.411 50.536 90.590 10.873 100.304 6
MASCpermissive0.447 70.528 100.555 90.381 70.382 30.633 30.002 130.509 60.260 30.361 40.432 30.327 20.451 20.571 80.367 80.639 40.386 70.980 30.276 7
Chen Liu, Yasutaka Furukawa: MASC: Multi-scale Affinity with Sparse Convolution for 3D Instance Segmentation.
3D-SISpermissive0.382 81.000 10.432 110.245 100.190 100.577 70.013 100.263 100.033 130.320 70.240 100.075 110.422 60.857 30.117 120.699 10.271 110.883 90.235 10
Ji Hou, Angela Dai, Matthias Niessner: 3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans. CVPR 2019
DPC-instance0.355 90.500 110.517 100.467 60.228 90.422 110.133 30.405 80.111 90.205 100.241 90.075 100.233 90.306 120.445 30.439 110.457 60.974 40.239 9
Francis Engelmann, Theodora Kontogianni, Bastian Leibe: Dilated Point Convolutions. arXiv
UNet-backbone0.319 100.667 70.715 50.233 110.189 110.479 100.008 110.218 110.067 120.201 110.173 110.107 90.123 110.438 90.150 100.615 60.355 80.916 60.093 14
R-PointNet0.306 110.500 110.405 120.311 80.348 40.589 50.054 60.068 130.126 80.283 80.290 70.028 120.219 100.214 130.331 90.396 120.275 100.821 120.245 8
3D-BEVIS0.248 120.667 70.566 80.076 140.035 150.394 120.027 80.035 140.098 100.099 130.030 140.025 130.098 120.375 100.126 110.604 70.181 130.854 110.171 11
Cathrin Elich, Francis Engelmann, Jonas Schult, Theodora Kontogianni, Bastian Leibe: 3D-BEVIS: Birds-Eye-View Instance Segmentation.
Seg-Clusterpermissive0.215 130.370 130.337 140.285 90.105 120.325 130.025 90.282 90.085 110.105 120.107 120.007 150.079 130.317 110.114 130.309 140.304 90.587 130.123 13
Sgpn_scannet0.143 140.208 150.390 130.169 120.065 130.275 140.029 70.069 120.000 140.087 140.043 130.014 140.027 150.000 140.112 140.351 130.168 140.438 140.138 12
MaskRCNN 2d->3d Proj0.058 150.333 140.002 150.000 150.053 140.002 150.002 140.021 150.000 140.045 150.024 150.238 40.065 140.000 140.014 150.107 150.020 150.110 150.006 15

This table lists the benchmark results for the 2D semantic label scenario.


Method Infoavg ioubathtubbedbookshelfcabinetchaircountercurtaindeskdoorfloorotherfurniturepicturerefrigeratorshower curtainsinksofatabletoiletwallwindow
sorted bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort by
DMMF_3d0.605 10.651 30.744 40.782 10.637 10.387 10.536 10.732 10.590 20.540 10.856 80.359 30.306 70.596 30.539 10.627 70.706 10.497 30.785 80.757 70.476 8
DMMF0.597 20.543 70.755 30.749 20.585 30.338 30.494 30.704 30.598 10.494 70.911 30.347 50.327 60.593 40.527 20.675 30.646 50.513 10.842 30.774 50.527 6
MCA-Net0.595 30.533 80.756 20.746 30.590 20.334 50.506 20.670 40.587 30.500 50.905 50.366 20.352 30.601 20.506 40.669 60.648 30.501 20.839 40.769 60.516 7
RFBNet0.592 40.616 40.758 10.659 40.581 40.330 60.469 40.655 70.543 60.524 20.924 10.355 40.336 50.572 50.479 50.671 40.648 30.480 40.814 60.814 10.614 2
DCRedNet0.583 50.682 20.723 50.542 70.510 70.310 80.451 50.668 50.549 50.520 30.920 20.375 10.446 10.528 70.417 60.670 50.577 90.478 50.862 20.806 20.628 1
SSMAcopyleft0.577 60.695 10.716 70.439 90.563 50.314 70.444 60.719 20.551 40.503 40.887 70.346 60.348 40.603 10.353 80.709 10.600 70.457 70.901 10.786 30.599 3
Abhinav Valada, Rohit Mohan, Wolfram Burgard: Self-Supervised Model Adaptation for Multimodal Semantic Segmentation. International Journal of Computer Vision, 2019
FuseNetpermissive0.535 70.570 60.681 80.182 120.512 60.290 90.431 70.659 60.504 80.495 60.903 60.308 70.428 20.523 80.365 70.676 20.621 60.470 60.762 90.779 40.541 5
Caner Hazirbas, Lingni Ma, Csaba Domokos, Daniel Cremers: FuseNet: Incorporating Depth into Semantic Segmentation via Fusion-based CNN Architecture. ACCV 2016
AdapNet++copyleft0.503 80.613 50.722 60.418 100.358 120.337 40.370 100.479 100.443 90.368 100.907 40.207 100.213 110.464 100.525 30.618 80.657 20.450 80.788 70.721 90.408 11
Abhinav Valada, Rohit Mohan, Wolfram Burgard: Self-Supervised Model Adaptation for Multimodal Semantic Segmentation. International Journal of Computer Vision, 2019
3DMV (2d proj)0.498 90.481 100.612 90.579 60.456 90.343 20.384 80.623 80.525 70.381 90.845 90.254 90.264 90.557 60.182 100.581 100.598 80.429 90.760 100.661 110.446 10
Angela Dai, Matthias Niessner: 3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation. ECCV'18
ILC-PSPNet0.475 100.490 90.581 100.289 110.507 80.067 120.379 90.610 90.417 110.435 80.822 110.278 80.267 80.503 90.228 90.616 90.533 100.375 100.820 50.729 80.560 4
Enet (reimpl)0.376 110.264 120.452 120.452 80.365 100.181 100.143 120.456 110.409 120.346 110.769 120.164 110.218 100.359 110.123 120.403 120.381 120.313 120.571 110.685 100.472 9
Re-implementation of Adam Paszke, Abhishek Chaurasia, Sangpil Kim, Eugenio Culurciello: ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation.
ScanNet (2d proj)permissive0.330 120.293 110.521 110.657 50.361 110.161 110.250 110.004 120.440 100.183 120.836 100.125 120.060 120.319 120.132 110.417 110.412 110.344 110.541 120.427 120.109 12
Angela Dai, Angel X. Chang, Manolis Savva, Maciej Halber, Thomas Funkhouser, Matthias Nießner: ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes. CVPR'17

This table lists the benchmark results for the 2D semantic instance scenario.




Method Infoavg apbathtubbedbookshelfcabinetchaircountercurtaindeskdoorotherfurniturepicturerefrigeratorshower curtainsinksofatabletoiletwindow
sorted bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort by
MaskRCNN_ScanNetpermissive0.119 10.129 10.212 10.002 10.112 10.148 10.014 10.205 10.044 10.066 10.078 10.095 10.142 10.030 10.128 10.139 10.080 10.459 10.057 1
Re-implementation of Kaiming He, Georgia Gkioxari, Piotr Dollár, Ross Girshick: Mask R-CNN. ICCV'17

This table lists the benchmark results for the scene type classification scenario.




Method Infoavg recallapartmentbathroombedroom / hotelbookstore / libraryconference roomcopy/mail roomhallwaykitchenlaundry roomliving room / loungemiscofficestorage / basement / garage
sorted bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort by
SSCN0.700 10.500 11.000 10.882 20.500 11.000 11.000 10.500 11.000 11.000 10.778 10.000 20.938 10.000 1
SE-ResNeXt-SSMA0.498 20.000 30.812 20.941 10.500 10.500 20.500 20.500 10.429 30.500 20.667 20.500 10.625 20.000 1
Abhinav Valada, Rohit Mohan, Wolfram Burgard: Self-Supervised Model Adaptation for Multimodal Semantic Segmentation. arXiv
resnet50_scannet0.353 30.250 20.812 20.529 30.500 10.500 20.000 30.500 10.571 20.000 30.556 30.000 20.375 30.000 1