The 3D semantic instance prediction task involves detecting and segmenting the object in an 3D scan mesh.

Evaluation and metrics

Our evaluation ranks all methods according to the average precision for each class. We report the mean average precision AP at overlap 0.25 (AP 25%), overlap 0.5 (AP 50%), and over overlaps in the range [0.5:0.95:0.05] (AP). Note that multiple predictions of the same ground truth instance are penalized as false positives.



This table lists the benchmark results for the 3D semantic instance scenario.




Method Infoavg ap 50%bathtubbedbookshelfcabinetchaircountercurtaindeskdoorotherfurniturepicturerefrigeratorshower curtainsinksofatabletoiletwindow
sort bysorted bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort by
Volt-SPFormerScanNetpermissive0.827 11.000 10.981 60.975 10.801 10.940 40.426 240.693 300.752 130.762 70.800 10.804 20.855 10.959 480.745 230.879 70.806 70.997 430.710 1
Kadir Yilmaz, Adrian Kruse, Tristan Höfer, Daan de Geus, Bastian Leibe: Volume Transformer: Revisiting Vanilla Transformers for 3D Scene Understanding.
Dyco3Dcopyleft0.641 451.000 10.841 290.893 80.531 480.802 420.115 520.588 530.448 450.438 500.537 470.430 500.550 550.857 520.534 500.764 410.657 370.987 550.568 31
Tong He; Chunhua Shen; Anton van den Hengel: DyCo3D: Robust Instance Segmentation of 3D Point Clouds through Dynamic Convolution. CVPR2021
SALoss-ResNet0.459 661.000 10.737 590.159 770.259 690.587 660.138 480.475 650.217 630.416 570.408 590.128 660.315 700.714 630.411 600.536 720.590 500.873 680.304 64
Zhidong Liang, Ming Yang, Hao Li, Chunxiang Wang: 3D Instance Embedding Learning With a Structure-Aware Loss Function for Point Cloud Segmentation. IEEE Robotics and Automation Letters (IROS2020)
PBNetpermissive0.747 261.000 10.818 330.837 220.713 100.844 300.457 220.647 440.711 180.614 250.617 320.657 280.650 291.000 10.692 290.822 270.765 251.000 10.595 26
Weiguang Zhao, Yuyao Yan, Chaolong Yang, Jianan Ye, Xi Yang, Kaizhu Huang: Divide and Conquer: 3D Instance Segmentation With Point-Wise Binarization. ICCV 2023
SoftGrouppermissive0.761 231.000 10.808 370.845 180.716 90.862 280.243 370.824 40.655 300.620 240.734 60.699 200.791 90.981 410.716 250.844 180.769 221.000 10.594 27
Thang Vu, Kookhoi Kim, Tung M. Luu, Xuan Thanh Nguyen, Chang D. Yoo: SoftGroup for 3D Instance Segmentaiton on Point Clouds. CVPR 2022 [Oral]
DKNet0.718 301.000 10.814 340.782 310.619 340.872 250.224 380.751 160.569 370.677 210.585 370.724 150.633 410.981 410.515 520.819 290.736 311.000 10.617 15
Yizheng Wu, Min Shi, Shuaiyuan Du, Hao Lu, Zhiguo Cao, Weicai Zhong: 3D Instances as 1D Kernels. ECCV 2022
IPCA-Inst0.731 281.000 10.788 470.884 110.698 150.788 460.252 350.760 130.646 310.511 380.637 270.665 270.804 81.000 10.644 370.778 370.747 301.000 10.561 33
INS-Conv-instance0.657 411.000 10.760 550.667 490.581 400.863 270.323 300.655 410.477 410.473 440.549 440.432 480.650 291.000 10.655 340.738 480.585 510.944 590.472 50
ClickSeg_Instance0.539 591.000 10.621 680.300 660.530 490.698 570.127 500.533 570.222 620.430 540.400 600.365 560.574 530.938 490.472 550.659 630.543 610.944 590.347 62
DENet0.629 501.000 10.797 440.608 530.589 390.627 630.219 390.882 10.310 570.402 600.383 630.396 530.650 291.000 10.663 330.543 710.691 341.000 10.568 32
3D-MPA0.611 511.000 10.833 300.765 360.526 500.756 520.136 490.588 530.470 420.438 510.432 570.358 580.650 290.857 520.429 590.765 400.557 581.000 10.430 54
Francis Engelmann, Martin Bokeloh, Alireza Fathi, Bastian Leibe, Matthias Nießner: 3D-MPA: Multi Proposal Aggregation for 3D Semantic Instance Segmentation. CVPR 2020
MTML0.549 581.000 10.807 390.588 590.327 650.647 610.004 690.815 70.180 640.418 560.364 650.182 630.445 611.000 10.442 570.688 610.571 541.000 10.396 57
Jean Lahoud, Bernard Ghanem, Marc Pollefeys, Martin R. Oswald: 3D Instance Segmentation via Multi-task Metric Learning. ICCV 2019 [oral]
SphereSeg0.680 361.000 10.856 250.744 390.618 350.893 190.151 430.651 430.713 170.537 350.579 400.430 490.651 281.000 10.389 630.744 470.697 320.991 540.601 24
RPGN0.643 441.000 10.758 560.582 610.539 450.826 340.046 600.765 120.372 530.436 520.588 360.539 380.650 291.000 10.577 440.750 450.653 400.997 430.495 46
Shichao Dong, Guosheng Lin, Tzu-Yi Hung: Learning Regional Purity for Instance Segmentation on 3D Point Clouds. ECCV 2022
SSTNetpermissive0.698 331.000 10.697 650.888 90.556 440.803 410.387 260.626 480.417 480.556 330.585 380.702 170.600 461.000 10.824 80.720 520.692 331.000 10.509 42
Zhihao Liang, Zhihao Li, Songcen Xu, Mingkui Tan, Kui Jia: Instance Segmentation in 3D Scenes using Semantic Superpoint Tree Networks. ICCV2021
Mask-Group0.664 401.000 10.822 320.764 370.616 360.815 370.139 470.694 290.597 340.459 460.566 420.599 330.600 460.516 700.715 260.819 300.635 431.000 10.603 22
Min Zhong, Xinghao Chen, Xiaokang Chen, Gang Zeng, Yunhe Wang: MaskGroup: Hierarchical Point Grouping and Masking for 3D Instance Segmentation. ICME 2022
Mask3D0.780 201.000 10.786 480.716 430.696 170.885 230.500 180.714 220.810 50.672 220.715 100.679 250.809 21.000 10.831 40.833 220.787 131.000 10.602 23
Jonas Schult, Francis Engelmann, Alexander Hermans, Or Litany, Siyu Tang, Bastian Leibe: Mask3D for 3D Semantic Instance Segmentation. ICRA 2023
HAISpermissive0.699 321.000 10.849 270.820 230.675 250.808 400.279 320.757 150.465 430.517 370.596 340.559 340.600 461.000 10.654 350.767 390.676 350.994 520.560 34
Shaoyu Chen, Jiemin Fang, Qian Zhang, Wenyu Liu, Xinggang Wang: Hierarchical Aggregation for 3D Instance Segmentation. ICCV 2021
AOIA0.601 531.000 10.761 540.687 460.485 560.828 330.008 670.663 390.405 500.405 590.425 580.490 410.596 490.714 630.553 490.779 350.597 480.992 530.424 56
Sparse R-CNN0.515 611.000 10.538 730.282 670.468 590.790 450.173 410.345 690.429 460.413 580.484 500.176 640.595 510.591 680.522 510.668 620.476 660.986 570.327 63
PE0.645 431.000 10.773 510.798 280.538 460.786 470.088 550.799 100.350 550.435 530.547 450.545 360.646 400.933 500.562 460.761 420.556 600.997 430.501 45
Biao Zhang, Peter Wonka: Point Cloud Instance Segmentation using Probabilistic Embeddings. CVPR 2021
PCJC0.578 541.000 10.810 360.583 600.449 600.813 390.042 610.603 510.341 560.490 410.465 520.410 510.650 290.835 600.264 690.694 580.561 560.889 640.504 44
GICN0.638 461.000 10.895 210.800 270.480 570.676 590.144 450.737 170.354 540.447 470.400 610.365 560.700 171.000 10.569 450.836 210.599 471.000 10.473 49
PointGroup0.636 471.000 10.765 520.624 520.505 550.797 430.116 510.696 280.384 520.441 480.559 430.476 420.596 491.000 10.666 310.756 430.556 590.997 430.513 41
Li Jiang, Hengshuang Zhao, Shaoshuai Shi, Shu Liu, Chi-Wing Fu, Jiaya Jia: PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation. CVPR 2020 [oral]
SSEN0.575 551.000 10.761 530.473 630.477 580.795 440.066 570.529 580.658 290.460 450.461 530.380 550.331 690.859 510.401 620.692 600.653 391.000 10.348 61
Dongsu Zhang, Junha Chun, Sang Kyun Cha, Young Min Kim: Spatial Semantic Embedding Network: Fast 3D Instance Segmentation with Deep Metric Learning. Arxiv
OccuSeg+instance0.672 391.000 10.758 570.682 470.576 420.842 310.477 200.504 620.524 390.567 310.585 390.451 440.557 541.000 10.751 220.797 340.563 551.000 10.467 51
Lei Han, Tian Zheng, Lan Xu, Lu Fang: OccuSeg: Occupancy-aware 3D Instance Segmentation. CVPR2020
Occipital-SCS0.512 621.000 10.716 610.509 620.506 540.611 640.092 540.602 520.177 650.346 650.383 620.165 650.442 620.850 590.386 640.618 670.543 620.889 640.389 58
3D-BoNet0.488 631.000 10.672 670.590 580.301 670.484 740.098 530.620 490.306 580.341 660.259 690.125 670.434 640.796 620.402 610.499 730.513 640.909 630.439 53
Bo Yang, Jianan Wang, Ronald Clark, Qingyong Hu, Sen Wang, Andrew Markham, Niki Trigoni: Learning Object Bounding Boxes for 3D Instance Segmentation on Point Clouds. NeurIPS 2019 Spotlight
3D-SISpermissive0.382 691.000 10.432 760.245 690.190 710.577 670.013 660.263 710.033 740.320 670.240 700.075 700.422 650.857 520.117 740.699 560.271 750.883 660.235 69
Ji Hou, Angela Dai, Matthias Niessner: 3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans. CVPR 2019
CSC-Pretrained0.648 421.000 10.810 350.768 340.523 510.813 380.143 460.819 50.389 510.422 550.511 480.443 460.650 291.000 10.624 400.732 490.634 441.000 10.375 59
SSEC0.707 311.000 10.850 260.924 40.648 270.747 540.162 420.862 30.572 360.520 360.624 290.549 350.649 381.000 10.560 470.706 540.768 231.000 10.591 28
Box2Mask0.677 381.000 10.847 280.771 330.509 530.816 360.277 330.558 550.482 400.562 320.640 260.448 450.700 171.000 10.666 310.852 160.578 520.997 430.488 47
Julian Chibane, Francis Engelmann, Tuan Anh Tran, Gerard Pons-Moll: Box2Mask: Weakly Supervised 3D Semantic Instance Segmentation Using Bounding Boxes. ECCV 2022
SoftGroup++0.769 221.000 10.803 410.937 20.684 230.865 260.213 400.870 20.664 270.571 300.758 20.702 180.807 71.000 10.653 360.902 30.792 111.000 10.626 12
TST3D0.795 131.000 10.929 150.918 50.709 110.884 240.596 40.704 250.769 100.734 100.644 250.699 210.751 141.000 10.794 120.876 90.757 270.997 430.550 37
Duc Tran Dang Trung, Byeongkeun Kang, Yeejin Lee: MSTA3D: Multi-scale Twin-attention for 3D Instance Segmentation. ACM Multimedia 2024
DCD0.798 121.000 10.878 240.792 300.693 190.936 50.596 30.685 310.663 280.736 90.717 80.788 60.693 221.000 10.825 70.840 190.837 11.000 10.689 2
VDG-Uni3DSeg0.804 71.000 10.990 10.886 100.688 210.912 150.602 20.703 260.786 80.771 40.708 140.700 190.669 270.981 410.789 170.903 20.772 211.000 10.609 21
Competitor-SPFormer0.800 101.000 10.986 30.845 180.705 140.915 140.532 170.733 190.757 120.733 120.708 130.698 220.648 390.981 410.890 10.830 230.796 100.997 430.644 6
Competitor-MAFT0.816 21.000 10.983 40.872 120.718 60.941 30.588 50.652 420.819 30.776 30.720 70.780 70.769 121.000 10.797 110.813 320.798 91.000 10.659 5
PointRel0.816 21.000 10.971 100.908 70.743 30.923 110.573 90.714 220.695 210.734 110.747 30.725 140.809 21.000 10.814 90.899 50.820 31.000 10.610 20
: Relation3D: Enhancing Relation Modeling for Point Cloud Instance Segmentation. CVPR 2025
MG-Former0.791 141.000 10.980 70.837 210.626 300.897 170.543 150.759 140.800 70.766 60.659 210.769 90.697 201.000 10.791 150.707 530.791 121.000 10.610 19
EV3D0.811 51.000 10.968 120.852 160.717 80.921 120.574 80.677 320.748 140.730 140.703 160.795 40.809 21.000 10.831 40.854 130.778 171.000 10.638 10
InsSSM0.799 111.000 10.915 160.710 450.729 50.925 90.664 10.670 360.770 90.766 50.739 50.737 100.700 171.000 10.792 140.829 250.815 40.997 430.625 13
Lei Yao, Yi Wang, Moyun Liu, Lap-Pui Chau: SGIFormer: Semantic-guided and Geometric-enhanced Interleaving Transformer for 3D Instance Segmentation. TCSVT, 2024
Mask3D_evaluation0.631 491.000 10.829 310.606 540.646 280.836 320.068 560.511 600.462 440.507 390.619 310.389 540.610 441.000 10.432 580.828 260.673 360.788 710.552 36
Spherical Mask(CtoF)0.812 41.000 10.973 90.852 160.718 70.917 130.574 70.677 320.748 140.729 150.715 100.795 40.809 21.000 10.831 40.854 130.787 131.000 10.638 9
ODIN - Inspermissive0.693 351.000 10.880 230.647 500.620 330.779 480.336 290.501 630.681 230.577 290.595 350.679 260.683 261.000 10.709 270.816 310.637 420.770 720.557 35
Ayush Jain, Pushkal Katara, Nikolaos Gkanatsios, Adam W. Harley, Gabriel Sarch, Kriti Aggarwal, Vishrav Chaudhary, Katerina Fragkiadaki: ODIN: A Single Model for 2D and 3D Segmentation. CVPR 2024
DANCENET0.680 361.000 10.807 380.733 400.600 380.768 500.375 280.543 560.538 380.610 260.599 330.498 400.632 430.981 410.739 240.856 120.633 450.882 670.454 52
ExtMask3D0.789 151.000 10.988 20.756 380.706 130.912 160.429 230.647 440.806 60.755 80.673 190.689 240.772 111.000 10.789 160.852 150.811 51.000 10.617 16
UniPerception0.787 161.000 10.909 170.768 350.687 220.947 10.551 140.714 210.843 10.696 200.713 120.773 80.607 450.981 410.690 300.878 80.775 201.000 10.640 8
SIM3D0.803 81.000 10.967 130.863 150.692 200.924 100.552 130.732 200.667 260.732 130.662 200.796 30.789 101.000 10.803 100.864 100.766 241.000 10.643 7
Queryformer0.787 161.000 10.933 140.601 550.754 20.886 220.558 120.661 400.767 110.665 230.716 90.639 300.808 61.000 10.844 30.897 60.804 81.000 10.624 14
OneFormer3Dcopyleft0.801 91.000 10.973 80.909 60.698 160.928 80.582 60.668 380.685 220.780 20.687 180.698 230.702 161.000 10.794 130.900 40.784 150.986 560.635 11
Maxim Kolodiazhnyi, Anna Vorontsova, Anton Konushin, Danila Rukhovich: OneFormer3D: One Transformer for Unified Point Cloud Segmentation.
TopoSeg0.725 291.000 10.806 400.933 30.668 260.758 510.272 340.734 180.630 320.549 340.654 230.606 320.697 210.966 470.612 420.839 200.754 291.000 10.573 30
GraphCut0.732 271.000 10.788 460.724 420.642 290.859 290.248 360.787 110.618 330.596 280.653 240.722 160.583 521.000 10.766 190.861 110.825 21.000 10.504 43
MAFT0.786 181.000 10.894 220.807 250.694 180.893 200.486 190.674 340.740 160.786 10.704 150.727 130.739 151.000 10.707 280.849 170.756 281.000 10.685 4
TD3Dpermissive0.751 251.000 10.774 490.867 130.621 320.934 60.404 250.706 240.812 40.605 270.633 280.626 310.690 231.000 10.640 380.820 280.777 181.000 10.612 18
Maksim Kolodiazhnyi, Anna Vorontsova, Anton Konushin, Danila Rukhovich: Top-Down Beats Bottom-Up in 3D Instance Segmentation. WACV 2024
DualGroup0.694 341.000 10.799 430.811 240.622 310.817 350.376 270.805 90.590 350.487 420.568 410.525 390.650 290.835 600.600 430.829 240.655 381.000 10.526 39
ISBNetpermissive0.757 241.000 10.904 180.731 410.678 240.895 180.458 210.644 460.670 250.710 180.620 300.732 110.650 291.000 10.756 200.778 360.779 161.000 10.614 17
Tuan Duc Ngo, Binh-Son Hua, Khoi Nguyen: ISBNet: a 3D Point Cloud Instance Segmentation Network with Instance-aware Sampling and Box-aware Dynamic Convolution. CVPR 2023
OSIS0.605 521.000 10.801 420.599 560.535 470.728 560.286 310.436 670.679 240.491 400.433 550.256 600.404 670.857 520.620 410.724 500.510 651.000 10.539 38
KmaxOneFormerNetpermissive0.783 190.903 590.981 50.794 290.706 120.931 70.561 110.701 270.706 190.727 160.697 170.731 120.689 241.000 10.856 20.750 440.761 261.000 10.599 25
SPFormerpermissive0.770 210.903 590.903 190.806 260.609 370.886 210.568 100.815 60.705 200.711 170.655 220.652 290.685 251.000 10.789 180.809 330.776 191.000 10.583 29
Sun Jiahao, Qing Chunmei, Tan Junpeng, Xu Xiangmin: Superpoint Transformer for 3D Scene Instance Segmentation. AAAI 2023 [Oral]
PointComp0.811 50.850 610.969 110.864 140.739 40.946 20.539 160.671 350.835 20.700 190.742 40.817 10.766 131.000 10.755 210.909 10.808 61.000 10.687 3
Sem_Recon_ins0.227 770.764 620.486 740.069 790.098 770.426 770.017 650.067 780.015 750.172 740.100 740.096 690.054 790.183 750.135 720.366 780.260 770.614 760.168 74
DD-UNet+Group0.635 480.667 630.797 450.714 440.562 430.774 490.146 440.810 80.429 470.476 430.546 460.399 520.633 411.000 10.632 390.722 510.609 461.000 10.514 40
H. Liu, R. Liu, K. Yang, J. Zhang, K. Peng, R. Stiefelhagen: HIDA: Towards Holistic Indoor Understanding for the Visually Impaired via Semantic Instance Segmentation with a Wearable Solid-State LiDAR Sensor. ICCVW 2021
NeuralBF0.555 570.667 630.896 200.843 200.517 520.751 530.029 620.519 590.414 490.439 490.465 510.000 790.484 580.857 520.287 670.693 590.651 411.000 10.485 48
Weiwei Sun, Daniel Rebain, Renjie Liao, Vladimir Tankovich, Soroosh Yazdani, Kwang Moo Yi, Andrea Tagliasacchi: NeuralBF: Neural Bilateral Filtering for Top-down Instance Segmentation on Point Clouds. WACV 2023
tmp0.248 750.667 630.437 750.188 730.153 750.491 730.000 720.208 730.094 720.153 750.099 760.057 720.217 740.119 760.039 760.466 740.302 710.640 750.140 76
One_Thing_One_Clickpermissive0.529 600.667 630.718 600.777 320.399 610.683 580.000 720.669 370.138 670.391 620.374 640.539 370.360 680.641 670.556 480.774 380.593 490.997 430.251 67
Zhengzhe Liu, Xiaojuan Qi, Chi-Wing Fu: One Thing One Click: A Self-Training Approach for Weakly Supervised 3D Semantic Segmentation. CVPR 2021
SPG_WSIS0.470 650.667 630.685 660.677 480.372 630.562 680.000 720.482 640.244 600.316 680.298 660.052 740.442 630.857 520.267 680.702 550.559 571.000 10.287 65
SegGroup_inspermissive0.445 680.667 630.773 500.185 740.317 660.656 600.000 720.407 680.134 680.381 630.267 680.217 620.476 590.714 630.452 560.629 660.514 631.000 10.222 70
An Tao, Yueqi Duan, Yi Wei, Jiwen Lu, Jie Zhou: SegGroup: Seg-Level Supervision for 3D Instance and Semantic Segmentation. TIP 2022
PanopticFusion-inst0.478 640.667 630.712 630.595 570.259 700.550 700.000 720.613 500.175 660.250 710.434 540.437 470.411 660.857 520.485 530.591 700.267 760.944 590.359 60
Gaku Narita, Takashi Seno, Tomoya Ishikawa, Yohsuke Kaji: PanopticFusion: Online Volumetric Semantic Mapping at the Level of Stuff and Things. IROS 2019 (to appear)
3D-BEVIS0.248 750.667 630.566 700.076 780.035 800.394 780.027 640.035 790.098 710.099 780.030 790.025 760.098 760.375 730.126 730.604 690.181 780.854 690.171 73
Cathrin Elich, Francis Engelmann, Jonas Schult, Theodora Kontogianni, Bastian Leibe: 3D-BEVIS: Birds-Eye-View Instance Segmentation.
Hier3Dcopyleft0.323 700.667 630.542 720.264 680.157 740.550 690.000 720.205 740.009 760.270 700.218 710.075 700.500 570.688 660.007 800.698 570.301 720.459 770.200 71
Tan: HCFS3D: Hierarchical Coupled Feature Selection Network for 3D Semantic and Instance Segmentation.
UNet-backbone0.319 710.667 630.715 620.233 700.189 720.479 750.008 670.218 720.067 730.201 730.173 720.107 680.123 750.438 710.150 710.615 680.355 680.916 620.093 79
RWSeg0.567 560.528 730.708 640.626 510.580 410.745 550.063 580.627 470.240 610.400 610.497 490.464 430.515 561.000 10.475 540.745 460.571 531.000 10.429 55
MASCpermissive0.447 670.528 730.555 710.381 640.382 620.633 620.002 700.509 610.260 590.361 640.432 560.327 590.451 600.571 690.367 650.639 650.386 670.980 580.276 66
Chen Liu, Yasutaka Furukawa: MASC: Multi-scale Affinity with Sparse Convolution for 3D Instance Segmentation.
R-PointNet0.306 720.500 750.405 770.311 650.348 640.589 650.054 590.068 770.126 690.283 690.290 670.028 750.219 730.214 740.331 660.396 770.275 730.821 700.245 68
ASIS0.199 780.333 760.253 790.167 760.140 760.438 760.000 720.177 750.008 770.121 770.069 770.004 780.231 720.429 720.036 780.445 760.273 740.333 790.119 78
SemRegionNet-20cls0.250 740.333 760.613 690.229 710.163 730.493 720.000 720.304 700.107 700.147 760.100 750.052 730.231 710.119 760.039 760.445 750.325 690.654 740.141 75
MaskRCNN 2d->3d Proj0.058 800.333 760.002 800.000 800.053 790.002 800.002 710.021 800.000 790.045 800.024 800.238 610.065 780.000 790.014 790.107 800.020 800.110 800.006 80
Region-18class0.284 730.250 790.751 580.228 720.270 680.521 710.000 720.468 660.008 780.205 720.127 730.000 790.068 770.070 780.262 700.652 640.323 700.740 730.173 72
Sgpn_scannet0.143 790.208 800.390 780.169 750.065 780.275 790.029 630.069 760.000 790.087 790.043 780.014 770.027 800.000 790.112 750.351 790.168 790.438 780.138 77