The 3D semantic instance prediction task involves detecting and segmenting the object in an 3D scan mesh.

Evaluation and metrics

Our evaluation ranks all methods according to the average precision for each class. We report the mean average precision AP at overlap 0.25 (AP 25%), overlap 0.5 (AP 50%), and over overlaps in the range [0.5:0.95:0.05] (AP). Note that multiple predictions of the same ground truth instance are penalized as false positives.



This table lists the benchmark results for the 3D semantic instance scenario.




Method Infoavg ap 25%bathtubbedbookshelfcabinetchaircountercurtaindeskdoorotherfurniturepicturerefrigeratorshower curtainsinksofatabletoiletwindow
sort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysorted bysort bysort bysort bysort bysort bysort by
Volt-SPFormerScanNetpermissive0.908 11.000 10.981 220.975 10.885 10.964 60.744 230.845 180.906 160.916 20.842 10.820 20.879 10.959 520.955 90.944 10.872 180.999 410.869 3
Kadir Yilmaz, Adrian Kruse, Tristan Höfer, Daan de Geus, Bastian Leibe: Volume Transformer: Revisiting Vanilla Transformers for 3D Scene Understanding.
SoftGroup++0.874 171.000 10.972 290.947 20.839 150.898 310.556 450.913 20.881 250.756 290.828 30.748 240.821 21.000 10.937 150.937 20.887 51.000 10.821 13
PointRel0.901 21.000 10.978 260.928 40.879 20.962 80.882 50.749 410.947 30.912 30.802 40.753 220.820 31.000 10.984 40.919 70.894 31.000 10.815 17
: Relation3D: Enhancing Relation Modeling for Point Cloud Instance Segmentation. CVPR 2025
Mask3D0.870 191.000 10.985 200.782 500.818 240.938 210.760 200.749 410.923 110.877 170.760 140.785 160.820 31.000 10.912 230.864 400.878 120.983 570.825 11
Jonas Schult, Francis Engelmann, Alexander Hermans, Or Litany, Siyu Tang, Bastian Leibe: Mask3D for 3D Semantic Instance Segmentation. ICRA 2023
EV3D0.877 141.000 10.996 100.873 160.854 90.950 150.691 300.783 320.926 90.889 150.754 180.794 140.820 31.000 10.912 230.900 120.860 231.000 10.779 24
Spherical Mask(CtoF)0.875 151.000 10.991 170.873 160.850 100.946 170.691 300.752 400.926 90.889 140.759 150.794 130.820 31.000 10.912 230.900 120.878 121.000 10.769 26
Queryformer0.874 171.000 10.978 250.809 400.876 30.936 220.702 270.716 460.920 130.875 180.766 130.772 180.818 71.000 10.995 10.916 80.892 41.000 10.767 27
IPCA-Inst0.851 241.000 10.968 310.884 130.842 140.862 440.693 290.812 260.888 240.677 410.783 90.698 300.807 81.000 10.911 310.865 390.865 221.000 10.757 30
SoftGrouppermissive0.865 221.000 10.969 300.860 220.860 60.913 270.558 420.899 40.911 150.760 280.828 20.736 260.802 90.981 480.919 200.875 270.877 141.000 10.820 15
Thang Vu, Kookhoi Kim, Tung M. Luu, Xuan Thanh Nguyen, Chang D. Yoo: SoftGroup for 3D Instance Segmentaiton on Point Clouds. CVPR 2022 [Oral]
SIM3D0.878 131.000 10.972 280.863 210.817 250.952 120.821 110.783 310.890 220.902 90.735 240.797 100.799 101.000 10.931 180.893 160.853 251.000 10.792 21
ExtMask3D0.867 211.000 11.000 10.756 580.816 260.940 190.795 140.760 380.862 270.888 160.739 220.763 200.774 111.000 10.929 190.878 260.879 91.000 10.819 16
ODIN - Inspermissive0.847 261.000 10.951 370.834 330.828 190.875 360.871 60.767 360.821 370.816 240.690 350.800 90.771 121.000 10.912 230.891 170.821 310.886 730.713 37
Ayush Jain, Pushkal Katara, Nikolaos Gkanatsios, Adam W. Harley, Gabriel Sarch, Kriti Aggarwal, Vishrav Chaudhary, Katerina Fragkiadaki: ODIN: A Single Model for 2D and 3D Segmentation. CVPR 2024
Competitor-MAFT0.896 41.000 11.000 10.872 180.847 120.967 50.955 10.778 350.901 190.919 10.784 80.812 40.770 131.000 10.949 100.865 380.868 201.000 10.840 7
PointComp0.897 31.000 10.998 60.864 200.869 40.969 40.830 80.783 330.905 170.894 110.791 50.834 10.769 141.000 10.982 50.920 60.868 211.000 10.872 2
TST3D0.879 121.000 10.994 110.921 60.807 270.939 200.771 170.887 70.923 120.862 190.722 260.768 190.756 151.000 10.910 340.904 90.836 300.999 410.824 12
Duc Tran Dang Trung, Byeongkeun Kang, Yeejin Lee: MSTA3D: Multi-scale Twin-attention for 3D Instance Segmentation. ACM Multimedia 2024
OneFormer3Dcopyleft0.896 41.000 11.000 10.913 70.858 70.951 140.786 160.837 200.916 140.908 50.778 110.803 80.750 161.000 10.976 70.926 50.882 70.995 510.849 4
Maxim Kolodiazhnyi, Anna Vorontsova, Anton Konushin, Danila Rukhovich: OneFormer3D: One Transformer for Unified Point Cloud Segmentation.
MAFT0.860 231.000 10.990 180.810 390.829 180.949 160.809 120.688 520.836 320.904 70.751 190.796 110.741 171.000 10.864 440.848 490.837 281.000 10.828 9
MG-Former0.887 61.000 10.991 160.837 280.801 280.935 230.887 40.857 110.946 40.891 120.748 210.805 70.739 181.000 10.993 20.809 620.876 151.000 10.842 6
TopoSeg0.832 301.000 10.981 230.933 30.819 230.826 530.524 510.841 190.811 380.681 400.759 160.687 310.727 190.981 480.911 310.883 220.853 261.000 10.756 31
KmaxOneFormerNetpermissive0.883 81.000 11.000 10.798 430.848 110.971 20.853 70.903 30.827 350.910 40.748 200.809 60.724 201.000 10.980 60.855 440.844 271.000 10.832 8
UniPerception0.870 191.000 10.998 60.770 530.835 160.972 10.762 190.754 390.928 80.845 210.790 60.819 30.717 210.981 480.915 220.890 180.878 101.000 10.809 18
TD3Dpermissive0.875 151.000 10.976 270.877 140.783 340.970 30.889 30.828 210.945 50.803 270.713 280.720 290.709 221.000 10.936 160.934 40.873 161.000 10.791 22
Maksim Kolodiazhnyi, Anna Vorontsova, Anton Konushin, Danila Rukhovich: Top-Down Beats Bottom-Up in 3D Instance Segmentation. WACV 2024
DCD0.885 71.000 10.933 440.856 240.832 170.959 100.930 20.858 100.802 410.859 200.767 120.796 120.709 231.000 10.971 80.871 320.904 11.000 10.874 1
Mask3D_evaluation0.843 271.000 10.955 360.847 250.795 300.932 240.750 220.780 340.891 210.818 230.737 230.633 390.703 241.000 10.902 360.870 330.820 320.941 650.805 19
RPGN0.806 351.000 10.992 140.789 450.723 530.891 320.650 330.810 270.832 330.665 440.699 320.658 330.700 251.000 10.881 380.832 530.774 350.997 440.613 54
Shichao Dong, Guosheng Lin, Tzu-Yi Hung: Learning Regional Purity for Instance Segmentation on 3D Point Clouds. ECCV 2022
Box2Mask0.803 361.000 10.962 350.874 150.707 570.887 350.686 320.598 600.961 10.715 350.694 330.469 570.700 251.000 10.912 230.902 110.753 420.997 440.637 48
Julian Chibane, Francis Engelmann, Tuan Anh Tran, Gerard Pons-Moll: Box2Mask: Weakly Supervised 3D Semantic Instance Segmentation Using Bounding Boxes. ECCV 2022
CSC-Pretrained0.791 391.000 10.996 80.829 340.767 380.889 340.600 370.819 240.770 480.594 550.620 480.541 490.700 251.000 10.941 110.889 200.763 381.000 10.526 64
InsSSM0.883 81.000 10.996 80.800 420.865 50.960 90.808 130.852 160.940 70.899 100.785 70.810 50.700 251.000 10.912 230.851 470.895 20.997 440.827 10
Lei Yao, Yi Wang, Moyun Liu, Lap-Pui Chau: SGIFormer: Semantic-guided and Geometric-enhanced Interleaving Transformer for 3D Instance Segmentation. TCSVT, 2024
GICN0.788 411.000 10.978 240.867 190.781 350.833 500.527 500.824 220.806 390.549 630.596 510.551 450.700 251.000 10.853 450.935 30.733 461.000 10.651 45
SPFormerpermissive0.851 241.000 10.994 120.806 410.774 360.942 180.637 340.849 170.859 290.889 130.720 270.730 270.665 301.000 10.911 310.868 370.873 171.000 10.796 20
Sun Jiahao, Qing Chunmei, Tan Junpeng, Xu Xiangmin: Superpoint Transformer for 3D Scene Instance Segmentation. AAAI 2023 [Oral]
Competitor-SPFormer0.881 101.000 11.000 10.845 260.854 80.962 70.714 260.857 120.904 180.902 80.782 100.789 150.662 311.000 10.988 30.874 290.886 60.997 440.847 5
SSEC0.820 331.000 10.983 210.924 50.826 200.817 560.415 600.899 50.793 430.673 420.731 250.636 370.653 321.000 10.939 130.804 640.878 111.000 10.780 23
SphereSeg0.835 281.000 10.963 340.891 100.794 310.954 110.822 100.710 470.961 20.721 330.693 340.530 520.653 331.000 10.867 430.857 430.859 240.991 540.771 25
DualGroup0.782 441.000 10.927 470.811 370.772 370.853 460.631 360.805 280.773 450.613 510.611 490.610 400.650 340.835 660.881 380.879 250.750 441.000 10.675 43
DD-UNet+Group0.764 481.000 10.897 600.837 290.753 440.830 520.459 560.824 220.699 570.629 480.653 390.438 600.650 341.000 10.880 400.858 420.690 591.000 10.650 46
H. Liu, R. Liu, K. Yang, J. Zhang, K. Peng, R. Stiefelhagen: HIDA: Towards Holistic Indoor Understanding for the Visually Impaired via Semantic Instance Segmentation with a Wearable Solid-State LiDAR Sensor. ICCVW 2021
INS-Conv-instance0.762 491.000 10.923 500.765 540.785 330.905 290.600 370.655 540.646 620.683 390.647 420.530 510.650 341.000 10.824 500.830 540.693 570.944 610.644 47
DKNet0.815 341.000 10.930 450.844 270.765 400.915 260.534 490.805 280.805 400.807 260.654 380.763 210.650 341.000 10.794 570.881 230.766 371.000 10.758 29
Yizheng Wu, Min Shi, Shuaiyuan Du, Hao Lu, Zhiguo Cao, Weicai Zhong: 3D Instances as 1D Kernels. ECCV 2022
3D-MPA0.737 531.000 10.933 430.785 460.794 320.831 510.279 680.588 610.695 580.616 490.559 580.556 440.650 341.000 10.809 540.875 280.696 551.000 10.608 56
Francis Engelmann, Martin Bokeloh, Alireza Fathi, Bastian Leibe, Matthias Nießner: 3D-MPA: Multi Proposal Aggregation for 3D Semantic Instance Segmentation. CVPR 2020
DENet0.786 421.000 10.929 460.736 590.750 470.720 690.755 210.934 10.794 420.590 560.561 570.537 500.650 341.000 10.882 370.804 650.789 341.000 10.719 36
PBNetpermissive0.825 321.000 10.963 330.837 300.843 130.865 390.822 90.647 550.878 260.733 310.639 440.683 320.650 341.000 10.853 450.870 340.820 331.000 10.744 32
Weiguang Zhao, Yuyao Yan, Chaolong Yang, Jianan Ye, Xi Yang, Kaizhu Huang: Divide and Conquer: 3D Instance Segmentation With Point-Wise Binarization. ICCV 2023
PCJC0.684 641.000 10.895 610.757 570.659 620.862 430.189 750.739 440.606 650.712 360.581 540.515 540.650 340.857 580.357 760.785 680.631 640.889 710.635 49
ISBNetpermissive0.835 281.000 10.950 380.731 600.819 220.918 250.790 150.740 430.851 310.831 220.661 370.742 250.650 341.000 10.937 140.814 610.836 291.000 10.765 28
Tuan Duc Ngo, Binh-Son Hua, Khoi Nguyen: ISBNet: a 3D Point Cloud Instance Segmentation Network with Instance-aware Sampling and Box-aware Dynamic Convolution. CVPR 2023
PE0.776 461.000 10.900 580.860 220.728 520.869 370.400 610.857 130.774 440.568 620.701 310.602 410.646 430.933 540.843 480.890 190.691 580.997 440.709 38
Biao Zhang, Peter Wonka: Point Cloud Instance Segmentation using Probabilistic Embeddings. CVPR 2021
HAISpermissive0.803 361.000 10.994 120.820 350.759 410.855 450.554 460.882 80.827 360.615 500.676 360.638 360.646 431.000 10.912 230.797 670.767 360.994 520.726 34
Shaoyu Chen, Jiemin Fang, Qian Zhang, Wenyu Liu, Xinggang Wang: Hierarchical Aggregation for 3D Instance Segmentation. ICCV 2021
GraphCut0.832 301.000 10.922 530.724 620.798 290.902 300.701 280.856 140.859 280.715 340.706 290.748 230.640 451.000 10.934 170.862 410.880 81.000 10.729 33
PointGroup0.778 451.000 10.900 570.798 440.715 550.863 410.493 520.706 480.895 200.569 610.701 300.576 430.639 461.000 10.880 400.851 460.719 500.997 440.709 39
Li Jiang, Hengshuang Zhao, Shaoshuai Shi, Shu Liu, Chi-Wing Fu, Jiaya Jia: PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation. CVPR 2020 [oral]
ClickSeg_Instance0.685 631.000 10.818 690.600 710.715 560.795 580.557 430.533 660.591 680.601 530.519 640.429 620.638 470.938 530.706 650.817 590.624 660.944 610.502 66
DANCENET0.786 421.000 10.936 410.783 480.737 500.852 470.742 240.647 550.765 500.811 250.624 470.579 420.632 481.000 10.909 350.898 140.696 540.944 610.601 57
RWSeg0.739 521.000 10.899 590.759 560.753 450.823 540.282 660.691 510.658 600.582 590.594 520.547 460.628 491.000 10.795 560.868 360.728 481.000 10.692 41
VDG-Uni3DSeg0.880 111.000 10.990 180.889 110.823 210.952 130.764 180.893 60.941 60.907 60.756 170.781 170.628 491.000 10.918 210.903 100.872 190.999 410.821 14
Sparse R-CNN0.714 581.000 10.926 490.694 630.699 590.890 330.636 350.516 680.693 590.743 300.588 530.369 640.601 510.594 720.800 550.886 210.676 600.986 560.546 61
Hier3Dcopyleft0.540 721.000 10.727 720.626 690.467 760.693 710.200 730.412 700.480 750.528 670.318 740.077 780.600 520.688 690.382 730.768 710.472 730.941 650.350 75
Tan: HCFS3D: Hierarchical Coupled Feature Selection Network for 3D Semantic and Instance Segmentation.
SPG_WSIS0.678 651.000 10.880 640.836 310.701 580.727 680.273 700.607 590.706 560.541 660.515 650.174 710.600 520.857 580.716 640.846 500.711 511.000 10.506 65
SSTNetpermissive0.789 401.000 10.840 670.888 120.717 540.835 490.717 250.684 530.627 630.724 320.652 400.727 280.600 521.000 10.912 230.822 560.757 411.000 10.691 42
Zhihao Liang, Zhihao Li, Songcen Xu, Mingkui Tan, Kui Jia: Instance Segmentation in 3D Scenes using Semantic Superpoint Tree Networks. ICCV2021
Mask-Group0.792 381.000 10.968 320.812 360.766 390.864 400.460 540.815 250.888 230.598 540.651 410.639 350.600 520.918 550.941 110.896 150.721 491.000 10.723 35
Min Zhong, Xinghao Chen, Xiaokang Chen, Gang Zeng, Yunhe Wang: MaskGroup: Hierarchical Point Grouping and Masking for 3D Instance Segmentation. ICME 2022
3D-SISpermissive0.558 701.000 10.773 710.614 700.503 730.691 720.200 730.412 700.498 740.546 650.311 750.103 750.600 520.857 580.382 730.799 660.445 770.938 670.371 72
Ji Hou, Angela Dai, Matthias Niessner: 3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans. CVPR 2019
AOIA0.767 471.000 10.937 400.810 380.740 490.906 280.550 470.800 300.706 550.577 600.624 460.544 480.596 570.857 580.879 420.880 240.750 430.992 530.658 44
OccuSeg+instance0.742 511.000 10.923 500.785 460.745 480.867 380.557 430.578 640.729 530.670 430.644 430.488 550.577 581.000 10.794 570.830 540.620 671.000 10.550 60
Lei Han, Tian Zheng, Lan Xu, Lu Fang: OccuSeg: Occupancy-aware 3D Instance Segmentation. CVPR2020
MTML0.731 541.000 10.992 140.779 520.609 660.746 640.308 650.867 90.601 660.607 520.539 610.519 530.550 591.000 10.824 500.869 350.729 471.000 10.616 52
Jean Lahoud, Bernard Ghanem, Marc Pollefeys, Martin R. Oswald: 3D Instance Segmentation via Multi-task Metric Learning. ICCV 2019 [oral]
Dyco3Dcopyleft0.761 501.000 10.935 420.893 90.752 460.863 420.600 370.588 610.742 520.641 460.633 450.546 470.550 590.857 580.789 590.853 450.762 390.987 550.699 40
Tong He; Chunhua Shen; Anton van den Hengel: DyCo3D: Robust Instance Segmentation of 3D Point Clouds through Dynamic Convolution. CVPR2021
MASCpermissive0.615 680.711 750.802 700.540 740.757 420.777 600.029 780.577 650.588 690.521 690.600 500.436 610.534 610.697 680.616 690.838 520.526 690.980 580.534 63
Chen Liu, Yasutaka Furukawa: MASC: Multi-scale Affinity with Sparse Convolution for 3D Instance Segmentation.
SegGroup_inspermissive0.637 671.000 10.923 520.593 720.561 700.746 650.143 770.504 690.766 490.485 710.442 690.372 630.530 620.714 670.815 530.775 690.673 611.000 10.431 70
An Tao, Yueqi Duan, Yi Wei, Jiwen Lu, Jie Zhou: SegGroup: Seg-Level Supervision for 3D Instance and Semantic Segmentation. TIP 2022
PanopticFusion-inst0.693 601.000 10.852 660.655 670.616 650.788 590.334 630.763 370.771 470.457 730.555 590.652 340.518 630.857 580.765 600.732 730.631 630.944 610.577 59
Gaku Narita, Takashi Seno, Tomoya Ishikawa, Yohsuke Kaji: PanopticFusion: Online Volumetric Semantic Mapping at the Level of Stuff and Things. IROS 2019 (to appear)
3D-BoNet0.687 621.000 10.887 620.836 310.587 690.643 760.550 470.620 570.724 540.522 680.501 660.243 680.512 641.000 10.751 620.807 630.661 620.909 700.612 55
Bo Yang, Jianan Wang, Ronald Clark, Qingyong Hu, Sen Wang, Andrew Markham, Niki Trigoni: Learning Object Bounding Boxes for 3D Instance Segmentation on Point Clouds. NeurIPS 2019 Spotlight
Occipital-SCS0.688 611.000 10.913 540.730 610.737 510.743 660.442 570.855 150.655 610.546 640.546 600.263 670.508 650.889 560.568 710.771 700.705 520.889 710.625 50
NeuralBF0.718 571.000 10.945 390.901 80.754 430.817 550.460 540.700 490.772 460.688 380.568 560.000 790.500 660.981 480.606 700.872 300.740 451.000 10.614 53
Weiwei Sun, Daniel Rebain, Renjie Liao, Vladimir Tankovich, Soroosh Yazdani, Kwang Moo Yi, Andrea Tagliasacchi: NeuralBF: Neural Bilateral Filtering for Top-down Instance Segmentation on Point Clouds. WACV 2023
OSIS0.725 551.000 10.885 630.653 680.657 630.801 570.576 410.695 500.828 340.698 370.534 620.457 590.500 660.857 580.831 490.841 510.627 651.000 10.619 51
SALoss-ResNet0.695 591.000 10.855 650.579 730.589 680.735 670.484 530.588 610.856 300.634 470.571 550.298 650.500 661.000 10.824 500.818 570.702 530.935 680.545 62
Zhidong Liang, Ming Yang, Hao Li, Chunxiang Wang: 3D Instance Embedding Learning With a Structure-Aware Loss Function for Point Cloud Segmentation. IEEE Robotics and Automation Letters (IROS2020)
SSEN0.724 561.000 10.926 480.781 510.661 610.845 480.596 400.529 670.764 510.653 450.489 680.461 580.500 660.859 570.765 600.872 310.761 401.000 10.577 58
Dongsu Zhang, Junha Chun, Sang Kyun Cha, Young Min Kim: Spatial Semantic Embedding Network: Fast 3D Instance Segmentation with Deep Metric Learning. Arxiv
One_Thing_One_Clickpermissive0.675 661.000 10.823 680.782 490.621 640.766 610.211 720.736 450.560 700.586 570.522 630.636 380.453 700.641 700.853 450.850 480.694 560.997 440.411 71
Zhengzhe Liu, Xiaojuan Qi, Chi-Wing Fu: One Thing One Click: A Self-Training Approach for Weakly Supervised 3D Semantic Segmentation. CVPR 2021
ASIS0.422 770.333 790.707 750.676 650.401 770.650 750.350 620.177 780.594 670.376 760.202 770.077 770.404 710.571 730.197 800.674 770.447 760.500 790.260 79
UNet-backbone0.605 691.000 10.909 550.764 550.603 670.704 700.415 590.301 740.548 710.461 720.394 700.267 660.386 720.857 580.649 680.817 580.504 710.959 590.356 74
SemRegionNet-20cls0.470 761.000 10.727 720.447 770.481 740.678 730.024 790.380 720.518 720.440 740.339 720.128 730.350 730.429 750.212 790.711 750.465 740.833 760.290 78
tmp0.474 751.000 10.727 720.433 780.481 750.673 740.022 800.380 720.517 730.436 750.338 730.128 730.343 740.429 750.291 780.728 740.473 720.833 760.300 77
3D-BEVIS0.401 780.667 760.687 760.419 790.137 800.587 780.188 760.235 750.359 770.211 790.093 800.080 760.311 750.571 730.382 730.754 720.300 790.874 740.357 73
Cathrin Elich, Francis Engelmann, Jonas Schult, Theodora Kontogianni, Bastian Leibe: 3D-BEVIS: Birds-Eye-View Instance Segmentation.
R-PointNet0.544 710.500 780.655 770.661 660.663 600.765 620.432 580.214 770.612 640.584 580.499 670.204 700.286 760.429 750.655 670.650 780.539 680.950 600.499 67
Sem_Recon_ins0.484 740.764 740.608 790.470 760.521 720.637 770.311 640.218 760.348 780.365 770.223 760.222 690.258 770.629 710.734 630.596 790.509 700.858 750.444 69
Sgpn_scannet0.390 790.556 770.636 780.493 750.353 780.539 790.271 710.160 790.450 760.359 780.178 780.146 720.250 780.143 790.347 770.698 760.436 780.667 780.331 76
Region-18class0.497 730.250 800.902 560.689 640.540 710.747 630.276 690.610 580.268 790.489 700.348 710.000 790.243 790.220 780.663 660.814 600.459 750.928 690.496 68
MaskRCNN 2d->3d Proj0.261 800.903 730.081 800.008 800.233 790.175 800.280 670.106 800.150 800.203 800.175 790.480 560.218 800.143 790.542 720.404 800.153 800.393 800.049 80