The 3D semantic labeling task involves predicting a semantic labeling of a 3D scan mesh.

Evaluation and metrics

Our evaluation ranks all methods according to the PASCAL VOC intersection-over-union metric (IoU). IoU = TP/(TP+FP+FN), where TP, FP, and FN are the numbers of true positive, false positive, and false negative pixels, respectively. Predicted labels are evaluated per-vertex over the respective 3D scan mesh; for 3D approaches that operate on other representations like grids or points, the predicted labels should be mapped onto the mesh vertices (e.g., one such example for grid to mesh vertices is provided in the evaluation helpers).



This table lists the benchmark results for the ScanNet200 3D semantic label scenario.




Method Infoavg iouhead ioucommon ioutail ioualarm clockarmchairbackpackbagballbarbasketbathroom cabinetbathroom counterbathroom stallbathroom stall doorbathroom vanitybathtubbedbenchbicyclebinblackboardblanketblindsboardbookbookshelfbottlebowlboxbroombucketbulletin boardcabinetcalendarcandlecartcase of water bottlescd caseceilingceiling lightchairclockclosetcloset doorcloset rodcloset wallclothesclothes dryercoat rackcoffee kettlecoffee makercoffee tablecolumncomputer towercontainercopiercouchcountercratecupcurtaincushiondecorationdeskdining tabledish rackdishwasherdividerdoordoorframedresserdumbbelldustpanend tablefanfile cabinetfire alarmfire extinguisherfireplacefloorfolded chairfurnitureguitarguitar casehair dryerhandicap barhatheadphonesironing boardjacketkeyboardkeyboard pianokitchen cabinetkitchen counterladderlamplaptoplaundry basketlaundry detergentlaundry hamperledgelightlight switchluggagemachinemailboxmatmattressmicrowavemini fridgemirrormonitormousemusic standnightstandobjectoffice chairottomanovenpaperpaper bagpaper cutterpaper towel dispenserpaper towel rollpersonpianopicturepillarpillowpipeplantplateplungerposterpotted plantpower outletpower stripprinterprojectorprojector screenpurserackradiatorrailrange hoodrecycling binrefrigeratorscaleseatshelfshoeshowershower curtainshower curtain rodshower doorshower floorshower headshower wallsignsinksoap dishsoap dispensersofa chairspeakerstair railstairsstandstoolstorage binstorage containerstorage organizerstovestructurestuffed animalsuitcasetabletelephonetissue boxtoastertoaster oventoilettoilet papertoilet paper dispensertoilet paper holdertoilet seat cover dispensertoweltrash bintrash cantraytubetvtv standvacuum cleanerventwallwardrobewashing machinewater bottlewater coolerwater pitcherwhiteboardwindowwindowsill
sort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysorted bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort bysort by
CeCo0.340 10.551 10.247 10.181 10.475 20.057 40.142 30.000 10.000 10.000 10.387 20.463 10.499 20.924 10.774 10.213 10.257 10.000 30.546 40.100 20.006 20.615 10.177 40.534 10.246 10.000 20.400 10.000 10.338 10.006 30.484 10.609 10.000 10.083 10.000 20.873 10.089 20.661 20.000 30.048 40.560 10.408 10.892 10.000 10.000 10.586 10.616 20.000 40.692 20.900 10.721 10.162 10.228 10.860 10.000 10.000 20.575 10.083 20.550 10.347 10.624 10.410 10.360 10.740 10.109 20.321 20.660 10.000 20.121 20.939 10.143 20.000 10.400 10.003 20.190 10.564 10.652 10.615 10.421 10.304 30.579 10.547 10.000 10.000 10.296 10.000 40.030 40.096 10.000 20.916 10.037 10.551 10.171 20.376 10.865 10.286 10.000 10.633 10.102 40.027 40.011 20.000 10.000 10.474 20.742 10.133 20.311 10.824 10.242 10.503 10.068 30.828 10.000 20.429 10.000 10.063 10.000 10.781 10.000 10.000 20.000 10.665 10.633 10.450 10.818 10.000 10.000 10.429 10.532 10.226 10.825 10.510 30.377 10.709 10.079 20.000 10.753 10.683 10.102 40.063 20.401 40.620 30.000 10.619 10.000 40.000 30.000 10.595 20.000 20.000 10.345 20.564 10.411 10.603 10.384 10.945 10.266 10.643 10.367 10.304 10.663 10.000 10.010 10.726 20.767 10.898 10.000 10.784 10.435 10.861 10.000 10.447 10.000 40.257 10.656 10.377 3
: Understanding Imbalanced Semantic Segmentation Through Neural Collapse.
Minkowski 34Dpermissive0.253 30.463 30.154 40.102 30.381 40.084 10.134 40.000 10.000 10.000 10.386 30.141 40.279 40.737 40.703 30.014 40.164 30.000 30.663 10.092 30.000 30.224 30.291 10.531 20.056 40.000 20.242 30.000 10.000 20.013 20.331 30.000 20.000 10.035 40.001 10.858 20.059 40.650 40.000 30.056 30.353 30.299 30.670 30.000 10.000 10.284 30.484 40.071 30.594 30.720 30.710 30.027 40.068 40.813 20.000 10.005 10.492 20.164 10.274 30.111 40.571 30.307 40.293 30.307 40.150 10.163 40.531 30.002 10.545 10.932 20.093 40.000 10.000 20.002 30.159 30.368 40.581 40.440 40.228 40.406 10.282 40.294 30.000 10.000 10.189 30.060 10.036 30.000 20.000 20.897 20.000 40.525 30.025 40.205 40.771 40.000 20.000 10.593 30.108 30.044 30.000 30.000 10.000 10.282 40.589 30.094 30.169 30.466 40.227 40.419 40.125 20.757 20.002 10.334 30.000 10.000 20.000 10.357 30.000 10.000 20.000 10.582 20.513 40.337 20.612 40.000 10.000 10.250 30.352 40.136 40.724 30.655 10.280 20.000 20.046 40.000 10.606 40.559 20.159 10.102 10.445 10.655 10.000 10.310 40.117 10.000 30.000 10.581 40.026 10.000 10.265 40.483 30.084 40.097 40.044 20.865 40.142 40.588 20.351 20.272 20.596 40.000 10.003 20.622 30.720 20.096 40.000 10.771 30.016 30.772 20.000 10.302 30.194 20.214 30.621 30.197 4
C. Choy, J. Gwak, S. Savarese: 4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks. CVPR 2019
LGroundpermissive0.272 20.485 20.184 20.106 20.476 10.077 20.218 10.000 10.000 10.000 10.547 10.295 20.540 10.746 30.745 20.058 30.112 40.005 10.658 20.077 40.000 30.322 20.178 30.512 30.190 20.199 10.277 20.000 10.000 20.173 10.399 20.000 20.000 10.039 30.000 20.858 20.085 30.676 10.002 10.103 10.498 20.323 20.703 20.000 10.000 10.296 20.549 30.216 10.702 10.768 20.718 20.028 30.092 30.786 30.000 10.000 20.453 30.022 30.251 40.252 20.572 20.348 20.321 20.514 20.063 30.279 30.552 20.000 20.019 30.932 20.132 30.000 10.000 20.000 40.156 40.457 20.623 20.518 20.265 30.358 20.381 20.395 20.000 10.000 10.127 40.012 30.051 10.000 20.000 20.886 30.014 20.437 40.179 10.244 20.826 20.000 20.000 10.599 20.136 10.085 20.000 30.000 10.000 10.565 10.612 20.143 10.207 20.566 20.232 30.446 20.127 10.708 30.000 20.384 20.000 10.000 20.000 10.402 20.000 10.059 10.000 10.525 40.566 20.229 30.659 30.000 10.000 10.265 20.446 20.147 30.720 40.597 20.066 30.000 20.187 10.000 10.726 20.467 40.134 30.000 30.413 30.629 20.000 10.363 30.055 30.022 20.000 10.626 10.000 20.000 10.323 30.479 40.154 30.117 20.028 30.901 20.243 20.415 40.295 40.143 40.610 30.000 10.000 30.777 10.397 40.324 30.000 10.778 20.179 20.702 30.000 10.274 40.404 10.233 20.622 20.398 2
David Rozenberszki, Or Litany, Angela Dai: Language-Grounded Indoor 3D Semantic Segmentation in the Wild. arXiv
CSC-Pretrainpermissive0.249 40.455 40.171 30.079 40.418 30.059 30.186 20.000 10.000 10.000 10.335 40.250 30.316 30.766 20.697 40.142 20.170 20.003 20.553 30.112 10.097 10.201 40.186 20.476 40.081 30.000 20.216 40.000 10.000 20.001 40.314 40.000 20.000 10.055 20.000 20.832 40.094 10.659 30.002 10.076 20.310 40.293 40.664 40.000 10.000 10.175 40.634 10.130 20.552 40.686 40.700 40.076 20.110 20.770 40.000 10.000 20.430 40.000 40.319 20.166 30.542 40.327 30.205 40.332 30.052 40.375 10.444 40.000 20.012 40.930 40.203 10.000 10.000 20.046 10.175 20.413 30.592 30.471 30.299 20.152 40.340 30.247 40.000 10.000 10.225 20.058 20.037 20.000 20.207 10.862 40.014 20.548 20.033 30.233 30.816 30.000 20.000 10.542 40.123 20.121 10.019 10.000 10.000 10.463 30.454 40.045 40.128 40.557 30.235 20.441 30.063 40.484 40.000 20.308 40.000 10.000 20.000 10.318 40.000 10.000 20.000 10.545 30.543 30.164 40.734 20.000 10.000 10.215 40.371 30.198 20.743 20.205 40.062 40.000 20.079 20.000 10.683 30.547 30.142 20.000 30.441 20.579 40.000 10.464 20.098 20.041 10.000 10.590 30.000 20.000 10.373 10.494 20.174 20.105 30.001 40.895 30.222 30.537 30.307 30.180 30.625 20.000 10.000 30.591 40.609 30.398 20.000 10.766 40.014 40.638 40.000 10.377 20.004 30.206 40.609 40.465 1
Ji Hou, Benjamin Graham, Matthias Nie├čner, Saining Xie: Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts. CVPR 2021