Limited Reconstructions Semantic Label Results

The 3D semantic labeling task involves predicting a semantic labeling of a 3D scan mesh.

Evaluation and metrics

Our evaluation ranks all methods according to the PASCAL VOC intersection-over-union metric (IoU). IoU = TP/(TP+FP+FN), where TP, FP, and FN are the numbers of true positive, false positive, and false negative pixels, respectively. Predicted labels are evaluated per-vertex over the respective 3D scan mesh; for 3D approaches that operate on other representations like grids or points, the predicted labels should be mapped onto the mesh vertices (e.g., one such example for grid to mesh vertices is provided in the evaluation helpers).

This table lists the benchmark results for the 3D semantic label with limited reconstructions scenario.

Method	avg iou	bathtub	bed	bookshelf	cabinet	chair	counter	curtain	desk	door	floor	otherfurniture	picture	refrigerator	shower curtain	sink	sofa	table	toilet	wall	window

WS3D_LR_Sem	0.684 1	0.865 1	0.761 1	0.780 1	0.644 1	0.810 2	0.445 1	0.796 1	0.596 1	0.594 1	0.945 2	0.456 1	0.234 1	0.541 1	0.793 1	0.723 1	0.761 1	0.618 1	0.906 1	0.822 1	0.598 2
Kangcheng Liu: WS3D: Weakly Supervised 3D Scene Segmentation with Region-Level Boundary Awareness and Instance Discrimination. European Conference on Computer Vision (ECCV), 2022
CSG_3DSegNet	0.480 4	0.521 5	0.715 3	0.562 2	0.389 6	0.693 8	0.307 4	0.157 8	0.501 3	0.321 8	0.927 8	0.219 6	0.074 8	0.329 2	0.485 2	0.504 2	0.596 8	0.458 5	0.715 4	0.714 8	0.418 4

CSC_LR_SEM	0.460 5	0.472 7	0.731 2	0.465 3	0.398 4	0.817 1	0.292 5	0.442 5	0.311 8	0.387 6	0.939 4	0.218 7	0.181 4	0.302 4	0.076 4	0.449 4	0.743 2	0.430 7	0.444 8	0.737 5	0.368 7

NWSYY	0.517 2	0.725 3	0.619 6	0.396 4	0.455 3	0.766 5	0.327 3	0.570 2	0.477 5	0.427 3	0.943 3	0.288 2	0.220 3	0.274 5	0.135 3	0.471 3	0.697 3	0.504 2	0.714 5	0.767 3	0.566 3

DE-3DLearner LR	0.508 3	0.824 2	0.530 8	0.314 5	0.479 2	0.746 7	0.334 2	0.490 4	0.508 2	0.477 2	0.950 1	0.269 3	0.221 2	0.324 3	0.029 6	0.421 5	0.626 6	0.490 3	0.727 3	0.782 2	0.620 1
Ping-Chung Yu, Cheng Sun, Min Sun: Data Efficient 3D Learner via Knowledge Transferred from 2D Model. ECCV 2022
PointContrast_LR_SEM	0.438 7	0.517 6	0.659 5	0.251 6	0.332 8	0.783 3	0.244 8	0.408 6	0.411 7	0.409 4	0.935 6	0.206 8	0.119 7	0.200 7	0.048 5	0.355 6	0.682 4	0.414 8	0.647 6	0.743 4	0.391 5

Viewpoint_BN_LR_AIR	0.452 6	0.587 4	0.569 7	0.172 7	0.391 5	0.769 4	0.290 6	0.512 3	0.501 3	0.373 7	0.935 6	0.251 4	0.173 5	0.201 6	0.003 8	0.352 7	0.619 7	0.454 6	0.783 2	0.719 7	0.390 6

Scratch_LR_SEM	0.401 8	0.240 8	0.674 4	0.095 8	0.347 7	0.763 6	0.271 7	0.204 7	0.449 6	0.406 5	0.936 5	0.220 5	0.127 6	0.199 8	0.004 7	0.348 8	0.665 5	0.477 4	0.493 7	0.730 6	0.366 8

3D Semantic Label with Limited Reconstructions Benchmark