3D Semantic Label with Limited Reconstructions Benchmark
The 3D semantic labeling task involves predicting a semantic labeling of a 3D scan mesh.
Evaluation and metrics
Our evaluation ranks all methods by the PASCAL VOC intersection-over-union (IoU) metric: IoU = TP/(TP+FP+FN), where TP, FP, and FN are the numbers of true positive, false positive, and false negative vertices, respectively. Predicted labels are evaluated per vertex over the respective 3D scan mesh. 3D approaches that operate on other representations, such as grids or points, should map their predicted labels onto the mesh vertices (an example of mapping a grid to mesh vertices is provided in the evaluation helpers).
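The metric described above can be sketched in a few lines of Python. This is an illustrative sketch, not the benchmark's evaluation script: the function names, the `ignore_label` convention (ground-truth vertices with that label are skipped), the NaN result for classes absent from both ground truth and prediction, and the dictionary-based voxel grid are all assumptions made for the example.

```python
import math
from typing import Dict, List, Sequence, Tuple

Voxel = Tuple[int, int, int]


def voxel_labels_to_vertices(
    vertices: Sequence[Tuple[float, float, float]],
    voxel_labels: Dict[Voxel, int],
    voxel_size: float,
    unlabeled: int = 0,
) -> List[int]:
    """Give each mesh vertex the label of the grid cell that contains it.

    Hypothetical helper: assumes an axis-aligned grid anchored at the origin.
    """
    labels = []
    for x, y, z in vertices:
        key = (
            math.floor(x / voxel_size),
            math.floor(y / voxel_size),
            math.floor(z / voxel_size),
        )
        labels.append(voxel_labels.get(key, unlabeled))
    return labels


def per_class_iou(
    gt: Sequence[int],
    pred: Sequence[int],
    class_ids: Sequence[int],
    ignore_label: int = 0,  # assumption: unannotated vertices carry this label
) -> Dict[int, float]:
    """IoU = TP / (TP + FP + FN) per class, counted over mesh vertices."""
    if len(gt) != len(pred):
        raise ValueError("need exactly one predicted label per vertex")
    pairs = [(g, p) for g, p in zip(gt, pred) if g != ignore_label]
    ious = {}
    for c in class_ids:
        tp = sum(1 for g, p in pairs if g == c and p == c)
        fp = sum(1 for g, p in pairs if g != c and p == c)
        fn = sum(1 for g, p in pairs if g == c and p != c)
        denom = tp + fp + fn
        # Assumption: a class that never occurs is reported as NaN.
        ious[c] = tp / denom if denom else float("nan")
    return ious


def mean_iou(ious: Dict[int, float]) -> float:
    """Unweighted mean over classes, skipping NaN entries."""
    valid = [v for v in ious.values() if not math.isnan(v)]
    return sum(valid) / len(valid) if valid else float("nan")


# Toy example: five vertices, two classes, label 0 means "unannotated".
gt_labels = [1, 1, 2, 2, 0]
pred_labels = [1, 2, 2, 2, 1]
scores = per_class_iou(gt_labels, pred_labels, class_ids=[1, 2])
print(scores[1])  # class 1: TP=1, FP=0, FN=1 -> 0.5
print(round(mean_iou(scores), 3))  # (0.5 + 2/3) / 2 -> 0.583
```

An unweighted mean over classes appears to match how the table's avg iou column is formed: the 20 class scores listed for WS3D_LR_Sem average to 0.684.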
This table lists the benchmark results for the 3D semantic label with limited reconstructions scenario.
Method | avg iou | bathtub | bed | bookshelf | cabinet | chair | counter | curtain | desk | door | floor | otherfurniture | picture | refrigerator | shower curtain | sink | sofa | table | toilet | wall | window |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
WS3D_LR_Sem | 0.684 (1) | 0.865 (1) | 0.761 (1) | 0.780 (1) | 0.644 (1) | 0.810 (2) | 0.445 (1) | 0.796 (1) | 0.596 (1) | 0.594 (1) | 0.945 (2) | 0.456 (1) | 0.234 (1) | 0.541 (1) | 0.793 (1) | 0.723 (1) | 0.761 (1) | 0.618 (1) | 0.906 (1) | 0.822 (1) | 0.598 (2) |
CSG_3DSegNet | 0.480 (4) | 0.521 (5) | 0.715 (3) | 0.562 (2) | 0.389 (6) | 0.693 (8) | 0.307 (4) | 0.157 (8) | 0.501 (3) | 0.321 (8) | 0.927 (8) | 0.219 (6) | 0.074 (8) | 0.329 (2) | 0.485 (2) | 0.504 (2) | 0.596 (8) | 0.458 (5) | 0.715 (4) | 0.714 (8) | 0.418 (4) |
CSC_LR_SEM | 0.460 (5) | 0.472 (7) | 0.731 (2) | 0.465 (3) | 0.398 (4) | 0.817 (1) | 0.292 (5) | 0.442 (5) | 0.311 (8) | 0.387 (6) | 0.939 (4) | 0.218 (7) | 0.181 (4) | 0.302 (4) | 0.076 (4) | 0.449 (4) | 0.743 (2) | 0.430 (7) | 0.444 (8) | 0.737 (5) | 0.368 (7) |
NWSYY | 0.517 (2) | 0.725 (3) | 0.619 (6) | 0.396 (4) | 0.455 (3) | 0.766 (5) | 0.327 (3) | 0.570 (2) | 0.477 (5) | 0.427 (3) | 0.943 (3) | 0.288 (2) | 0.220 (3) | 0.274 (5) | 0.135 (3) | 0.471 (3) | 0.697 (3) | 0.504 (2) | 0.714 (5) | 0.767 (3) | 0.566 (3) |
DE-3DLearner LR | 0.508 (3) | 0.824 (2) | 0.530 (8) | 0.314 (5) | 0.479 (2) | 0.746 (7) | 0.334 (2) | 0.490 (4) | 0.508 (2) | 0.477 (2) | 0.950 (1) | 0.269 (3) | 0.221 (2) | 0.324 (3) | 0.029 (6) | 0.421 (5) | 0.626 (6) | 0.490 (3) | 0.727 (3) | 0.782 (2) | 0.620 (1) |
PointContrast_LR_SEM | 0.438 (7) | 0.517 (6) | 0.659 (5) | 0.251 (6) | 0.332 (8) | 0.783 (3) | 0.244 (8) | 0.408 (6) | 0.411 (7) | 0.409 (4) | 0.935 (6) | 0.206 (8) | 0.119 (7) | 0.200 (7) | 0.048 (5) | 0.355 (6) | 0.682 (4) | 0.414 (8) | 0.647 (6) | 0.743 (4) | 0.391 (5) |
Viewpoint_BN_LR_AIR | 0.452 (6) | 0.587 (4) | 0.569 (7) | 0.172 (7) | 0.391 (5) | 0.769 (4) | 0.290 (6) | 0.512 (3) | 0.501 (3) | 0.373 (7) | 0.935 (6) | 0.251 (4) | 0.173 (5) | 0.201 (6) | 0.003 (8) | 0.352 (7) | 0.619 (7) | 0.454 (6) | 0.783 (2) | 0.719 (7) | 0.390 (6) |
Scratch_LR_SEM | 0.401 (8) | 0.240 (8) | 0.674 (4) | 0.095 (8) | 0.347 (7) | 0.763 (6) | 0.271 (7) | 0.204 (7) | 0.449 (6) | 0.406 (5) | 0.936 (5) | 0.220 (5) | 0.127 (6) | 0.199 (8) | 0.004 (7) | 0.348 (8) | 0.665 (5) | 0.477 (4) | 0.493 (7) | 0.730 (6) | 0.366 (8) |

Each cell gives the IoU followed, in parentheses, by the method's rank in that column.
WS3D_LR_Sem: Kangcheng Liu. WS3D: Weakly Supervised 3D Scene Segmentation with Region-Level Boundary Awareness and Instance Discrimination. European Conference on Computer Vision (ECCV), 2022.
DE-3DLearner LR: Ping-Chung Yu, Cheng Sun, Min Sun. Data Efficient 3D Learner via Knowledge Transferred from 2D Model. European Conference on Computer Vision (ECCV), 2022.