Results for LRPNet
Submission data
Full name | Long Range Pooling for 3D Large-Scale Scene Understanding |
Description | Inspired by the success of recent vision transformers
(ViTs) and large kernel design in convolutional neural net- works (CNNs), in this paper, we analyse and explore es- sential reasons for their success and claim two factors that are critical for 3D large-scale scene understanding, i.e., (1) larger receptive field and (2) more non-linear operations. The former is responsible for global context and the lat- ter can enhance the capacity of the network. To achieve above properties, we propose a simple yet effective long range pooling (LRP) module by using dilated max pooling |
Input Data Types | Uses Color Uses 3D |
Programming language(s) | python |
Hardware | 3090 |
Submission creation date | 9 Nov, 2022 |
Last edited | 9 Nov, 2022 |
3D semantic label results
Info | avg iou | bathtub | bed | bookshelf | cabinet | chair | counter | curtain | desk | door | floor | otherfurniture | picture | refrigerator | shower curtain | sink | sofa | table | toilet | wall | window |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0.742 | 0.816 | 0.806 | 0.807 | 0.752 | 0.828 | 0.575 | 0.839 | 0.699 | 0.637 | 0.954 | 0.520 | 0.320 | 0.755 | 0.834 | 0.760 | 0.772 | 0.676 | 0.915 | 0.862 | 0.717 |