Submitted anonymously.

Submission data

Full nameLong Range Pooling for 3D Large-Scale Scene Understanding
DescriptionInspired by the success of recent vision transformers
(ViTs) and large kernel design in convolutional neural net-
works (CNNs), in this paper, we analyse and explore es-
sential reasons for their success and claim two factors that
are critical for 3D large-scale scene understanding, i.e., (1)
larger receptive field and (2) more non-linear operations.
The former is responsible for global context and the lat-
ter can enhance the capacity of the network. To achieve
above properties, we propose a simple yet effective long
range pooling (LRP) module by using dilated max pooling
Input Data TypesUses Color        Uses 3D
Programming language(s)python
Hardware3090
Submission creation date9 Nov, 2022
Last edited9 Nov, 2022

3D semantic label results

Infoavg ioubathtubbedbookshelfcabinetchaircountercurtaindeskdoorfloorotherfurniturepicturerefrigeratorshower curtainsinksofatabletoiletwallwindow
0.7420.8160.8060.8070.7520.8280.5750.8390.6990.6370.9540.5200.3200.7550.8340.7600.7720.6760.9150.8620.717