Full name | Self-Supervised Model Adaptation for Multimodal Semantic Segmentation |
Description | Benchmarking the SSMA multimodal semantic segmentation framework, which dynamically adapts the fusion of modality-specific features in a self-supervised manner while remaining sensitive to object category, spatial location, and scene context. The model was trained with the visual RGB image and the HHA-encoded depth image as network inputs. The architecture consists of two modality-specific encoder streams whose intermediate representations are fused into a single decoder via the SSMA fusion mechanism, which adaptively combines complementary features (a sketch of the fusion block follows the table). |
Publication title | Self-Supervised Model Adaptation for Multimodal Semantic Segmentation |
Publication authors | Abhinav Valada, Rohit Mohan, Wolfram Burgard |
Publication venue | International Journal of Computer Vision, 2019 |
Publication URL | https://arxiv.org/abs/1808.03833 |
Input Data Types | Uses Color, Uses Geometry, Uses 2D |
Programming language(s) | Python, TensorFlow |
Hardware | Intel Xeon E5 CPU, NVIDIA TITAN X (Pascal) |
Website | http://deepscene.cs.uni-freiburg.de |
Source code or download URL | https://github.com/DeepSceneSeg/SSMA |
Submission creation date | 29 Dec, 2018 |
Last edited | 18 Jul, 2019 |
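
For reference, below is a minimal sketch of an SSMA fusion block in TensorFlow/Keras, following the description above: the two modality-specific feature maps are concatenated, passed through a channel-reduction bottleneck, and gated by a sigmoid that recalibrates the concatenation before a final convolution projects it back to the per-modality channel count for the decoder. The function name `ssma_block`, the `reduction` ratio, and the use of the Keras functional API are illustrative assumptions; the authors' actual implementation is available in the linked repository.

```python
import tensorflow as tf


def ssma_block(feat_a, feat_b, reduction=16):
    """Sketch of an SSMA-style fusion block (names and ratio are assumptions)."""
    # Concatenate the two modality-specific feature maps along channels.
    concat = tf.keras.layers.Concatenate(axis=-1)([feat_a, feat_b])
    channels = concat.shape[-1]
    # Bottleneck conv compresses the concatenation; a second conv with a
    # sigmoid produces per-element recalibration weights.
    gate = tf.keras.layers.Conv2D(channels // reduction, 3, padding="same",
                                  activation="relu")(concat)
    gate = tf.keras.layers.Conv2D(channels, 3, padding="same",
                                  activation="sigmoid")(gate)
    # Reweight the concatenated features, then project back to the
    # per-modality channel count and normalize for the decoder.
    weighted = tf.keras.layers.Multiply()([concat, gate])
    fused = tf.keras.layers.Conv2D(feat_a.shape[-1], 3, padding="same")(weighted)
    return tf.keras.layers.BatchNormalization()(fused)


# Usage sketch: fuse RGB and HHA/depth encoder features of matching shape.
rgb_feat = tf.keras.Input(shape=(60, 80, 512))
depth_feat = tf.keras.Input(shape=(60, 80, 512))
fused = ssma_block(rgb_feat, depth_feat)
model = tf.keras.Model([rgb_feat, depth_feat], fused)
```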