TY - GEN
T1 - LocalBins: Improving Depth Estimation by Learning Local Distributions
AU - Alhashim, Ibraheem
AU - Wonka, Peter
N1 - KAUST Repository Item: Exported on 2022-10-29
Acknowledged KAUST grant number(s): OSR-CRG2018-3730
Acknowledgements: This work was supported by the KAUST Office of Sponsored Research (OSR) under Award No. OSR-CRG2018-3730.
PY - 2022/10/23
Y1 - 2022/10/23
N2 - We propose a novel architecture for depth estimation from a single image. The architecture itself is based on the popular encoder-decoder architecture that is frequently used as a starting point for all dense regression tasks. We build on AdaBins which estimates a global distribution of depth values for the input image and evolve the architecture in two ways. First, instead of predicting global depth distributions, we predict depth distributions of local neighborhoods at every pixel. Second, instead of predicting depth distributions only towards the end of the decoder, we involve all layers of the decoder. We call this new architecture LocalBins. Our results demonstrate a clear improvement over the state-of-the-art in all metrics on the NYU-Depth V2 dataset. Code and pretrained models will be made publicly available.
AB - We propose a novel architecture for depth estimation from a single image. The architecture itself is based on the popular encoder-decoder architecture that is frequently used as a starting point for all dense regression tasks. We build on AdaBins which estimates a global distribution of depth values for the input image and evolve the architecture in two ways. First, instead of predicting global depth distributions, we predict depth distributions of local neighborhoods at every pixel. Second, instead of predicting depth distributions only towards the end of the decoder, we involve all layers of the decoder. We call this new architecture LocalBins. Our results demonstrate a clear improvement over the state-of-the-art in all metrics on the NYU-Depth V2 dataset. Code and pretrained models will be made publicly available.
UR - http://hdl.handle.net/10754/676052
UR - https://link.springer.com/10.1007/978-3-031-19769-7_28
U2 - 10.1007/978-3-031-19769-7_28
DO - 10.1007/978-3-031-19769-7_28
M3 - Conference contribution
SN - 9783031197680
SP - 480
EP - 496
BT - Lecture Notes in Computer Science
PB - Springer Nature Switzerland
ER -