Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion

Aleksandar Jevtić*1 Christoph Reich*1,2,4,5 Felix Wimbauer1,4 Oliver Hahn2 Christian Rupprecht3 Stefan Roth2,5,6 Daniel Cremers1,4,5

1TU Munich 2TU Darmstadt 3University of Oxford 4MCML 5ELIZA 6hessian.AI *equal contribution

Paper PDF Project Page URL Project Page URL License Framework

Overview

SceneDINO is unsupervised and infers 3D geometry and features from a single image in a feed-forward manner. Distilling and clustering SceneDINO's 3D feature field results in unsupervised semantic scene completion predictions. The method is trained using multi-view self-supervision.

Installation & Quick Start

Please refer to our Github Repo.

Citation

If you find our work useful, please consider giving it a star ⭐ and citing our paper.

@inproceedings{Jevtic:2025:SceneDINO,
    author  = {Aleksandar Jevti{\'c} and
               Christoph Reich and
               Felix Wimbauer and
               Oliver Hahn and
               Christian Rupprecht and
               Stefan Roth and
               Daniel Cremers},
    title   = {Feed-Forward {SceneDINO} for Unsupervised Semantic Scene Completion},
    journal = {IEEE/CVF International Conference on Computer Vision (ICCV)},
    year    = {2025},
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Space using jev-aleks/SceneDINO 1