Abstract

In this work, we present a new dataset for monocular depth estimation created by extracting images, dense depth maps, and odometer data from a realistic video game simulation, Euro Truck Simulator 2. The dataset is used to train state-of-the-art depth estimation models in both supervised and unsupervised ways, which are evaluated against real-world sequences. Our results demonstrate that models trained exclusively with synthetic data achieve satisfactory performance in the real domain. The quantitative evaluation brings light to possible causes of domain gap in monocular depth estimation. Specifically, we discuss the effects of coarse-grained ground-truth depth maps in contrast to the fine-grained depth estimation. The dataset and code for data extraction and experiments are released open-source.
Loading...

Quotes

0 citations in WOS
0 citations in

Journal Title

Journal ISSN

Volume Title

Publisher

Springer Nature

Description

This is the author’s version of the manuscript that was accepted for publication in the proceedings of IbPRIA 2023, published in the Lecture Notes in Computer Science (LNCS) series by Springer. The final published version, including copyediting and typesetting by Springer, is available at: https://link.springer.com/chapter/10.1007/978-3-031-36616-1_30

Citation

María-Arribas, D., Cuesta-Infante, A., Pantrigo, J.J. (2023). The ETS2 Dataset, Synthetic Data from Video Games for Monocular Depth Estimation. In: Pertusa, A., Gallego, A.J., Sánchez, J.A., Domingues, I. (eds) Pattern Recognition and Image Analysis. IbPRIA 2023. Lecture Notes in Computer Science, vol 14062. Springer, Cham. https://doi.org/10.1007/978-3-031-36616-1_30

Endorsement

Review

Supplemented By

Referenced By

Statistics

Views
32
Downloads
5

Bibliographic managers