A visual questioning answering approach to enhance robot localization in indoor environments

dc.contributor.author: Peña-Narvaez, Juan Diego
dc.contributor.author: Martín, Francisco
dc.contributor.author: Guerrero Hernández, José Miguel
dc.contributor.author: Pérez-Rodríguez, Rodrigo
dc.date.accessioned: 2023-12-19T11:31:21Z
dc.date.available: 2023-12-19T11:31:21Z
dc.date.issued: 2023-11-27
dc.description: Use of a visual large language model to localize a robot in an indoor environment.
dc.description.abstract: Navigating robots with precision in complex environments remains a significant challenge. In this article, we present an innovative approach to enhance robot localization in dynamic and intricate spaces like homes and offices. We leverage Visual Question Answering (VQA) techniques to integrate semantic insights into traditional mapping methods, formulating a novel position hypothesis generation to assist localization methods, while also addressing challenges related to mapping accuracy and localization reliability. Our methodology combines a probabilistic approach with the latest advances in Monte Carlo Localization methods and Visual Language models. The integration of our hypothesis generation mechanism results in more robust robot localization compared to existing approaches. Experimental validation demonstrates the effectiveness of our approach, surpassing state-of-the-art multi-hypothesis algorithms in both position estimation and particle quality. This highlights the potential for accurate self-localization, even in symmetric environments with large corridor spaces. Furthermore, our approach exhibits a high recovery rate from deliberate position alterations, showcasing its robustness. By merging visual sensing, semantic mapping, and advanced localization techniques, we open new horizons for robot navigation. Our work bridges the gap between visual perception, semantic understanding, and traditional mapping, enabling robots to interact with their environment through questions and enrich their map with valuable insights. The code for this project is available on GitHub: https://github.com/juandpenan/topology_nav_ros2
dc.identifier.citation: Peña-Narvaez JD, Martín F, Guerrero JM and Pérez-Rodríguez R (2023) A visual questioning answering approach to enhance robot localization in indoor environments. Front. Neurorobot. 17:1290584. doi: 10.3389/fnbot.2023.1290584
dc.identifier.doi: 10.3389/fnbot.2023.1290584
dc.identifier.uri: https://hdl.handle.net/10115/27453
dc.language.iso: eng
dc.publisher: Frontiers in Neurorobotics
dc.rights: Attribution 4.0 International
dc.rights.accessRights: info:eu-repo/semantics/openAccess
dc.rights.uri: http://creativecommons.org/licenses/by/4.0/
dc.subject: visual question answering
dc.subject: robot localization
dc.subject: robot navigation
dc.subject: semantic map
dc.subject: robot mapping
dc.title: A visual questioning answering approach to enhance robot localization in indoor environments
dc.type: info:eu-repo/semantics/article
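As an illustration only (not code from the paper), the abstract's idea of using a VQA-derived semantic cue to seed Monte Carlo Localization hypotheses could be sketched as follows; the semantic map, room labels, particle count, and noise value are all invented for this example.

```python
import random

# Hypothetical semantic map: room label -> candidate (x, y) anchor points
# where that room appears in the metric map.
SEMANTIC_MAP = {
    "kitchen": [(1.0, 2.0), (1.5, 2.5)],
    "corridor": [(4.0, 0.0), (6.0, 0.0), (8.0, 0.0)],
    "office": [(10.0, 3.0)],
}

def generate_hypotheses(room_label, n_particles=100, noise=0.3):
    """Spread candidate particles around map anchors matching a VQA answer.

    If the VQA model answers e.g. "kitchen" to "What room is this?",
    hypotheses are sampled near known kitchen locations instead of
    uniformly over the map.
    """
    anchors = SEMANTIC_MAP.get(room_label, [])
    if not anchors:
        return []  # no semantic cue: fall back to the plain MCL prior
    particles = []
    for _ in range(n_particles):
        ax, ay = random.choice(anchors)
        particles.append((ax + random.gauss(0.0, noise),
                          ay + random.gauss(0.0, noise)))
    return particles

hypotheses = generate_hypotheses("kitchen", n_particles=50)
```

In a full system these hypotheses would be merged into the MCL particle filter's proposal distribution; here they are simply returned as a list of poses.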

Files

Name: fnbot-17-1290584 (1).pdf
Size: 2.77 MB
Format: Adobe Portable Document Format