On the representation and methodology for wide and short range head pose estimation

dc.contributor.authorCobo, Alejandro
dc.contributor.authorValle, Roberto
dc.contributor.authorBuenaposada, José M.
dc.contributor.authorBaumela, Luis
dc.date.accessioned2024-06-11T09:55:05Z
dc.date.available2024-06-11T09:55:05Z
dc.date.issued2024-05
dc.description.abstractHead pose estimation (HPE) is a problem of interest in computer vision to improve the performance of face processing tasks in semi-frontal or profile settings. Recent applications require the analysis of faces in the full 360° rotation range. Traditional approaches to solve the semi-frontal and profile cases are not directly amenable for the full rotation case. In this paper we analyze the methodology for short- and wide-range HPE and discuss which representations and metrics are adequate for each case. We show that the popular Euler angles representation is a good choice for short-range HPE, but not at extreme rotations. However, the Euler angles’ gimbal lock problem prevents them from being used as a valid metric in any setting. We also revisit the current cross-data set evaluation methodology and note that the lack of alignment between the reference systems of the training and test data sets negatively biases the results of all articles in the literature. We introduce a procedure to quantify this misalignment and a new methodology for cross-data set HPE that establishes new, more accurate, SOTA for the 300W-LP/Biwi benchmark. We also propose a generalization of the geodesic angular distance metric that enables the construction of a loss that controls the contribution of each training sample to the optimization of the model. Finally, we introduce a wide range HPE benchmark based on the CMU Panoptic data set. code:https://github.com/pcr-upm/opal23_headposees
dc.identifier.citationAlejandro Cobo, Roberto Valle, José M. Buenaposada, Luis Baumela, On the representation and methodology for wide and short range head pose estimation, Pattern Recognition, Volume 149, 2024, 110263, ISSN 0031-3203, https://doi.org/10.1016/j.patcog.2024.110263es
dc.identifier.doi10.1016/j.patcog.2024.110263es
dc.identifier.issn0031-3203 (print)
dc.identifier.issn1873-5142 (online)
dc.identifier.urihttps://hdl.handle.net/10115/33673
dc.language.isoenges
dc.publisherElsevieres
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internacional*
dc.rights.accessRightsinfo:eu-repo/semantics/openAccesses
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subjectShort- and wide-range head pose estimationes
dc.subjectOrientation representationes
dc.subjectError metricses
dc.subjectCross-data set evaluation methodologyes
dc.titleOn the representation and methodology for wide and short range head pose estimationes
dc.typeinfo:eu-repo/semantics/articlees

Archivos

Bloque original

Mostrando 1 - 1 de 1
Cargando...
Miniatura
Nombre:
1-s2.0-S0031320324000141-main.pdf
Tamaño:
2.31 MB
Formato:
Adobe Portable Document Format
Descripción: