Learning the Value Systems of Agents with Preference-based and Inverse Reinforcement Learning

dc.affiliation.dptoInformática y Estadística
dc.contributor.authorHolgado-Sánchez, Andrés
dc.contributor.authorBillhardt, Holger
dc.contributor.authorFernández, Alberto
dc.contributor.authorOssowski, Sascha
dc.contributor.funderMCIN/AEI/10.13039/501100011033
dc.contributor.funderMICIU/AEI/10.13039/501100011033/FEDER
dc.contributor.funderUniversidad Rey Juan Carlos
dc.date.accessioned2026-02-04T14:44:26Z
dc.date.issued2026-02-03
dc.description.abstractAgreement Technologies refer to open computer systems in which autonomous software agents interact with one another, typically on behalf of humans, in order to come to mutually acceptable agreements. With the advance of AI systems in recent years, it has become apparent that such agreements, in order to be acceptable to the involved parties, must remain aligned with ethical principles and moral values. However, this is notoriously difficult to ensure, especially as different human users (and their software agents) may hold different value systems, i.e. they may differently weigh the importance of individual moral values. Furthermore, it is often hard to specify the precise meaning of a value in a particular context in a computational manner. Methods to estimate value systems based on human-engineered specifications, e.g. based on value surveys, are limited in scale due to the need for intense human moderation. In this article, we propose a novel method to automatically learn value systems from observations and human demonstrations. In particular, we propose a formal model of the value system learning problem, its instantiation to sequential decision-making domains based on multi-objective Markov decision processes, as well as tailored preference-based and inverse reinforcement learning algorithms to infer value grounding functions and value systems. The approach is illustrated and evaluated by two simulated use cases.
dc.description.sponsorshipThis work has been supported by grant VAE: TED2021-131295B-C33 funded by MCIN/AEI/10.13039/501100011033 and by the “European Union NextGenerationEU/PRTR”, by grant COSASS: PID2021-123673OB-C32 funded by MCIN/AEI/10.13039/501100011033 and by “ERDF A way of making Europe”, and by project grant EVASAI: PID2024-158227NB-C32 funded by MICIU/AEI/10.13039/501100011033/FEDER, UE. Andrés Holgado-Sánchez has received funding by grant "Contratos Predoctorales de Personal Investigador en Formación en Departamentos de la Universidad Rey Juan Carlos (C1 PREDOC 2025)", funded by Universidad Rey Juan Carlos.
dc.identifier.citationHolgado-Sánchez, A., Billhardt, H., Fernández, A. et al. Learning the value systems of agents with preference-based and inverse reinforcement learning. Auton Agent Multi-Agent Syst 40, 4 (2026). https://doi.org/10.1007/s10458-026-09732-0
dc.identifier.doihttps://doi.org/10.1007/s10458-026-09732-0
dc.identifier.issn1387-2532 (print)
dc.identifier.issn1573-7454 (online)
dc.identifier.publicationfirstpage1
dc.identifier.publicationissue4
dc.identifier.publicationlastpage42
dc.identifier.publicationtitleAutonomous Agents and Multi-Agent Systems
dc.identifier.publicationvolume40
dc.identifier.urihttps://hdl.handle.net/10115/159697
dc.language.isoen
dc.publisherSpringer
dc.relation.projectCodeTED2021-131295B-C33
dc.relation.projectCodePID2021-123673OB-C32
dc.relation.projectCodePID2024-158227NB-C32
dc.relation.projectNameVAE
dc.relation.projectNameCOSSAS
dc.relation.projectNameEVASAI
dc.rightsAttribution 4.0 Internationalen
dc.rights.accessRightsinfo:eu-repo/semantics/embargoedAccess
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectValue alignment
dc.subjectValue systems
dc.subjectValue-aware decision-making
dc.subjectValue Learning
dc.subjectInverse Reinforcement Learning
dc.subjectPreference-based Reinforcement Learning
dc.titleLearning the Value Systems of Agents with Preference-based and Inverse Reinforcement Learning
dc.typeArticle
dc.type.hasVersionhttp://purl.org/coar/version/c_ab4af688f83e57aa

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Name:
s10458-026-09732-0.pdf
Size:
5.28 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Name:
license.txt
Size:
2.96 KB
Format:
Item-specific license agreed upon to submission
Description: