Interpreting clinical latent representations using autoencoders and probabilistic models

Electronic health records (EHRs) are a valuable data source that, combined with deep learning (DL) methods, has yielded important results in several domains and helped support decision-making. Owing to the remarkable advances achieved by DL-based models, autoencoders (AEs) are becoming widely used in health care. However, AE-based models rely on nonlinear transformations, which makes them black boxes and leads to a lack of interpretability, a property that is vital in the clinical setting. To obtain insights from AE latent representations, we propose a methodology that combines probabilistic models based on Gaussian mixture models with hierarchical clustering supported by the Kullback-Leibler divergence. To validate the methodology from a clinical viewpoint, we used real-world data extracted from EHRs of the University Hospital of Fuenlabrada (Spain). The records were associated with healthy patients and with chronic hypertensive and diabetic patients. Experimental results showed that our approach can find groups of patients with similar health conditions by identifying patterns associated with diagnosis and drug codes. This work opens up promising opportunities for interpreting the representations obtained by AE-based models, shedding light on the decision-making process of clinical experts in daily practice.
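As a rough illustration of how Gaussian components in a latent space might be compared with the Kullback-Leibler divergence before hierarchical clustering, the sketch below computes the closed-form KL divergence between multivariate Gaussians and builds a pairwise distance matrix. It is not the authors' implementation: the symmetrisation KL(i||j) + KL(j||i), the function names, and the component means and covariances are all assumptions made for this example. In practice the means and covariances would come from a Gaussian mixture model fitted on the AE latent representations.

```python
import numpy as np


def kl_gaussian(mu0, cov0, mu1, cov1):
    """Closed-form KL(N0 || N1) between two multivariate Gaussians."""
    k = mu0.shape[0]
    cov1_inv = np.linalg.inv(cov1)
    diff = mu1 - mu0
    # slogdet is numerically safer than log(det(...)).
    _, logdet0 = np.linalg.slogdet(cov0)
    _, logdet1 = np.linalg.slogdet(cov1)
    return 0.5 * (
        np.trace(cov1_inv @ cov0)
        + diff @ cov1_inv @ diff
        - k
        + logdet1
        - logdet0
    )


def symmetric_kl_matrix(means, covs):
    """Pairwise symmetrised KL between components (an assumed choice)."""
    n = len(means)
    dist = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            d = kl_gaussian(means[i], covs[i], means[j], covs[j])
            d += kl_gaussian(means[j], covs[j], means[i], covs[i])
            dist[i, j] = dist[j, i] = d
    return dist


# Hypothetical components in a 2-D latent space (illustrative values only).
means = [np.array([0.0, 0.0]), np.array([0.5, 0.0]), np.array([5.0, 5.0])]
covs = [np.eye(2), np.eye(2), np.eye(2)]
dist = symmetric_kl_matrix(means, covs)
```

The resulting matrix is symmetric with a zero diagonal, so it could be passed to an agglomerative clustering routine to merge components that describe similar patient groups; in this toy setting the first two components end up much closer to each other than either is to the third.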

Keywords

Autoencoder, Learning latent representations, Gaussian mixture model, Clustering, Chronic diseases, Electronic health records

Citation

David Chushig-Muzo, Cristina Soguero-Ruiz, Pablo de Miguel-Bohoyo, Inmaculada Mora-Jiménez, Interpreting clinical latent representations using autoencoders and probabilistic models, Artificial Intelligence in Medicine, Volume 122, 2021, 102211, ISSN 0933-3657, https://doi.org/10.1016/j.artmed.2021.102211. (https://www.sciencedirect.com/science/article/pii/S0933365721002049)

Collections

Journal Articles

Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivatives 4.0 International
