Learning and visualizing chronic latent representations using electronic health records

Resumen

Background: Nowadays, patients with chronic diseases such as diabetes and hypertension have reached alarming numbers worldwide. These diseases increase the risk of developing acute complications and involve a substantial economic burden and demand for health resources. The widespread adoption of Electronic Health Records (EHRs) is opening great opportunities for supporting decision-making. Nevertheless, data extracted from EHRs are complex (heterogeneous, high-dimensional and usually noisy), hampering the knowledge extraction with conventional approaches. Methods: We propose the use of the Denoising Autoencoder (DAE), a Machine Learning (ML) technique allowing to transform high-dimensional data into latent representations (LRs), thus addressing the main challenges with clinical data. We explore in this work how the combination of LRs with a visualization method can be used to map the patient data in a two-dimensional space, gaining knowledge about the distribution of patients with diferent chronic conditions. Furthermore, this representation can be also used to characterize the patient’s health status evolution, which is of paramount importance in the clinical setting. Results: To obtain clinical LRs, we considered real-world data extracted from EHRs linked to the University Hospital of Fuenlabrada in Spain. Experimental results showed the great potential of DAEs to identify patients with clinical patterns linked to hyper‑ tension, diabetes and multimorbidity. The procedure allowed us to fnd patients with the same main chronic disease but diferent clinical characteristics. Thus, we identifed two kinds of diabetic patients with diferences in their drug therapy (insulin and non-insulin dependant), and also a group of women afected by hypertension and gestational diabetes. We also present a proof of concept for mapping the health status evolution of synthetic patients when considering the most signifcant diagnoses and drugs associated with chronic patients. Conclusion: Our results highlighted the value of ML techniques to extract clinical knowledge, supporting the identifcation of patients with certain chronic conditions. Furthermore, the patient’s health status progression on the two-dimensional space might be used as a tool for clinicians aiming to characterize health conditions and identify their more relevant clinical codes.

Descripción

Citación

Chushig-Muzo, D., Soguero-Ruiz, C., Miguel Bohoyo, P.d. et al. Learning and visualizing chronic latent representations using electronic health records. BioData Mining 15, 18 (2022). https://doi.org/10.1186/s13040-022-00303-z
license logo
Excepto si se señala otra cosa, la licencia del ítem se describe como Atribución 4.0 Internacional