Discovering Genetic Variants in Hypertrophic Cardiomyopathy with Multiple Machine Learning Techniques

Lozano-Paredes, Dafne; Bote-Curiel, Luis; Sabater-Molina, María; Bielza, Concha; Gimeno-Blanes, Juan R.; Muñoz-Romero, Sergio

Discovering Genetic Variants in Hypertrophic Cardiomyopathy with Multiple Machine Learning Techniques

dc.contributor.author	Lozano-Paredes, Dafne
dc.contributor.author	Bote-Curiel, Luis
dc.contributor.author	Sabater-Molina, María
dc.contributor.author	Bielza, Concha
dc.contributor.author	Gimeno-Blanes, Juan R.
dc.contributor.author	Muñoz-Romero, Sergio
dc.date.accessioned	2025-06-03T06:34:54Z
dc.date.available	2025-06-03T06:34:54Z
dc.date.issued	2025-05-26
dc.description.abstract	Hypertrophic cardiomyopathy is known to have strong genetic foundations. However, only some studies have addressed the complex network of co-expressed genes and variants that modify the phenotype. Machine learning methods offer robust information discovery when dealing with high-dimensional datasets. We aimed to perform relevance and interaction analysis on genetic variants from hypertrophic cardiomyopathy patients using diverse machine learning techniques, with the following stages: (a) Statistical univariate techniques (with various p-value adjustment methods) identified relevant variants; (b) Linear classifiers (support vector machines, Fisher discriminant analysis) provided combined relevance based on feature weights; (c) Informative variable identifier method and Bayesian networks explained inter-variant relationships; (d) Manifold learning of low-dimensional latent spaces gave interpretable representations of groups; (e) Linkage disequilibrium matrices and frequency tables discovered associations between variants. We analyzed 61 patients and 67 controls with genetic information comprising 216 variants from a genetic panel of 15 genes. Across all methodologies, ten variants were consistently identified as significant, with 22 total variants significant in at least three out of five methods. Machine learning has been found to detect disease-associated variants, including pathogenic founder variants (11:47357494, 11:47360070, 11:47372137). This methodology allows for identifying potential disease modulators while accounting for relevance and interactions among variants.
dc.identifier.citation	D. Lozano-Paredes et al., "Discovering Genetic Variants in Hypertrophic Cardiomyopathy with Multiple Machine Learning Techniques," in IEEE Transactions on Computational Biology and Bioinformatics, doi: 10.1109/TCBBIO.2025.3572833
dc.identifier.doi	10.1109/TCBBIO.2025.3572833
dc.identifier.issn	2998-4165 (online)
dc.identifier.uri	https://hdl.handle.net/10115/87677
dc.language.iso	en
dc.publisher	Institute of Electrical and Electronics Engineers
dc.rights	Attribution 4.0 International	en
dc.rights.accessRights	info:eu-repo/semantics/openAccess
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/
dc.subject	Genetics
dc.subject	Bioinformatics
dc.subject	Diseases
dc.subject	Genomics
dc.subject	Machine learning
dc.subject	Phenotypes
dc.subject	Vectors
dc.subject	Sequential analysis
dc.subject	Support vector machines
dc.subject	Random variables
dc.title	Discovering Genetic Variants in Hypertrophic Cardiomyopathy with Multiple Machine Learning Techniques
dc.type	Article

Archivos

Bloque original

Mostrando 1 - 1 de 1

Nombre:: Discovering_Genetic_Variants_in_Hypertrophic_Cardiomyopathy_with_Multiple_Machine_Learning_Techniques.pdf
Tamaño:: 5.01 MB
Formato:: Adobe Portable Document Format

Descargar

Colecciones

Artículos de Revista