Discovering Genetic Variants in Hypertrophic Cardiomyopathy with Multiple Machine Learning Techniques
dc.contributor.author | Lozano-Paredes, Dafne | |
dc.contributor.author | Bote-Curiel, Luis | |
dc.contributor.author | Sabater-Molina, María | |
dc.contributor.author | Bielza, Concha | |
dc.contributor.author | Gimeno-Blanes, Juan R. | |
dc.contributor.author | Muñoz-Romero, Sergio | |
dc.date.accessioned | 2025-06-03T06:34:54Z | |
dc.date.available | 2025-06-03T06:34:54Z | |
dc.date.issued | 2025-05-26 | |
dc.description.abstract | Hypertrophic cardiomyopathy is known to have strong genetic foundations. However, only some studies have addressed the complex network of co-expressed genes and variants that modify the phenotype. Machine learning methods offer robust information discovery when dealing with high-dimensional datasets. We aimed to perform relevance and interaction analysis on genetic variants from hypertrophic cardiomyopathy patients using diverse machine learning techniques, with the following stages: (a) Statistical univariate techniques (with various p-value adjustment methods) identified relevant variants; (b) Linear classifiers (support vector machines, Fisher discriminant analysis) provided combined relevance based on feature weights; (c) Informative variable identifier method and Bayesian networks explained inter-variant relationships; (d) Manifold learning of low-dimensional latent spaces gave interpretable representations of groups; (e) Linkage disequilibrium matrices and frequency tables discovered associations between variants. We analyzed 61 patients and 67 controls with genetic information comprising 216 variants from a genetic panel of 15 genes. Across all methodologies, ten variants were consistently identified as significant, with 22 total variants significant in at least three out of five methods. Machine learning has been found to detect disease-associated variants, including pathogenic founder variants (11:47357494, 11:47360070, 11:47372137). This methodology allows for identifying potential disease modulators while accounting for relevance and interactions among variants. | |
dc.identifier.citation | D. Lozano-Paredes et al., "Discovering Genetic Variants in Hypertrophic Cardiomyopathy with Multiple Machine Learning Techniques," in IEEE Transactions on Computational Biology and Bioinformatics, doi: 10.1109/TCBBIO.2025.3572833 | |
dc.identifier.doi | 10.1109/TCBBIO.2025.3572833 | |
dc.identifier.issn | 2998-4165 (online) | |
dc.identifier.uri | https://hdl.handle.net/10115/87677 | |
dc.language.iso | en | |
dc.publisher | Institute of Electrical and Electronics Engineers | |
dc.rights | Attribution 4.0 International | en |
dc.rights.accessRights | info:eu-repo/semantics/openAccess | |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
dc.subject | Genetics | |
dc.subject | Bioinformatics | |
dc.subject | Diseases | |
dc.subject | Genomics | |
dc.subject | Machine learning | |
dc.subject | Phenotypes | |
dc.subject | Vectors | |
dc.subject | Sequential analysis | |
dc.subject | Support vector machines | |
dc.subject | Random variables | |
dc.title | Discovering Genetic Variants in Hypertrophic Cardiomyopathy with Multiple Machine Learning Techniques | |
dc.type | Article |
Files
Original bundle
- Name: Discovering_Genetic_Variants_in_Hypertrophic_Cardiomyopathy_with_Multiple_Machine_Learning_Techniques.pdf
- Size: 5.01 MB
- Format: Adobe Portable Document Format
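
Stage (a) of the abstract, univariate testing with p-value adjustment, can be illustrated with a minimal sketch. This record does not say which test or which adjustment the authors used, so the two-sided Fisher exact test and the Benjamini-Hochberg procedure below are assumptions. The variant names and carrier counts are invented for illustration; only the cohort sizes (61 patients, 67 controls) come from the abstract.

```python
from math import comb

N_PATIENTS, N_CONTROLS = 61, 67  # cohort sizes stated in the abstract


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    total = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x carriers in the first row.
        return comb(row1, x) * comb(row2, col1 - x) / total

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum every table that is no more probable than the observed one.
    tail = sum(p for p in map(prob, range(lo, hi + 1)) if p <= p_obs * (1 + 1e-9))
    return min(1.0, tail)


def benjamini_hochberg(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical carrier counts: (carriers among patients, carriers among controls).
carriers = {
    "variant_A": (20, 2),
    "variant_B": (5, 6),
    "variant_C": (10, 3),
}

names = list(carriers)
raw = [
    fisher_exact_two_sided(p, N_PATIENTS - p, c, N_CONTROLS - c)
    for p, c in carriers.values()
]
adjusted = dict(zip(names, benjamini_hochberg(raw)))

for name, p_raw in zip(names, raw):
    print(f"{name}: raw p = {p_raw:.3g}, BH-adjusted p = {adjusted[name]:.3g}")
```

With 216 variants, as in the study, the adjustment step matters far more than in this three-variant toy case, because the number of tests scales every adjusted p-value.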