Exploratory integration of near-infrared spectroscopy with clinical data: a machine learning approach for HCV detection in serum samples

dc.affiliation.dptoEspecialidades Medicas y Salud Publica
dc.contributor.authorPerez Gómez , Eloy
dc.contributor.authorGómez, José
dc.contributor.authorGonzalo, Jennifer
dc.contributor.authorSalgüero, Sergio
dc.contributor.authorRiado, Daniel
dc.contributor.authorCasas, Maria Luisa
dc.contributor.authorGutiérrez , Maria Luisa
dc.contributor.authorJaime, Elena
dc.contributor.authorPerez Martinez, Enrique
dc.contributor.authorGarcia Carretero, Rafael
dc.contributor.authorRamos, Javier
dc.contributor.authorFernández-Rodríguez, Conrado
dc.contributor.authorCatalá, Myriam
dc.contributor.authorMartino, Luca
dc.contributor.authorBarquero Perez, Oscar
dc.contributor.funderThis work has been partially funded by Grant PID2022-136887NB-I00 funded by MCIN AEI 10.13039 501100011033.
dc.date.accessioned2025-12-22T08:20:46Z
dc.date.issued2025-06-09
dc.descriptionThis repository presents a machine learning approach for improving Hepatitis C Virus (HCV) detection by integrating Near-Infrared Spectroscopy (NIRS) of serum samples with routine clinical biomarkers. The workflow leverages L1-regularized models for spectral feature selection and Random Forest classifiers for multimodal prediction, enabling a rapid, non-invasive, and resource-efficient diagnostic strategy with potential applications in large-scale screening programs.
dc.description.abstractBackground: Managing chronic viral infections like Hepatitis C virus (HCV) often requires expensive healthcare resources and highly qualified personnel, making e cient diagnostic methods essential. Despite remarkable therapeutic advancements for the treatment of HCV, several challenges remain, such as improved fast diagnostic procedures allowing universal screening. Objective: We propose a novel approach that combines Near-Infrared Spectroscopy (NIRS) and clinical data with machine learning (ML) to improve Hepatitis C Virus (HCV) detection in serum samples. Methods: NIRS offers a fast, non-destructive, and residue-free alternative to traditional diagnostic methods, while ML models enable feature selection and predictive analysis. We applied L1-regularized Logistic Regression (L1-LR) to identify the most informative wavelengths for HCV detection within the 1,000– 2,500 nm range, and then integrated these spectral features with routine clinical markers using a Random Forest (RF) model. Our dataset comprised 137 serum samples from38 patients, each represented by a NIRS spectrumand clinical data from blood tests. Results: After preprocessing with Standard Normal Variate (SNV) correction and downsampling, the best-performing RF model, which combined NIRS features and clinical data, achieved an accuracy of 72.2% and an AUC-ROC of 0.850, outperforming models using only clinical or spectral data. Feature importance analysis highlighted specific wavelengths near 1,150 nm, 1,410 nm, and 1,927 nm, associated with water molecular states and liver function biomarkers (GPT, GOT, GGT), reinforcing the biological relevance of this approach
dc.description.sponsorshipUniversity Rey Juan Carlos. This work has been partially funded by Grant PID2022-136887NB-I00 funded by MCIN AEI 10.13039 501100011033.
dc.identifier.citationPérez-Gómez E, Gómez J, Gonzalo J, Salgüero S, Riado D, Casas ML, Gutiérrez ML, Jaime E, Pérez-Martínez E, García-Carretero R, Ramos J, Fernández-Rodríguez C, Catalá M, Martino L, Barquero-Pérez Ó. Exploratory integration of near-infrared spectroscopy with clinical data: a machine learning approach for HCV detection in serum samples. Front Med (Lausanne). 2025 Jun 9;12:1596476. doi: 10.3389/fmed.2025.1596476. PMID: 40552174; PMCID: PMC12183225.
dc.identifier.doi10.3389/fmed.2025.1596476
dc.identifier.publicationfirstpage1596476
dc.identifier.publicationtitleExploratory integration of near-infrared spectroscopy with clinical data: a machine learning approach for HCV detection in serum samples
dc.identifier.publicationvolume12
dc.identifier.urihttps://hdl.handle.net/10115/135257
dc.language.isoen
dc.publisherFrontiers in Medicine
dc.relation.projectNameHCV Detection Using NIRS and Machine Learning
dc.rightsAttribution 4.0 Internationalen
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectHCV
dc.subjectHepatitis C
dc.subjectNIRS
dc.subjectmachine learning
dc.subjectpermutation feature importance
dc.titleExploratory integration of near-infrared spectroscopy with clinical data: a machine learning approach for HCV detection in serum samples
dc.title.alternativeA Non-Invasive Machine Learning Approach for Hepatitis C Virus Detection Based on Near-Infrared Spectroscopy
dc.typeArticle
dc.type.hasVersionhttp://purl.org/coar/version/c_970fb48d4fbd8a85

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Name:
Frontiers.pdf
Size:
545.1 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Name:
license.txt
Size:
2.96 KB
Format:
Item-specific license agreed upon to submission
Description: