An Integrated Machine Learning and Genomic Framework for Precise Detection of Gastric Cancer

Iman, Eshmal and Jabbar, Sohail and Ramzan, Shabana and Raza, Ali and Raoof, Farwa and Carvajal-Altamiranda, Stefanía and Lipari, Vivian and Ashraf, Imran UNSPECIFIED, UNSPECIFIED, UNSPECIFIED, UNSPECIFIED, UNSPECIFIED, stefania.carvajal@uneatlantico.es, vivian.lipari@uneatlantico.es, UNSPECIFIED (2026) An Integrated Machine Learning and Genomic Framework for Precise Detection of Gastric Cancer. The American Journal of Pathology. ISSN 00029440

[img] Text
PIIS0002944026001367.pdf

Download (1MB)

Abstract

This study presents a novel integrative approach for the analysis of high-dimensional gene expression data, leveraging the complementary strengths of unsupervised clustering and supervised classification. Using K-means clustering, the dataset is stratified into three distinct clusters, revealing intrinsic biological patterns and relationships. The resulting cluster assignments are subsequently employed as pseudo-labels to train machine learning models, including support vector machines, random forest, and a stacking ensemble classifier. To validate and enhance the robustness of clustering, complementary methodologies such as hierarchical clustering and DBSCAN are employed, with results visualized through PCA-driven dimensionality reduction. The high predictive accuracy achieved by the classifiers underscores the separability and reliability of the identified clusters. Furthermore, feature importance analysis highlighted key genetic determinants within each cluster, offering actionable insights into potential biomarkers and critical genomic features. This framework bridges the gap between exploratory unsupervised learning and predictive supervised modeling, providing a scalable and interpretable methodology for analyzing complex genomic datasets. Its applicability extends to biomarker discovery, patient stratification, and other precision medicine applications, emphasizing its utility in advancing genomic research and clinical practice.

Item Type: Article
Uncontrolled Keywords: Gastric cancer histological images k-means clustering unsupervised learning convolutional neural networks image processing
Subjects: Subjects > Biomedicine
Subjects > Engineering
Divisions: Europe University of Atlantic > Research > Scientific Production
Fundación Universitaria Internacional de Colombia > Research > Scientific Production
Ibero-american International University > Research > Scientific Production
Universidad Internacional do Cuanza > Research > Scientific Production
University of La Romana > Research > Scientific Production
Depositing User: Sr Bibliotecario
Date Deposited: 15 May 2026 08:43
Last Modified: 15 May 2026 08:43
URI: http://repositorio.funiber.org/id/eprint/28577

Actions (login required)

View Item View Item