Segment-Weighting Similarity-Based Fragment-Learning Model for Single-Cell Raman Spectral Analysis


Yi L. Ye Q. Wang C. Hu Q. Zhang S. Shen X. Liang M. Li G. Dmitriy K. Guo Y. Yu Q. Hu B.
10 June 2025American Chemical Society

Analytical Chemistry
2025#97Issue 2211734 - 11744 pp.

Raman spectroscopy provides intrinsic biochemical profiles of all cellular biomolecules in a segmented manner, promising nondestructive and label-free phenotyping at the single-cell level. However, current analytical methods rarely utilize spectral biological characteristics and their fusion with data characteristics, limiting the application of these methods to biological Raman spectroscopy. Herein, a segment-weighting similarity-based fragment-learning (SWS-FL) model, integrating SWS-based feature extraction and fusion learning, is proposed to fuse biological and data characteristics for single-cell spectral analysis, which segments spectra into fragments and differentiates their biological characteristics for fusing feature matrices. The SWS-based feature extraction fabricates a group of low-dimensional feature vectors at multiple N values, providing a more distinguishable feature space compared to conventional KNN. The weights of five fragments, including the fingerprint region, protein I region, mixed region, protein II region, and genetic material region, are assigned as 0.282, 0.302, 0.273, 0.276, and 0.239, respectively, which highlights the spectral biological characteristics. The fusion learning process synthesizes characteristics from all spectral fragments using an ANN, achieving accuracy with only 0.5% variation across N values from 1 to 30, greatly enhancing the robustness of the model. In the five-classification task of breast cancer cells and their subtypes, the accuracy and kappa coefficient of SWS-FL can reach 94.9% and 0.943%, respectively, which are 5% and 7% higher than those of ANN. The generalization capability is also validated on the data set of lung cancer cells and their subtypes. This model provides a new path for the fusion of biological and data characteristics in spectral analysis and promises to be a powerful analytical framework in more spectroscopic areas.



Text of the article Перейти на текст статьи

School of Life Science and Technology, Xidian University, Shaanxi, Xi’an, 710126, China
School of Computer Science and Technology, Xidian University, Shaanxi, Xi’an, 710126, China
School of Mathematics and Physics Science and Engineering, Hebei University of Engineering, Hebei, Handan, 056038, China
Institute of Life Sciences, Karaganda Medical University, Karaganda, 100008, Kazakhstan
Hangzhou Institute of Technology, Xidian University, Zhejiang, Hangzhou, 311231, China
Xi’an Intelligent Precision Diagnosis and Treatment International Science and Technology Cooperation Base, Shaanxi, Xi’an, 710126, China

School of Life Science and Technology
School of Computer Science and Technology
School of Mathematics and Physics Science and Engineering
Institute of Life Sciences
Hangzhou Institute of Technology
Xi’an Intelligent Precision Diagnosis and Treatment International Science and Technology Cooperation Base

10 лет помогаем публиковать статьи Международный издатель

Книга Публикация научной статьи Волощук 2026 Book Publication of a scientific article 2026