Table of Contents |
guest 2025-07-13 |
Short write-up
Machine Learning Reveals Protein Signatures in CSF and Plasma Fluids of Clinical Value for ALS
Michael S. Bereman1-3*,Joshua Beri2, Jeffrey R. Enders3, and Tara Nash3
1Department of Biological Sciences, North Carolina State University, Raleigh, NC 27695
2Department of Chemistry, North Carolina State University, Raleigh, NC 27695
3Center for Human Health and the Environment, North Carolina State University, Raleigh, NC 27695
*Author for Correspondence
Michael S. Bereman, Ph.D.
Department of Biological Sciences
Center for Human Health and the Environment
North Carolina State University
Raleigh, NC
Phone: 919.515.8520
Email: michaelbereman@ncsu.edu
Abstract
We use shotgun proteomics to identify biomarkers of diagnostic and prognostic value in individuals diagnosed with amyotrophic lateral sclerosis. Matched cerebrospinal and plasma fluids were subjected to abundant protein depletion and analyzed by nano-flow liquid chromatography high resolution tandem mass spectrometry. Label free quantitation was used to identify differential proteins between individuals with ALS (n=33) and healthy controls (n=30) in both fluids. In CSF, 118 (p-value<0.05) and 27 proteins (q-value<0.05) were identified as significantly altered between ALS and controls. In plasma, 20 (p-value< 0.05) and 0 (q-value<0.05) proteins were identified as significantly altered between ALS and controls. Proteins involved in complement activation, acute phase response and retinoid signaling pathways were significantly enriched in the CSF from ALS patients. Subsequently various machine learning methods were evaluated for disease classification using a repeated Monte Carlo cross-validation approach. A linear discriminant analysis model achieved a median area under the receiver operating characteristic curve of 0.94 with an interquartile range of 0.88-1.0. Three proteins composed a prognostic model (p=5e-4) that explained 49% of the variation in the ALS-FRS scores. Finally we validated the specificity of two promising proteins from our discovery data set, chitinase-3 like 1 protein and alpha-1-antichymotrypsin, using targeted proteomics in a separate set of CSF samples derived from individuals diagnosed with ALS (n=15) and other neurodegenerative diseases (n=15). These results demonstrate the potential of a panel of targeted proteins for objective measurements of clinical value in ALS.
The peptide mixture standard consists of 6 peptides that were specifically synthesized to cover a wide range of hydrophobicities (Grand Average Hydropathy scores -0.6-1.9). Within each peptide sequence, a combination of stable isotope labeled amino acids (13C and 15N) were inserted to create 5 isotopologues. These 5 peptide isotopologues, based on amino acid analysis, span 4 orders of magnitude in concentration within each distinct sequence. One can assess numerous fundamental analytical figures of merit as a function of intra-scan dynamic range across the gradient within a single injection.
Below, you may download several Skyline template files for the peptide reference mixture. The files designated MS1/MS2 filtering will extract only precursors and product ions, respectively. The largest file will extract both.
Please direct all inquiries to Michael Bereman at msberema@ncsu.edu.