NeuroLINCS Proteomics: Defining human-derived iPSC proteomes and protein signatures of pluripotency
Dataset with its publication pending
- Organism: Homo sapiens
- Instrument: TripleTOF 5600+
iPSC, Proteomics, DIA, SWATH, LINCS, multi-omic
The National Institute of Health (NIH) Library of integrated network-based cellular signatures (LINCS) program is premised on generation of a publicly available data resource composed of cell-based biochemical responses or “signatures” to genetic or environmental perturbations. The NeuroLINCS center focuses on human induced pluripotent stem cells (iPSCs) derived from patients with motor neuron disease and the relevant differentiated neuronal cell cultures originating from patients and healthy controls. To establish a robust data generation process we strive to provide i) workflows for the generation of various multi-omic and functional assays for iPSC-derived motor and cortical neurons, ii) public annotated data sets and iii) relevant and targetable biological pathways of two motor neuron disorders, spinal muscular atrophy (SMA) and amyotrophic lateral sclerosis (ALS). Multi-omic assays are performed on aliquots of the same biological specimen and omic data integration analyses have been performed on both iPSCs and iPSC-derived motor neuron (iMN) cultures (unpublished results) Epigenomics, transcriptomics and proteomics data were collected for iPSC cultures originating from 12 individual human iPSC lines. Although iPSCs do not manifest a neurological disease state, they may be explored for asymptomatic biological defects that could ultimately be involved in disease in the context of neuronal tissues. Here we describe the biological diversity of the proteome among 12 individual genetic backgrounds and multiple cell growth replicates of iPSCs. By first defining aspects of non-disease specific biological and technical variability, true disease specific signatures of each omic assay may be extracted with greater clarity. These experiments are part of the transition of discovery proteomics to large data sets consisting of genetically diverse specimens for whole proteome analyses. Through the NIH LINCS program, the multi-omic data sets generated by NeuroLINCS are a public resource provided in tiers of data levels to enable broad applicability throughout the scientific community. From the DIA-MS proteomic analysis of iPSCs originating from 6 cell lines of district genetic backgrounds, 39 proteins recognized iPSC or embryonic stem cell markers were reproducibly quantified with consistent expression levels across all lines analyzed. Another set of consistently quantified proteins extracted from the analyses are more diverse, with little or no known direct associations to iPSCs. Interestingly, some are associated with cancer in the literature, not surprising in that malignancies are known to transition to a less differentiated, more pluripotent biological state to enable increased cell division and metastasis. Possibly even more interesting, as it is less well studied than cancer is the appearance of iPSC proteins quantified that are associated with oocyte health and fertility. Another sub-category of iPSC quantified proteins maybe considered typical housekeeping proteins of cellular metabolism, cell division and cell structure and morphology though as a public resource to be compared with other human tissue or organ cell types many allow the level of expression or isoform specificity to be unique of iPSCs. The data quality assessments and metadata provided make these DIA-MS analyses of 6 cell lines as iPSCs a rich public resource.
Analysis of 6 human cell lines as inducible pluripotent stem cells (iPSCs) for the analysis of protein variability in human iPSCs by DIA-MS.
Fibroblasts from ALS, SMA and control donors were reprogrammed into iPSCs. Cell pellets were lysed in 2% SDS, 0.1% TCEP and sonicated for 30 minutes at 70 amp, 10 second on/off pulses (QSonica Q800R) before transfer to 30 kD MWCO filters according to the FASP sample processing protocol.40. SDS was removed by buffer exchange with 8 M urea in Tris, pH 8 and samples were alkylated using iodoacetamide. Protein digestion was performed in 50 mM NH4HCO3, pH 8, with Trypsin/LysC mix (Promega) overnight while shaking at 37°C. Digested sample was desalted and cleaned for mass spectral analysis using MCX (Waters) and resulting samples were dried and reconstituted using in 0.1% FA H20. Liquid Chromatography retention time standards (Biognosys) were added to each sample before analysis by mass spectrometry.
Created on 9/15/20, 10:37 AM