Proteogenomics Integrating Novel Junction Peptide Identification Strategy Discovers Three Novel Protein Isoforms of Human NHSL1 and EEF1B2.

Abstract	In eukaryotes, alternative pre-mRNA splicing allows a single gene to encode different protein isoforms that function in many biological processes, and they are used as biomarkers or therapeutic targets for diseases. Although protein isoforms in the human genome are well annotated, we speculate that some low-abundance protein isoforms may still be under-annotated because most genes have a primary coding product and alternative protein isoforms tend to be under-expressed. A peptide coencoded by a novel exon and an annotated exon separated by an intron is known as a novel junction peptide. In the absence of known transcripts and homologous proteins, traditional whole-genome six-frame translation-based proteogenomics cannot identify novel junction peptides, and it cannot capture novel alternative splice sites. In this article, we first propose a strategy and tool for identifying novel junction peptides, called CJunction, which we then integrate into a proteogenomics process specifically designed for novel protein isoform discovery and apply to the analysis of a deep-coverage HeLa mass spectrometry data set with identifier PXD004452 in ProteomeXchange. We succeeded in identifying and validating three novel protein isoforms of two functionally important genes, NHSL1 (causative gene of Nance-Horan syndrome) and EEF1B2 (translation elongation factor), which validate our hypothesis. These novel protein isoforms have significant sequence differences from the annotated gene-coding products introduced by the novel N-terminal, suggesting that they may play importantly different functions.
Authors	Cuitong He, Jiangtao Guo, Wenmin Tian, Catherine C L Wong
Journal	Journal of proteome research (J Proteome Res) Vol. 20 Issue 12 Pg. 5294-5303 (12 03 2021) ISSN: 1535-3907 [Electronic] United States
PMID	34420305 (Publication Type: Journal Article, Research Support, Non-U.S. Gov't)
Chemical References	Guanine Nucleotide Exchange Factors NHSL1 protein, human Peptide Elongation Factor 1 Peptides Protein Isoforms Proteins eEF1B-beta protein, human
Topics	Alternative Splicing Genome, Human Guanine Nucleotide Exchange Factors (genetics, metabolism) Humans Mass Spectrometry Peptide Elongation Factor 1 (genetics, metabolism) Peptides (chemistry) Protein Isoforms (genetics, metabolism) Proteins (genetics, metabolism) Proteogenomics (methods)

Join CureHunter, for free Research Interface BASIC access!

Take advantage of free CureHunter research engine access to explore the best drug and treatment options for any disease. Find out why thousands of doctors, pharma researchers and patient activists around the world use CureHunter every day.

Realize the full power of the drug-disease research graph!