Overview: N-terminal and C-terminal
Protein function depends on a specific three-dimensional structure, which begins with the linear arrangement of amino acid chains. In this chain-like molecule, the two ends—the amino terminus (N-terminus) and the carboxyl terminus (C-terminus)—are not only the starting and ending points of protein synthesis but also key functional sites regulating protein fate. Understanding the characteristics of the N-terminus and C-terminus is of great significance for revealing the mechanisms of protein function and disease.
I. Chemical Nature of N-terminus and C-terminus
Proteins are formed by the linear connection of amino acids through peptide bonds. Each amino acid contains an amino group (-NH₂) and a carboxyl group (-COOH). During peptide chain synthesis, the amino group participates in the condensation reaction as the starting point, forming a peptide bond, and the end retaining the free amino group is called the N-terminus; the carboxyl group serves as the endpoint, and the end retaining the free carboxyl group is the C-terminus. This directionality (N→C) is strictly determined by the ribosomal translation process and runs through the entire life cycle of the protein:
-
N-terminus (Amino terminus): The starting end of the chain, containing a free amino group;
-
C-terminus (Carboxyl terminus): The ending end of the chain, containing a free carboxyl group.
This directionality is not only a hallmark of the primary structure of proteins but also directly affects the formation of higher-level structures. For example, the orientation of α-helices and β-sheets and domain assembly are closely related to the spatial arrangement of the N-terminus and C-terminus.
II. Division and Collaboration of Biological Functions
1. Core Role of N-terminus
(1) Starting Point of Protein Synthesis
The N-terminus is the anchoring point for translation initiation, and its sequence characteristics (such as the start codon) directly affect translation efficiency and accuracy.
(2) Core Carrier of Localization Signals
Many proteins have signal peptide sequences at the N-terminus that guide them to specific organelles such as the endoplasmic reticulum and mitochondria. For example, secretory proteins enter the secretory pathway through N-terminal signal peptides.
(3) Regulation Site for Post-translational Modifications
N-terminal acetylation is one of the most common covalent modifications, affecting protein stability, subcellular localization, and interaction networks.
2. Functional Diversity of C-terminus
(1) Contributor to Structural Stability
The C-terminus often participates in the formation of the hydrophobic core or interacts with adjacent domains to maintain the overall conformation of the protein.
(2) Integration Platform for Functional Modules
The C-terminus of some proteins carries enzymatic activity centers, binding sites, or regulatory domains, such as the C-terminus of G protein-coupled receptors (GPCRs) involved in signal transduction.
(3) Storage Repository for Degradation Signals
Certain C-terminal sequences (like degradation tags) can be recognized by the ubiquitin-proteasome system, regulating the turnover rate of proteins.
3. Sequencing Methods for N-terminus and C-terminus
(1) N-terminal Sequencing
The classic method for N-terminal sequencing is Edman degradation, which uses phenyl isothiocyanate (PITC) to react with the N-terminal amino acid to form PTH-amino acid derivatives, then sequentially removes the N-terminal residue and detects its type. This method has high accuracy and can provide single amino acid resolution sequence information. However, Edman degradation is limited by N-terminal modifications, such as acetylation or formylation, which prevent direct measurement. Additionally, this method is suitable for short peptides and high-purity proteins, but its application in complex proteomics research is limited. Therefore, modern N-terminal sequencing primarily relies on mass spectrometry analysis, especially LC-MS/MS (liquid chromatography-tandem mass spectrometry) technology. By using specific proteases (such as Lys-N) to selectively cleave peptide bonds near the N-terminus, combined with high-resolution mass spectrometry, it is possible to simultaneously resolve N-terminal sequences and their post-translational modifications. Top-down MS (whole protein mass spectrometry) methods are also used for N-terminal sequencing, which do not require protein digestion and can directly detect intact proteins and analyze N-terminal structure through fragmentation patterns. These methods overcome the limitations of Edman degradation and improve the throughput and sensitivity of N-terminal sequencing.
(2) C-terminal Sequencing
Compared to the N-terminus, C-terminal sequencing is more challenging. Due to the lack of efficient chemical degradation strategies like Edman degradation, C-terminal sequencing mainly relies on carboxypeptidase degradation and mass spectrometry techniques. Carboxypeptidase degradation uses specific carboxypeptidases (such as Carboxypeptidase A/B/Y) to gradually hydrolyze amino acids from the C-terminus and analyze the released amino acid residues through chromatography or mass spectrometry to infer the C-terminal sequence. However, this method is limited by the specificity of carboxypeptidases and cannot handle proteins with C-terminal modifications (such as amidation or glycosylation). Furthermore, the stepwise degradation process is slow, resulting in low sequencing efficiency, so its application in high-throughput proteomics research is limited. Modern C-terminal sequencing tends to use LC-MS/MS, digesting proteins with specific C-terminal proteases (such as Asp-N, Glu-C) and detecting C-terminal peptides through mass spectrometry. The application of Top-down MS in C-terminal sequencing is also increasing, allowing direct detection of intact protein C-terminal sequences and analysis of C-terminal modification information. Additionally, the Middle-down MS strategy retains longer C-terminal fragments through partial digestion, increasing the coverage of C-terminal sequences and overcoming the potential loss of C-terminal fragments in Bottom-up MS strategies.
The N-terminus and C-terminus, as the "molecular ID" of proteins, not only define their chemical nature but also participate in the precise regulation of life activities through dynamic modifications and interactions. With advancements in proteomics technology, research on terminal functions will continue to drive breakthroughs in disease diagnosis, drug development, and synthetic biology. With a professional technical team and seven quality control testing platforms,Biotech Pack BioTechprovides high-quality protein N/C-terminal sequencing services to researchers engaged in proteomics research, receiving widespread recognition.
Biotech Pack BioTech—Characterization of Biological Products, High-quality Service Provider for Multi-omics Mass Spectrometry Analysis
Related Services:
How to order?






