Protein Primary Structure
What is Protein Primary Structure?
The primary structure of a protein is the linear sequence of amino acids in a polypeptide chain, determined by the genetic code in mRNA. It dictates higher-level structures and ultimately the protein's function.
1. Methods for Predicting Primary Structure
Since the primary structure is just the amino acid sequence, its prediction mainly involves determining the sequence from DNA or protein databases.
A. Direct Sequencing Methods
- Edman Degradation – Identifies amino acids sequentially from the N-terminus.
- Mass Spectrometry (MS) – Determines peptide sequences using mass-to-charge ratios.
B. Computational Methods (Database-Based)
- BLASTp (Protein BLAST) – Finds similar protein sequences in databases.
- Expasy Translate Tool – Converts DNA sequences into amino acids.
- UniProt & NCBI Databases – Stores known protein sequences.
- Gene Prediction Tools (GENSCAN, Augustus) – Predicts protein-coding sequences from DNA.
2. Steps in Primary Structure Prediction
Step 1: Obtain DNA or
mRNA sequence (from sequencing or genome databases).
Step 2: Translate the sequence using tools like Expasy
Translate or in silico methods.
Step 3: Compare to known proteins using BLASTp or UniProt
to confirm identity.
Step 4: Validate using mass spectrometry or sequencing
techniques if required.
3. Applications of Primary Structure Prediction
✔ Identifying mutations &
disease-related changes
✔ Comparative genomics &
evolutionary studies
✔ Developing protein-based drugs
✔ Predicting post-translational
modifications (PTMs)