Large biomolecules composed of linear chains of amino acids linked by peptide bonds, folded into specific three-dimensional structures that enable them to function as enzymes, structural components, signaling molecules, transport carriers, immune effectors, and hormones. Each protein's sequence is encoded by a gene through the central dogma pathway (DNA→mRNA→protein), and proper function depends critically on post-translational folding into the correct conformation. Proteins constitute approximately 15-20% of body mass and are continuously synthesized and degraded (protein turnover), with dietary protein providing the amino acid substrate for this ongoing renewal.
Think of proteins as highly specialized workers in a massive factory (your body), where each worker is assembled from a specific instruction manual (gene). The instruction manual is stored in the vault (nucleus/DNA), photocopied (transcribed to mRNA), and sent to the assembly line (ribosomes). At the assembly line, individual building blocks (amino acids) are delivered by couriers (tRNA) and snapped together in the exact sequence specified by the photocopy. But here's the critical part: a freshly assembled worker comes off the line as a floppy, unfolded chain—completely useless. Quality control supervisors (chaperone proteins like HSP70 and HSP90) must fold this chain into the precise 3D shape required for the worker to do their job—whether that's cutting things (protease), carrying cargo (hemoglobin), sending signals (hormones), or fighting invaders (antibodies). If the folding goes wrong, the worker is either sent back for refolding or destroyed. When you eat protein, you're not directly importing workers—you're importing scrap metal (amino acids) from demolished workers in your food, which your factory recycles into the exact workers your body needs right now. No protein factory, no workers; no workers, no function—whether that's building muscle, making neurotransmitters, healing wounds, or mounting immune responses.
graph TD
A[Gene in DNA] -->|Transcription by RNA Polymerase| B[Pre-mRNA in nucleus]
B -->|Splicing removes introns| C[Mature mRNA]
C -->|Export to cytoplasm| D[mRNA binds to ribosome]
D -->|tRNA delivers amino acids| E["Translation: amino acid chain elongation"]
E -->|Ribosome release| F[Unfolded polypeptide chain]
F -->|HSP70 binding| G[Partially folded intermediate]
G -->|"HSP90 + cochaperones"| H[Native 3D structure]
H -->|Post-translational modifications| I[Functional protein]
I -->|Ubiquitination if misfolded| J[Proteasomal degradation]
style F fill:#ffcccc
style H fill:#ccffcc
style J fill:#ff9999
Transcription (nucleus): RNA polymerase II reads DNA coding strand (only 21,787 genes out of ~3 billion base pairs), synthesizes pre-mRNA with exons (protein-coding) and introns (non-coding). Splicing machinery removes introns, ligates exons → mature mRNA (typically 1-15 kb). Only 1-2% of human genome codes for proteins; 98% is regulatory elements, enhancers, intergenic sequences, repetitive DNA.
Translation (cytoplasm/ER): Mature mRNA binds to ribosome (40S + 60S subunits). Each three-nucleotide codon specifies one amino acid. tRNA molecules with anticodon sequences deliver matching amino acids to ribosomal A-site. Peptidyl transferase catalyzes peptide bond formation between growing chain and incoming amino acid. Average synthesis rate: 3-10 amino acids/second. Ribosome releases when stop codon (UAA, UAG, UGA) is encountered.
Protein Folding (ER/cytoplasm): Newly synthesized polypeptide emerges unfolded. HSP70 chaperones bind hydrophobic patches, prevent aggregation, guide initial folding. HSP90 + co-chaperones (Hop, p23, immunophilins) refine 3D structure. Disulfide bonds (cysteine-cysteine) stabilize structure in ER (catalyzed by protein disulfide isomerase). ER-associated degradation (ERAD) pathway targets misfolded proteins → ubiquitination → proteasomal destruction.
Post-Translational Modifications: Glycosylation (addition of sugar chains), phosphorylation (kinases add phosphate groups), acetylation, methylation, citrullination (arginine→citrulline by PAD enzymes—key in autoimmunity), ubiquitination (protein degradation signal). These modify function, stability, localization, and interactions.
Digestion:
- Stomach: Pepsin (activated from pepsinogen by HCl, pH 1.5-3.5) breaks large proteins → polypeptides
- Duodenum: Pancreatic proteases (trypsin, chymotrypsin, elastase, carboxypeptidase) activated by enterokinase → cleave polypeptides → oligopeptides + amino acids
- Brush border: Aminopeptidases complete digestion → di-/tripeptides + free amino acids
Absorption (jejunum):
- Free amino acids: Sodium-dependent transporters (multiple systems: neutral, acidic, basic amino acids)
- Di-/tripeptides: PEPT1 transporter (H+-coupled)
- Portal circulation → liver (first-pass metabolism)
Hepatic Processing:
- Transamination: Amino group transfer to α-ketoglutarate → glutamate (ALT, AST enzymes)
- Deamination: Amino groups → ammonia → urea cycle → urea excretion (kidney)
- Gluconeogenesis: Glucogenic amino acids (alanine, serine, glycine, etc.) → glucose (via pyruvate or TCA intermediates)
- Ketogenesis: Ketogenic amino acids (leucine, lysine) → acetoacetate, β-hydroxybutyrate
- Protein synthesis: Albumin (~12-15 g/day), acute phase proteins (CRP, SAA, complement), clotting factors (fibrinogen, prothrombin), transport proteins (transferrin, ceruloplasmin)
Whole-body protein synthesis: ~250-300 g/day in adults. Breakdown rate nearly matches synthesis (dynamic equilibrium). Net protein balance = synthesis - breakdown. Negative balance → muscle wasting, immune dysfunction, wound healing impairment.
Antibody production requires massive protein synthesis: single plasma cell produces ~2000 antibody molecules/second. Immunoglobulin structure: 4 polypeptide chains (2 heavy + 2 light), each requiring separate gene transcription, translation, folding, and assembly. Protein deficiency → reduced IgG, IgA, IgM → impaired adaptive immunity. Acute phase response consumes ~200-400 mg/kg/day additional protein (via IL-6 → STAT3 → hepatic acute phase gene transcription). Malnourished patients cannot mount adequate inflammatory resolution—insufficient substrate for specialized pro-resolving mediators (SPMs) synthesis pathways.
Skeletal muscle is the body's largest amino acid reservoir (~40% body mass in healthy adults). During infection/injury, muscle protein breakdown increases (via ubiquitin-proteasome pathway activated by TNF-α, IL-1β, cortisol) → amino acids released → hepatic acute phase protein synthesis + immune cell proliferation. This is metabolic reprioritization: selfish immune system commandeers muscle stores. Chronic inflammation → persistent protein catabolism → sarcopenia, cachexia. Intervention requires addressing root inflammatory drivers, not just protein supplementation (which feeds inflammatory fire if underlying triggers persist).
High protein intake (especially animal protein, cheese) generates acidic metabolites during amino acid catabolism: sulfuric acid (from cysteine, methionine), phosphoric acid (from phosphoproteins). Kidney excretes ~50-100 mEq H+/day. Chronic acidic load (PRAL >0) → bone mineral buffering (calcium carbonate, calcium phosphate released) → increased urinary calcium excretion → osteoporosis risk. Cheese has extremely high PRAL (+27 mEq/100g) due to high protein + phosphate, low potassium. Counterbalance with alkaline foods (vegetables, fruits: PRAL -2 to -4). Clinical Pearl: Calculate patient's dietary PRAL—if consistently >+10 mEq/day + low vegetable intake + family history of osteoporosis → intervention required.
Example from modules: Fish diet providing adequate protein (3500 g/day to meet energy needs in hypothetical scenario) would require 14 mg/day iron—at fish's typical iron content (~0.4 mg/100g), this is 3.5 kg fish. This illustrates micronutrient leverage: protein metabolism is iron-dependent (hemoglobin, myoglobin, cytochromes, iron-sulfur cluster enzymes). High protein without adequate iron → anemia, mitochondrial dysfunction, impaired collagen synthesis (prolyl hydroxylase requires iron).
- Steatorrhea (fat in stool) + muscle wasting → pancreatic insufficiency (check fecal elastase <200 μg/g)
- Low albumin (
.5 g/dL) + edema → hepatic dysfunction or protein-losing enteropathy
- Elevated fecal nitrogen → small intestine bacterial overgrowth (protein fermentation by bacteria)
- pH dysregulation: achlorhydria (low stomach acid) → inadequate pepsin activation → polypeptide malabsorption
Hunter-gatherer protein intake: ~30-35% calories (100-150 g/day from diverse sources). Modern Western diet: often >100 g/day from concentrated sources (dairy, meat) without compensatory alkaline plant foods. Additionally, modern proteins are often heat-processed (AGE formation), combined with refined carbs (insulin resistance), consumed in chronic caloric excess (mTOR overstimulation → autophagy suppression). Selfish Brain perspective: brain prioritizes glucose; in protein excess + sedentarism, amino acids shunted to gluconeogenesis rather than muscle synthesis—metabolic inefficiency.
- Human genome: 21,787 protein-coding genes (1-2% of total DNA); each gene codes for one primary protein (alternative splicing can create variants)
- 98% of genome is non-coding: regulatory elements, enhancers, introns, transposons, structural DNA
- Protein synthesis rate: 250-300 g/day whole-body turnover in healthy adults (nearly equals daily breakdown)
- Essential amino acids: 9 (histidine, isoleucine, leucine, lysine, methionine, phenylalanine, threonine, tryptophan, valine)—cannot be synthesized, must be dietary
- HSP70 and HSP90 chaperones prevent protein misfolding; HSP expression increases 2-5 fold during heat stress, infection, oxidative stress
- Albumin half-life: ~20 days; acute phase proteins (CRP, SAA): 6-12 hours—rapid turnover enables dynamic response
- Cheese PRAL: +27 mEq/100g (extremely acidogenic); fish PRAL: +7-8 mEq/100g; vegetables PRAL: -2 to -4 mEq/100g
- Iron requirement for high-protein fish diet (3.5 kg/day): 14 mg/day—illustrates micronutrient leverage concept
- Plasma cell antibody production: ~2000 IgG molecules/second (requires massive ribosomal capacity)
- Acute phase response protein cost: 200-400 mg/kg/day additional hepatic synthesis (IL-6 driven)
- Protein quality markers: PDCAAS (protein digestibility-corrected amino acid score); DIAAS (digestible indispensable amino acid score)—egg, whey, casein score ~1.0; plant proteins typically 0.5-0.8
- mTOR activation by leucine threshold: ~2-3 g/meal required for maximal muscle protein synthesis signaling
- amino acids — monomeric building blocks polymerized via peptide bonds to form protein chains; 20 standard types determine protein sequence and function
- genes — each gene's coding sequence (exons) specifies the amino acid sequence of one protein through transcription to mRNA
- DNA — double-helix repository containing genetic instructions; only 1-2% codes for proteins, 98% regulatory/structural
- mRNA — messenger RNA transcribed from DNA template, carries protein-coding information from nucleus to ribosome
- ribosomes — molecular machines composed of rRNA + ribosomal proteins that catalyze peptide bond formation during translation
- tRNA — transfer RNA molecules with anticodon and amino acid attachment site; deliver specific amino acids to ribosome during synthesis
- protein folding — critical post-translational process converting linear amino acid chain into functional 3D structure via hydrophobic collapse, disulfide bonds, chaperone assistance
- HSP70 — heat shock protein 70 family; binds nascent polypeptides, prevents aggregation, facilitates initial folding steps
- HSP90 — heat shock protein 90 family; refines protein structure, stabilizes intermediate conformations, prevents misfolding-induced aggregation
- endoplasmic reticulum — site of membrane/secreted protein synthesis and folding; ER stress response activated when misfolded proteins accumulate
- protein synthesis — anabolic process requiring ATP, GTP, amino acids, ribosomes, and tRNA; rate-limited by mTOR signaling and amino acid availability
- proteases — enzymes that cleave peptide bonds; include digestive enzymes (pepsin, trypsin), matrix metalloproteinases, caspases, proteasome components
- pancreatic enzymes — secreted into duodenum; include trypsin, chymotrypsin, elastase (protein digestion), lipase (fat), amylase (starch)
- liver — central protein synthesis organ producing 10-15 g/day albumin, acute phase proteins during inflammation, clotting factors, transport proteins; also site of amino acid catabolism and urea synthesis
- acute phase protein — hepatic proteins upregulated during inflammation (CRP, SAA, fibrinogen, complement); IL-6 → STAT3 → gene transcription; consume 200-400 mg/kg/day protein substrate
- immune system — requires protein for antibody synthesis (IgG ~150 kDa, 4 polypeptide chains), cytokine production, complement cascade, acute phase response
- muscle tissue — largest body protein reservoir (~40% body mass); provides amino acids during infection/stress via ubiquitin-proteasome breakdown; requires dietary protein + resistance exercise for maintenance
- iron — cofactor for hemoglobin, myoglobin, cytochromes, prolyl hydroxylase (collagen synthesis); limiting nutrient in high-protein diets (14 mg/day needed for 3.5 kg fish/day example)
- PRAL — Potential Renal Acid Load framework; calculates net acid/alkaline contribution from foods; high protein (especially cheese +27 mEq/100g) → acidic load → bone calcium buffering
- cheese — high PRAL (+27 mEq/100g), high protein (20-30 g/100g), high phosphate; chronic consumption without alkaline vegetables → bone demineralization risk
- IL-6 — pleiotropic cytokine driving hepatic acute phase protein synthesis; elevation >10 pg/mL sustained → muscle protein catabolism, metabolic reprioritization to immune function
- mTOR — mechanistic target of rapamycin; amino acid sensor (especially leucine >2-3 g/meal) → protein synthesis activation; chronic activation without fasting → autophagy suppression
- autophagy — cellular "recycling" process degrading misfolded proteins, damaged organelles; suppressed by excess protein/mTOR activation; required for cellular quality control
- gluconeogenesis — synthesis of glucose from non-carbohydrate precursors; glucogenic amino acids (alanine, serine, glutamine) → pyruvate/TCA intermediates → glucose; upregulated during fasting, stress
- collagen — most abundant body protein (~30% total); requires vitamin C, iron, copper for synthesis; made from glycine, proline, hydroxyproline in repeating tripeptide structure
- albumin — major serum protein (3.5-5.0 g/dL); maintains oncotic pressure, transports hormones, fatty acids, drugs; half-life ~20 days; low levels indicate hepatic dysfunction or protein-losing state
- muscle protein synthesis — anabolic process building myofibrillar proteins; requires leucine threshold (2-3 g), resistance exercise stimulus, adequate energy; opposed by cortisol, TNF-α, IL-1β during inflammation
- Module 1 — Genes, DNA, mRNA, protein synthesis fundamentals in evolutionary medicine context
- Module 2 — Iron as limiting nutrient in high-protein diets, protein quality and digestibility
- Module 3 — Neuroendocrine regulation of protein metabolism, stress-induced catabolism, hypothalamic nutrient sensing
- Module 6 — PRAL framework, acidic load from cheese and high protein, bone-protein interactions, connective tissue synthesis requirements
- Module 10 — Clinical nutrition assessment, protein malabsorption diagnostics, inflammatory protein catabolism, muscle-immune axis