Now I have rich, well-sourced material. Here is the complete explanation of chromosomal linkage theory:
Chromosomal Linkage Theory
1. Historical Background and the Mendelian Exception
Mendel's Law of Independent Assortment states that alleles at different loci segregate independently during gamete formation. However, this law holds only when the loci in question are on different chromosomes (or far apart on the same chromosome). Thomas Hunt Morgan, working with Drosophila melanogaster (fruit flies) in the early 20th century, discovered that certain pairs of traits were consistently inherited together - violating independent assortment. He concluded that these traits were controlled by genes physically located on the same chromosome, forming a "linkage group."
"Not all genes assort independently of each other. This variation of the Mendelian principle was initially identified by Thomas Morgan through analysis of transmission of selected traits in fruit flies." - Shambaugh Surgery of the Ear
2. The Core Concept: Linkage
Linkage is the tendency for alleles at loci that are close together on the same chromosome to be transmitted together as an intact unit through meiosis. More precisely:
- Two loci are linked when they show a departure from independent assortment
- The degree of linkage depends on how physically close the two loci are on the chromosome
- Loci on the same chromosome are called syntenic
"Linkage is the term used to describe a departure from the independent assortment of two loci, or, in other words, the tendency for alleles at loci that are close together on the same chromosome to be transmitted together, as an intact unit, through meiosis." - Thompson & Thompson Genetics and Genomics in Medicine, 9th ed.
3. Crossing Over and Recombination
Linkage is not absolute. During meiosis I, homologous chromosomes pair up and can exchange segments through a process called crossing over (homologous recombination). This creates new combinations of alleles (recombinants) not present in the original parental chromosomes.
- Crossing over = physical exchange of chromosomal segments between homologous chromosomes during prophase I of meiosis
- The resulting offspring carrying non-parental allele combinations are called recombinants
- The frequency of recombination between two loci depends on how far apart they are - the farther apart, the more likely a crossover will occur between them
"This new combination of alleles was considered to result from crossing over and exchange of genetic material between two homologous chromosomes, known as homologous recombination, yielding the new combination of alleles not present in the original parental chromosomes." - Shambaugh Surgery of the Ear
4. Recombination Frequency (θ) and Genetic Distance
Recombination frequency (θ) is the proportion of offspring that are recombinants. It serves as a measure of the distance between two loci:
| θ value | Meaning |
|---|
| θ = 0 | Complete linkage - loci always inherited together |
| 0 < θ < 0.5 | Partial linkage - intermediate distance |
| θ = 0.5 | No linkage - independent assortment (loci either on different chromosomes, or far apart) |
Key rule: θ can never exceed 0.5, because even if loci are very far apart, at least one crossover will always occur, making recombinants and non-recombinants equally frequent (50:50).
5. Genetic Maps and Centimorgans (cM)
The centimorgan (cM) is the unit of genetic distance, named after T.H. Morgan:
- 1 cM = the genetic length over which, on average, one crossover occurs in 1% of meioses
- A recombination fraction of 1% (θ = 0.01) translates approximately into 1 cM
- As a rough average, 1 cM ≈ 1 megabase (Mb) of DNA
"The map distance between two loci is a theoretical concept that is based on actual data - the extent of observed recombination, θ, between the loci. Map distance is measured in units called centimorgans (cM), defined as the genetic length over which, on average, one crossover occurs in 1% of meioses." - Thompson & Thompson Genetics, 9th ed.
An important nuance: The relationship between physical distance (base pairs) and genetic distance (cM) is not uniform across the genome. So-called recombination hot spots - small regions occupying only ~6% of the genome sequence - account for ~60% of all meiotic recombination. Between hot spots are large stretches with very little recombination.
Sex differences in recombination: The total genetic length of the human genome measured in female meiosis (~4596 cM) is about 60% greater than in male meiosis (~2868 cM), with an overall sex-averaged length of ~3790 cM.
6. Linkage Analysis in Medicine - LOD Scores
Linkage analysis is a classical method for mapping disease genes, particularly for Mendelian (single-gene) disorders. It is applied to families with a known inheritance pattern (autosomal dominant, autosomal recessive, or X-linked).
The key statistical tool is the LOD score (Log of the Odds score):
- LOD score = log₁₀ of the likelihood that the marker and disease locus are linked at a given θ, divided by the likelihood they are unlinked (θ = 0.5)
- LOD ≥ 3 = strong evidence for linkage (odds of 1000:1 in favor)
- LOD ≤ -2 = evidence against linkage (effectively rules it out)
The LOD score provides two pieces of information:
- The best estimate of recombination frequency (θ_max) between marker and disease locus
- The strength of evidence for linkage at that θ_max
"Linkage analysis is used when there is a particular mode of inheritance (autosomal dominant, autosomal recessive, or X-linked) that explains the inheritance pattern. LOD score analysis allows mapping of genes with variants for phenotypes that follow Mendelian inheritance." - Thompson & Thompson Genetics, 9th ed.
7. Linkage Disequilibrium (LD)
Linkage disequilibrium is a related but distinct concept. When two alleles at different loci are in LD, they occur together on the same haplotype more (or less) frequently than would be predicted from their individual allele frequencies alone.
- Linkage equilibrium: haplotype frequency = product of individual allele frequencies (freq(A-S) = freq(A) × freq(S))
- Linkage disequilibrium: haplotype frequency deviates from that expected product
- LD occurs when two loci are very close together (typically <0.1 cM) and insufficient time has passed for random recombination to break up the original associations
- LD is the basis for GWAS (genome-wide association studies) - identifying disease-associated variants via their correlated markers
"It is generally the case that the alleles at two loci will not show any preferred phase in the population if the loci are linked, but at a distance of 0.1 to 1 cM or more." - Thompson & Thompson Genetics, 9th ed.
8. Haplotypes and Linkage Groups
A haplotype is a set of alleles at closely spaced loci that tend to be inherited together as a block due to LD. Classic examples include HLA haplotypes (e.g., A1-Cw7-B8-DR17-DQ2 in Caucasians, occurring ~4x more frequently than expected by chance).
A linkage group = all the genes on a single chromosome. Humans have 23 linkage groups (22 autosomes + the X chromosome).
9. Clinical Applications
| Application | How linkage theory is used |
|---|
| Presymptomatic diagnosis | Using linked polymorphic markers to predict carrier/disease status when a direct mutation test is unavailable |
| Prenatal testing | Tracking disease alleles through families via linked markers |
| Disease gene identification | LOD score analysis to map unknown disease genes to chromosomal regions |
| GWAS | Exploiting LD patterns to find common variant associations with complex diseases |
| Genetic counseling | Estimating recurrence risk based on chromosomal location and recombination risk |
Summary: Key Relationships
Physical proximity on chromosome
↓
Lower recombination frequency (θ closer to 0)
↓
Stronger linkage (higher LOD scores)
↓
Greater linkage disequilibrium
↓
More stable haplotype blocks
Sources:
- Thompson & Thompson Genetics and Genomics in Medicine, 9th ed. (pp. 224-228)
- Shambaugh Surgery of the Ear (pp. 162-163)
- Bradley and Daroff's Neurology in Clinical Practice (Linkage Analysis section)