Molecular Basis of Inheritance
DNA structure and packaging, the search for the genetic material, DNA replication, transcription, the genetic code, translation, gene regulation and the Human Genome Project
The Genetic Material, Its Structure and Replication
DNA Structure, Packaging and the Search for the Genetic MaterialTopic 1
DNA (deoxyribonucleic acid) is the genetic material of most organisms. It is a long polymer of nucleotides, each made of a deoxyribose sugar, a phosphate and a nitrogenous base — the purines adenine (A) and guanine (G) and the pyrimidines cytosine (C) and thymine (T). The Watson–Crick double-helix model (1953) describes DNA as two antiparallel strands (one running 5'→3', the other 3'→5') coiled into a right-handed helix, with the sugar-phosphate backbones outside and the bases paired inside.
Base pairing is specific and complementary: A pairs with T by two hydrogen bonds and G pairs with C by three hydrogen bonds. This matches Chargaff's rule that the amount of A = T and G = C. The helix has about 10 base pairs per turn, a pitch (length of one turn) of 3.4 nm and a rise of 0.34 nm per base pair.
A human cell holds about 2 metres of DNA, so it must be tightly packaged. The negatively charged DNA wraps around a core of positively charged proteins called histones. About 200 base pairs of DNA wound around a histone octamer form a nucleosome, the basic repeating unit, giving a 'beads-on-a-string' look; nucleosomes coil further into chromatin and chromosomes. Loosely-packed, transcriptionally active chromatin is euchromatin; densely-packed, inactive chromatin is heterochromatin.
How did we learn DNA is the genetic material? Griffith (1928) found a 'transforming principle' in Streptococcus pneumoniae (his transformation experiment with R and S strains). Avery, MacLeod and McCarty identified that transforming principle as DNA. Finally, Hershey and Chase (1952) used bacteriophages labelled with radioactive ³²P (DNA) and ³⁵S (protein) and showed that only the ³²P (DNA) entered the bacteria — conclusively proving DNA is the genetic material. (RNA was likely the first genetic material in evolution, but DNA, being double-stranded and chemically more stable, took over.) For NEET, fix the helix details, base-pairing/H-bonds, Chargaff's rule, the nucleosome, and the three classic experiments.
| Feature | Detail |
|---|---|
| Base pairs | A=T (2 H-bonds), G≡C (3 H-bonds) |
| Helix | antiparallel; 10 bp/turn; pitch 3.4 nm |
| Nucleosome | ~200 bp + histone octamer |
| Euchromatin / heterochromatin | loose, active / dense, inactive |
| Hershey–Chase | ³²P (DNA) enters bacteria → DNA is genetic material |
If a DNA segment has 22% adenine, what percentage of guanine does it contain?
Show solution
By Chargaff's rule, A = T, so T = 22%, and A + T = 44%. The rest is G + C = 56%, and G = C, so guanine = 28%.
How did Hershey and Chase show that DNA, not protein, is the genetic material?
Show solution
They labelled phage DNA with ³²P and phage protein with ³⁵S. After infection, only the ³²P (DNA) was found inside the bacteria, while the ³⁵S (protein coat) stayed outside — proving DNA is the genetic material.
Adenine pairs with thymine by how many hydrogen bonds?
According to Chargaff's rule:
The basic packaging unit of DNA with histones is the:
The Hershey–Chase experiment used:
Transcriptionally active, loosely-packed chromatin is:
NEET tip: DNA: antiparallel helix, A=T (2 H-bonds), G≡C (3 H-bonds), Chargaff A=T & G=C, 10 bp/turn, pitch 3.4 nm. Nucleosome = ~200 bp + histone octamer; euchromatin (active) vs heterochromatin (inactive). Genetic material: Griffith (transformation) → Avery et al. (DNA) → Hershey–Chase (³²P proves DNA).
DNA ReplicationTopic 2
For genetic information to pass to daughter cells, DNA must be copied (replicated) before cell division. The Watson–Crick model itself suggested how: because the two strands are complementary, each can serve as a template to make the other. This gives semiconservative replication — each new DNA molecule keeps one old (parental) strand and one newly synthesised strand.
This was proved by the classic experiment of Meselson and Stahl (1958) on Escherichia coli. They grew bacteria in medium with heavy nitrogen (¹⁵N) so the DNA was heavy, then shifted them to normal nitrogen (¹⁴N). After one generation the DNA was of intermediate (hybrid) density (one heavy + one light strand), and after two generations there were equal amounts of hybrid and light DNA — exactly the pattern predicted by semiconservative replication.
Mechanistically, replication begins at a specific site, the origin of replication (ori). The enzyme helicase unwinds the helix to form a replication fork, and the main enzyme DNA polymerase adds nucleotides to build the new strands. A crucial constraint is that DNA polymerase can only synthesise in the 5'→3' direction.
Because the two template strands are antiparallel, this directionality means one new strand — the leading strand — is made continuously toward the fork, while the other — the lagging strand — is made discontinuously in short pieces called Okazaki fragments, which are later joined by DNA ligase. Replication is highly accurate, with proofreading. For NEET, fix the meaning of semiconservative replication, the Meselson–Stahl result (hybrid after one generation), and the mechanism: origin, DNA polymerase building 5'→3', leading (continuous) vs lagging (Okazaki fragments + ligase) strands.
| Item | Detail |
|---|---|
| Mode | semiconservative (1 old + 1 new strand) |
| Proof | Meselson & Stahl (¹⁵N/¹⁴N in E. coli) |
| Main enzyme | DNA polymerase (synthesises 5'→3') |
| Leading strand | continuous toward the fork |
| Lagging strand | discontinuous — Okazaki fragments + ligase |
In the Meselson–Stahl experiment, what density of DNA is found after one generation in ¹⁴N, and why?
Show solution
After one generation the DNA is of intermediate (hybrid) density. Each daughter molecule has one heavy (¹⁵N) parental strand and one light (¹⁴N) new strand — exactly what semiconservative replication predicts.
Why is one strand made continuously and the other in fragments during replication?
Show solution
DNA polymerase works only in the 5'→3' direction. On the strand oriented toward the fork the new strand can be made continuously (leading strand); on the antiparallel strand it must be made in short Okazaki fragments (lagging strand), later joined by DNA ligase.
DNA replication is:
Semiconservative replication was proved by:
DNA polymerase synthesises DNA in the direction:
Short fragments made on the lagging strand are:
The enzyme that joins Okazaki fragments is:
NEET tip: Replication = semiconservative (Meselson & Stahl, ¹⁵N/¹⁴N → hybrid after 1 generation). DNA polymerase builds 5'→3'; leading strand continuous, lagging strand = Okazaki fragments joined by DNA ligase; starts at the origin (ori).
The Central Dogma, Gene Regulation and Genomics
Transcription, the Genetic Code and TranslationTopic 3
The flow of genetic information is summarised by the central dogma (proposed by Crick): DNA → RNA → protein, i.e. DNA is transcribed into RNA, which is translated into protein. Transcription is the copying of one strand of DNA into RNA by the enzyme RNA polymerase. Only one strand, the template strand (read 3'→5'), is copied; the other is the coding strand. A transcription unit has a promoter (where RNA polymerase binds), the structural gene and a terminator.
In eukaryotes the first transcript, hnRNA, contains coding exons and non-coding introns (split genes). It is processed: the introns are removed and exons joined by splicing, a cap is added at the 5' end and a poly-A tail at the 3' end, producing mature mRNA.
The information in mRNA is read using the genetic code, whose features are heavily tested. The code is read in triplets called codons (three bases = one amino acid), giving 64 codons. It is degenerate (most amino acids have more than one codon), unambiguous (each codon specifies only one amino acid), nearly universal (same in almost all organisms), non-overlapping and comma-less. AUG is the start (initiation) codon (and codes for methionine), while UAA, UAG and UGA are the three stop (termination) codons.
The codons are decoded during translation, which occurs on the ribosome. The key molecule is tRNA (transfer RNA), the 'adaptor molecule': it has an anticodon that base-pairs with the mRNA codon and carries the matching amino acid. Translation proceeds in three steps — initiation (ribosome assembles at AUG), elongation (amino acids are added one by one, joined by peptide bonds) and termination (at a stop codon, releasing the polypeptide). For NEET, fix the central dogma, the template strand and transcription unit, splitting/splicing in eukaryotes, the features of the genetic code (with start/stop codons) and the adaptor role of tRNA.
| Item | Detail |
|---|---|
| Central dogma | DNA → RNA → protein (Crick) |
| Transcription unit | promoter + structural gene + terminator |
| Splicing | introns removed, exons joined (eukaryotes) |
| Genetic code | triplet, 64 codons, degenerate, universal |
| Start / stop | AUG / UAA, UAG, UGA; tRNA = adaptor |
What is meant by saying the genetic code is 'degenerate', and name the start and stop codons.
Show solution
Degenerate means most amino acids are coded by more than one codon (e.g. several codons for leucine). The start codon is AUG (codes for methionine); the stop codons are UAA, UAG and UGA.
Why is tRNA called the adaptor molecule?
Show solution
tRNA links the language of nucleic acids to that of proteins: one end has an anticodon that base-pairs with the mRNA codon, and the other end carries the specific amino acid for that codon. So it 'adapts' the codon to the correct amino acid during translation.
The central dogma is:
The start codon is:
The number of codons in the genetic code is:
Removal of introns and joining of exons is:
The adaptor molecule in translation is:
NEET tip: Central dogma: DNA→RNA→protein. Transcription: RNA polymerase on template strand (3'→5'); unit = promoter + gene + terminator; eukaryotic hnRNA spliced (introns out, exons joined) + cap + tail. Genetic code: triplet, 64 codons, degenerate, unambiguous, universal; AUG start, UAA/UAG/UGA stop. tRNA = adaptor (anticodon + amino acid).
Regulation of Gene Expression and the Human Genome ProjectTopic 4
Cells do not make every protein all the time — gene expression is regulated so that genes are switched on only when their products are needed. The classic model is the lac operon of E. coli, described by Jacob and Monod. An operon is a unit of bacterial genes controlled together. The lac operon has a regulatory gene (i) that makes a repressor, a promoter (p), an operator (o), and three structural genes — z (β-galactosidase), y (permease) and a (transacetylase) — whose enzymes metabolise lactose.
When lactose is absent, the repressor binds the operator and blocks transcription — the genes are off. When lactose is present, it acts as an inducer: it binds the repressor, inactivating it, so the repressor leaves the operator and RNA polymerase can transcribe the genes, making the lactose-digesting enzymes. Because lactose switches the operon on by removing the repressor's block, this is an example of negative regulation by induction.
On a much larger scale, the entire genetic make-up of humans was decoded by the Human Genome Project (HGP), a global effort that ran from 1990 to 2003. It sequenced the roughly 3 billion (3 × 10⁹) base pairs of the human genome and found that humans have only about 20,000–25,000 genes — far fewer than expected. Among its findings: less than 2% of the genome codes for proteins, and there are huge stretches of repetitive sequences.
One powerful application of knowing the variable parts of the genome is DNA fingerprinting, developed by Alec Jeffreys. It compares highly variable repetitive sequences called VNTRs (variable number of tandem repeats), which differ from person to person; since these patterns are unique to each individual (except identical twins), DNA fingerprinting is used in forensic identification, paternity testing and pedigree analysis. For NEET, fix the lac operon parts and its induction by lactose (negative regulation), the HGP key facts (1990–2003, ~3 billion bp, ~20,000–25,000 genes) and DNA fingerprinting using VNTRs.
| Item | Detail |
|---|---|
| lac operon (Jacob & Monod) | i (repressor), p, o, genes z, y, a |
| Inducer | lactose → inactivates repressor → genes ON |
| β-galactosidase | product of the z gene |
| HGP | 1990–2003; ~3 billion bp; ~20,000–25,000 genes |
| DNA fingerprinting | VNTRs (Alec Jeffreys); forensics, paternity |
What happens to the lac operon when lactose is added to the medium?
Show solution
Lactose acts as an inducer: it binds the repressor and inactivates it, so the repressor leaves the operator. RNA polymerase can now transcribe the structural genes (z, y, a), producing the enzymes (β-galactosidase, etc.) to use lactose. So the operon is switched ON.
State two key findings of the Human Genome Project and one use of DNA fingerprinting.
Show solution
HGP findings (any two): the human genome has about 3 billion base pairs; it contains only about 20,000–25,000 genes; less than 2% codes for proteins. DNA fingerprinting (using VNTRs) is used in forensic identification / paternity testing.
The lac operon was explained by:
In the lac operon, the inducer is:
The z gene of the lac operon codes for:
The human genome contains approximately how many base pairs?
DNA fingerprinting is based on differences in:
NEET tip: lac operon (Jacob & Monod): i→repressor, p, o, genes z (β-galactosidase), y (permease), a. Lactose = inducer → inactivates repressor → operon ON (negative regulation). HGP: 1990–2003, ~3 billion bp, ~20,000–25,000 genes. DNA fingerprinting = VNTRs (Alec Jeffreys) for forensics/paternity.
Quick Revision — Molecular Basis of Inheritance
- DNA: antiparallel double helix (Watson & Crick); A=T (2 H-bonds), G≡C (3 H-bonds); Chargaff: A=T, G=C; 10 bp/turn, pitch 3.4 nm.
- Packaging: negatively-charged DNA wraps positively-charged histones → nucleosome (~200 bp + histone octamer) → chromatin (euchromatin active, heterochromatin inactive).
- Genetic material: Griffith (transformation), Avery–MacLeod–McCarty (DNA), Hershey–Chase (phage → DNA). DNA is more stable than RNA.
- Replication is semiconservative (Meselson & Stahl); DNA polymerase synthesises 5'→3'; leading + lagging (Okazaki) strands.
- Central dogma: DNA → (transcription) RNA → (translation) protein.
- Genetic code: triplet, 64 codons, degenerate, universal, unambiguous; start AUG, stop UAA/UAG/UGA; tRNA = adaptor.
- lac operon (Jacob & Monod): lactose = inducer. HGP: ~3 billion bp; DNA fingerprinting uses VNTRs.
Frequently Asked Questions
Ready to test yourself?
Attempt the full timed mock tests — Main & Advanced level.
Start Mock Test 1 →