NEET (UG)

Molecular Basis of Inheritance

DNA structure and packaging, the search for the genetic material, DNA replication, transcription, the genetic code, translation, gene regulation and the Human Genome Project

Module 1

The Genetic Material, Its Structure and Replication

DNA Structure, Packaging and the Search for the Genetic MaterialTopic 1

DNA (deoxyribonucleic acid) is the genetic material of most organisms. It is a long polymer of nucleotides, each made of a deoxyribose sugar, a phosphate and a nitrogenous base — the purines adenine (A) and guanine (G) and the pyrimidines cytosine (C) and thymine (T). The Watson–Crick double-helix model (1953) describes DNA as two antiparallel strands (one running 5'→3', the other 3'→5') coiled into a right-handed helix, with the sugar-phosphate backbones outside and the bases paired inside.

Base pairing is specific and complementary: A pairs with T by two hydrogen bonds and G pairs with C by three hydrogen bonds. This matches Chargaff's rule that the amount of A = T and G = C. The helix has about 10 base pairs per turn, a pitch (length of one turn) of 3.4 nm and a rise of 0.34 nm per base pair.

A human cell holds about 2 metres of DNA, so it must be tightly packaged. The negatively charged DNA wraps around a core of positively charged proteins called histones. About 200 base pairs of DNA wound around a histone octamer form a nucleosome, the basic repeating unit, giving a 'beads-on-a-string' look; nucleosomes coil further into chromatin and chromosomes. Loosely-packed, transcriptionally active chromatin is euchromatin; densely-packed, inactive chromatin is heterochromatin.

How did we learn DNA is the genetic material? Griffith (1928) found a 'transforming principle' in Streptococcus pneumoniae (his transformation experiment with R and S strains). Avery, MacLeod and McCarty identified that transforming principle as DNA. Finally, Hershey and Chase (1952) used bacteriophages labelled with radioactive ³²P (DNA) and ³⁵S (protein) and showed that only the ³²P (DNA) entered the bacteria — conclusively proving DNA is the genetic material. (RNA was likely the first genetic material in evolution, but DNA, being double-stranded and chemically more stable, took over.) For NEET, fix the helix details, base-pairing/H-bonds, Chargaff's rule, the nucleosome, and the three classic experiments.

Figure — DNA Structure, Packaging and the Search for the Genetic Material

Feature	Detail
Base pairs	A=T (2 H-bonds), G≡C (3 H-bonds)
Helix	antiparallel; 10 bp/turn; pitch 3.4 nm
Nucleosome	~200 bp + histone octamer
Euchromatin / heterochromatin	loose, active / dense, inactive
Hershey–Chase	³²P (DNA) enters bacteria → DNA is genetic material

Worked Examples

If a DNA segment has 22% adenine, what percentage of guanine does it contain?

Show solution

By Chargaff's rule, A = T, so T = 22%, and A + T = 44%. The rest is G + C = 56%, and G = C, so guanine = 28%.

How did Hershey and Chase show that DNA, not protein, is the genetic material?

Show solution

They labelled phage DNA with ³²P and phage protein with ³⁵S. After infection, only the ³²P (DNA) was found inside the bacteria, while the ³⁵S (protein coat) stayed outside — proving DNA is the genetic material.

✎ Self-Check — 5 questions0 / 5

Q1.

Adenine pairs with thymine by how many hydrogen bonds?

Explanation: A=T has two hydrogen bonds; G≡C has three.

Q2.

According to Chargaff's rule:

Explanation: Amounts of A=T and G=C.

Q3.

The basic packaging unit of DNA with histones is the:

Explanation: ~200 bp DNA + histone octamer = nucleosome.

Q4.

The Hershey–Chase experiment used:

Explanation: Bacteriophages labelled with ³²P and ³⁵S.

Q5.

Transcriptionally active, loosely-packed chromatin is:

Explanation: Euchromatin is loose and active; heterochromatin is dense and inactive.

NEET tip: DNA: antiparallel helix, A=T (2 H-bonds), G≡C (3 H-bonds), Chargaff A=T & G=C, 10 bp/turn, pitch 3.4 nm. Nucleosome = ~200 bp + histone octamer; euchromatin (active) vs heterochromatin (inactive). Genetic material: Griffith (transformation) → Avery et al. (DNA) → Hershey–Chase (³²P proves DNA).

DNA ReplicationTopic 2

For genetic information to pass to daughter cells, DNA must be copied (replicated) before cell division. The Watson–Crick model itself suggested how: because the two strands are complementary, each can serve as a template to make the other. This gives semiconservative replication — each new DNA molecule keeps one old (parental) strand and one newly synthesised strand.

This was proved by the classic experiment of Meselson and Stahl (1958) on Escherichia coli. They grew bacteria in medium with heavy nitrogen (¹⁵N) so the DNA was heavy, then shifted them to normal nitrogen (¹⁴N). After one generation the DNA was of intermediate (hybrid) density (one heavy + one light strand), and after two generations there were equal amounts of hybrid and light DNA — exactly the pattern predicted by semiconservative replication.

Mechanistically, replication begins at a specific site, the origin of replication (ori). The enzyme helicase unwinds the helix to form a replication fork, and the main enzyme DNA polymerase adds nucleotides to build the new strands. A crucial constraint is that DNA polymerase can only synthesise in the 5'→3' direction.

Because the two template strands are antiparallel, this directionality means one new strand — the leading strand — is made continuously toward the fork, while the other — the lagging strand — is made discontinuously in short pieces called Okazaki fragments, which are later joined by DNA ligase. Replication is highly accurate, with proofreading. For NEET, fix the meaning of semiconservative replication, the Meselson–Stahl result (hybrid after one generation), and the mechanism: origin, DNA polymerase building 5'→3', leading (continuous) vs lagging (Okazaki fragments + ligase) strands.

Figure — DNA Replication

Item	Detail
Mode	semiconservative (1 old + 1 new strand)
Proof	Meselson & Stahl (¹⁵N/¹⁴N in E. coli)
Main enzyme	DNA polymerase (synthesises 5'→3')
Leading strand	continuous toward the fork
Lagging strand	discontinuous — Okazaki fragments + ligase

Worked Examples

In the Meselson–Stahl experiment, what density of DNA is found after one generation in ¹⁴N, and why?

Show solution

After one generation the DNA is of intermediate (hybrid) density. Each daughter molecule has one heavy (¹⁵N) parental strand and one light (¹⁴N) new strand — exactly what semiconservative replication predicts.

Why is one strand made continuously and the other in fragments during replication?

Show solution

DNA polymerase works only in the 5'→3' direction. On the strand oriented toward the fork the new strand can be made continuously (leading strand); on the antiparallel strand it must be made in short Okazaki fragments (lagging strand), later joined by DNA ligase.

✎ Self-Check — 5 questions0 / 5

Q1.

DNA replication is:

Explanation: Each daughter molecule has one old and one new strand.

Q2.

Semiconservative replication was proved by:

Explanation: Meselson and Stahl used ¹⁵N/¹⁴N in E. coli.

Q3.

DNA polymerase synthesises DNA in the direction:

Explanation: Synthesis is strictly 5'→3'.

Q4.

Short fragments made on the lagging strand are:

Explanation: Okazaki fragments are joined later by ligase.

Q5.

The enzyme that joins Okazaki fragments is:

Explanation: DNA ligase seals the fragments.

NEET tip: Replication = semiconservative (Meselson & Stahl, ¹⁵N/¹⁴N → hybrid after 1 generation). DNA polymerase builds 5'→3'; leading strand continuous, lagging strand = Okazaki fragments joined by DNA ligase; starts at the origin (ori).

Module 2

The Central Dogma, Gene Regulation and Genomics

Transcription, the Genetic Code and TranslationTopic 3

The flow of genetic information is summarised by the central dogma (proposed by Crick): DNA → RNA → protein, i.e. DNA is transcribed into RNA, which is translated into protein. Transcription is the copying of one strand of DNA into RNA by the enzyme RNA polymerase. Only one strand, the template strand (read 3'→5'), is copied; the other is the coding strand. A transcription unit has a promoter (where RNA polymerase binds), the structural gene and a terminator.

In eukaryotes the first transcript, hnRNA, contains coding exons and non-coding introns (split genes). It is processed: the introns are removed and exons joined by splicing, a cap is added at the 5' end and a poly-A tail at the 3' end, producing mature mRNA.

The information in mRNA is read using the genetic code, whose features are heavily tested. The code is read in triplets called codons (three bases = one amino acid), giving 64 codons. It is degenerate (most amino acids have more than one codon), unambiguous (each codon specifies only one amino acid), nearly universal (same in almost all organisms), non-overlapping and comma-less. AUG is the start (initiation) codon (and codes for methionine), while UAA, UAG and UGA are the three stop (termination) codons.

The codons are decoded during translation, which occurs on the ribosome. The key molecule is tRNA (transfer RNA), the 'adaptor molecule': it has an anticodon that base-pairs with the mRNA codon and carries the matching amino acid. Translation proceeds in three steps — initiation (ribosome assembles at AUG), elongation (amino acids are added one by one, joined by peptide bonds) and termination (at a stop codon, releasing the polypeptide). For NEET, fix the central dogma, the template strand and transcription unit, splitting/splicing in eukaryotes, the features of the genetic code (with start/stop codons) and the adaptor role of tRNA.

Figure — Transcription, the Genetic Code and Translation

Item	Detail
Central dogma	DNA → RNA → protein (Crick)
Transcription unit	promoter + structural gene + terminator
Splicing	introns removed, exons joined (eukaryotes)
Genetic code	triplet, 64 codons, degenerate, universal
Start / stop	AUG / UAA, UAG, UGA; tRNA = adaptor

Worked Examples

What is meant by saying the genetic code is 'degenerate', and name the start and stop codons.

Show solution

Degenerate means most amino acids are coded by more than one codon (e.g. several codons for leucine). The start codon is AUG (codes for methionine); the stop codons are UAA, UAG and UGA.

Why is tRNA called the adaptor molecule?

Show solution

tRNA links the language of nucleic acids to that of proteins: one end has an anticodon that base-pairs with the mRNA codon, and the other end carries the specific amino acid for that codon. So it 'adapts' the codon to the correct amino acid during translation.

✎ Self-Check — 5 questions0 / 5

Q1.

The central dogma is:

Explanation: DNA → RNA → protein (Crick).

Q2.

The start codon is:

Explanation: AUG is the start codon (methionine).

Q3.

The number of codons in the genetic code is:

Explanation: There are 64 codons (4³).

Q4.

Removal of introns and joining of exons is:

Explanation: Splicing removes introns and joins exons.

Q5.

The adaptor molecule in translation is:

Explanation: tRNA carries an anticodon and an amino acid.

NEET tip: Central dogma: DNA→RNA→protein. Transcription: RNA polymerase on template strand (3'→5'); unit = promoter + gene + terminator; eukaryotic hnRNA spliced (introns out, exons joined) + cap + tail. Genetic code: triplet, 64 codons, degenerate, unambiguous, universal; AUG start, UAA/UAG/UGA stop. tRNA = adaptor (anticodon + amino acid).

Regulation of Gene Expression and the Human Genome ProjectTopic 4

Cells do not make every protein all the time — gene expression is regulated so that genes are switched on only when their products are needed. The classic model is the lac operon of E. coli, described by Jacob and Monod. An operon is a unit of bacterial genes controlled together. The lac operon has a regulatory gene (i) that makes a repressor, a promoter (p), an operator (o), and three structural genes — z (β-galactosidase), y (permease) and a (transacetylase) — whose enzymes metabolise lactose.

When lactose is absent, the repressor binds the operator and blocks transcription — the genes are off. When lactose is present, it acts as an inducer: it binds the repressor, inactivating it, so the repressor leaves the operator and RNA polymerase can transcribe the genes, making the lactose-digesting enzymes. Because lactose switches the operon on by removing the repressor's block, this is an example of negative regulation by induction.

On a much larger scale, the entire genetic make-up of humans was decoded by the Human Genome Project (HGP), a global effort that ran from 1990 to 2003. It sequenced the roughly 3 billion (3 × 10⁹) base pairs of the human genome and found that humans have only about 20,000–25,000 genes — far fewer than expected. Among its findings: less than 2% of the genome codes for proteins, and there are huge stretches of repetitive sequences.

One powerful application of knowing the variable parts of the genome is DNA fingerprinting, developed by Alec Jeffreys. It compares highly variable repetitive sequences called VNTRs (variable number of tandem repeats), which differ from person to person; since these patterns are unique to each individual (except identical twins), DNA fingerprinting is used in forensic identification, paternity testing and pedigree analysis. For NEET, fix the lac operon parts and its induction by lactose (negative regulation), the HGP key facts (1990–2003, ~3 billion bp, ~20,000–25,000 genes) and DNA fingerprinting using VNTRs.

Figure — Regulation of Gene Expression and the Human Genome Project

Item	Detail
lac operon (Jacob & Monod)	i (repressor), p, o, genes z, y, a
Inducer	lactose → inactivates repressor → genes ON
β-galactosidase	product of the z gene
HGP	1990–2003; ~3 billion bp; ~20,000–25,000 genes
DNA fingerprinting	VNTRs (Alec Jeffreys); forensics, paternity

Worked Examples

What happens to the lac operon when lactose is added to the medium?

Show solution

Lactose acts as an inducer: it binds the repressor and inactivates it, so the repressor leaves the operator. RNA polymerase can now transcribe the structural genes (z, y, a), producing the enzymes (β-galactosidase, etc.) to use lactose. So the operon is switched ON.

State two key findings of the Human Genome Project and one use of DNA fingerprinting.

Show solution

HGP findings (any two): the human genome has about 3 billion base pairs; it contains only about 20,000–25,000 genes; less than 2% codes for proteins. DNA fingerprinting (using VNTRs) is used in forensic identification / paternity testing.

✎ Self-Check — 5 questions0 / 5

Q1.

The lac operon was explained by:

Explanation: Jacob and Monod proposed the lac operon model.

Q2.

In the lac operon, the inducer is:

Explanation: Lactose induces the operon by inactivating the repressor.

Q3.

The z gene of the lac operon codes for:

Explanation: The z gene encodes β-galactosidase.

Q4.

The human genome contains approximately how many base pairs?

Explanation: About 3 × 10⁹ (3 billion) base pairs.

Q5.

DNA fingerprinting is based on differences in:

Explanation: It compares VNTRs that vary between individuals.

NEET tip: lac operon (Jacob & Monod): i→repressor, p, o, genes z (β-galactosidase), y (permease), a. Lactose = inducer → inactivates repressor → operon ON (negative regulation). HGP: 1990–2003, ~3 billion bp, ~20,000–25,000 genes. DNA fingerprinting = VNTRs (Alec Jeffreys) for forensics/paternity.

Quick Revision — Molecular Basis of Inheritance

DNA: antiparallel double helix (Watson & Crick); A=T (2 H-bonds), G≡C (3 H-bonds); Chargaff: A=T, G=C; 10 bp/turn, pitch 3.4 nm.
Packaging: negatively-charged DNA wraps positively-charged histones → nucleosome (~200 bp + histone octamer) → chromatin (euchromatin active, heterochromatin inactive).
Genetic material: Griffith (transformation), Avery–MacLeod–McCarty (DNA), Hershey–Chase (phage → DNA). DNA is more stable than RNA.
Replication is semiconservative (Meselson & Stahl); DNA polymerase synthesises 5'→3'; leading + lagging (Okazaki) strands.
Central dogma: DNA → (transcription) RNA → (translation) protein.
Genetic code: triplet, 64 codons, degenerate, universal, unambiguous; start AUG, stop UAA/UAG/UGA; tRNA = adaptor.
lac operon (Jacob & Monod): lactose = inducer. HGP: ~3 billion bp; DNA fingerprinting uses VNTRs.

Frequently Asked Questions

What is the Watson–Crick model of DNA?

DNA is a double helix of two antiparallel polynucleotide strands (one 5'→3', the other 3'→5') wound around a common axis, with sugar-phosphate backbones outside and bases pairing inside. Base pairing is specific and complementary: adenine pairs with thymine (two hydrogen bonds) and guanine pairs with cytosine (three hydrogen bonds). The helix is right-handed, with about 10 base pairs per turn and a pitch of 3.4 nm.

How is the long DNA molecule packaged inside a cell?

The negatively charged DNA wraps around a core of positively charged proteins called histones. About 200 base pairs of DNA wound around a histone octamer form a nucleosome, the basic repeating unit, giving a 'beads-on-a-string' appearance. Nucleosomes coil further into chromatin fibres and ultimately into chromosomes. Loosely packed, transcriptionally active chromatin is euchromatin; densely packed, inactive chromatin is heterochromatin.

What did the Hershey–Chase experiment prove?

Hershey and Chase used bacteriophages whose DNA was labelled with radioactive phosphorus (32P) and whose protein coat was labelled with radioactive sulphur (35S). When the phages infected bacteria, only the 32P (DNA) entered the bacterial cells, not the 35S (protein). This proved that DNA, not protein, is the genetic material that is passed on to the next generation.

Why is DNA replication called semiconservative?

Because each new DNA molecule keeps one old (parental) strand and one newly made strand. The two parental strands separate, and each acts as a template for a new complementary strand, so every daughter molecule is half old and half new. This was proved by Meselson and Stahl using heavy (15N) and normal (14N) nitrogen isotopes in E. coli.

What are the main features of the genetic code?

The genetic code is read in triplets (codons) — three bases code for one amino acid, giving 64 codons. It is degenerate (some amino acids are coded by more than one codon), unambiguous (each codon codes for only one amino acid), nearly universal (the same in almost all organisms), non-overlapping and comma-less. AUG is the start codon (and codes for methionine), and UAA, UAG and UGA are stop codons.

Ready to test yourself?

Attempt the full timed mock tests — Main & Advanced level.

Start Mock Test 1 →