Question 1

What are the assumptions of Hardy-Weinberg equilibrium?

Accepted Answer

Five strict assumptions: (1) large population size — no genetic drift (random sampling effects that change allele frequencies in small populations); (2) random mating — no assortative mating, no inbreeding; (3) no mutation — allele frequencies don't change through new mutations or back-mutations; (4) no migration — no gene flow into or out of the population; (5) no natural selection — all genotypes have equal fitness. Real populations always violate at least some of these assumptions to some degree; Hardy-Weinberg is a null hypothesis, not a description of any actual population. Deviations from expected H-W genotype frequencies (tested with chi-square) indicate which assumption is violated and by how much. The principle is most useful as a baseline expectation against which to detect evolutionary forces or to estimate allele frequencies from genotype data when the assumptions approximately hold.

Question 2

How do I calculate carrier frequency for a recessive genetic disease?

Accepted Answer

If the disease occurs in 1 out of N newborns, then q² = 1/N (assuming H-W holds for the population). Take the square root to get q = √(1/N). Then p = 1 − q (for a two-allele system). The carrier frequency is 2pq, which for rare diseases is approximately 2q (since p ≈ 1). Example: cystic fibrosis at 1/2500 → q² = 0.0004 → q = 0.02 → carriers 2 · 0.98 · 0.02 ≈ 0.039 ≈ 1 in 25 people. Tay-Sachs at 1/3500 in Ashkenazi Jews → q ≈ 0.017 → carriers ≈ 3.3% or 1 in 30. Phenylketonuria at 1/10000 → q ≈ 0.01 → carriers ≈ 2% or 1 in 50. The pattern: carrier frequencies are dramatically higher than disease frequencies, which is why pre-conception or prenatal carrier screening is so cost-effective for population health.

Question 3

How do I test whether a population is in Hardy-Weinberg equilibrium?

Accepted Answer

Compare observed genotype counts to expected counts using a chi-square goodness-of-fit test. Step 1: calculate p and q from observed allele frequencies (count alleles, not genotypes). Step 2: compute expected genotype counts as p²·N (AA), 2pq·N (Aa), q²·N (aa) where N = total individuals. Step 3: χ² = Σ (observed − expected)² / expected, summed over the three genotypes. Step 4: compare against the chi-square distribution with degrees of freedom = (number of genotypes − number of alleles) = 3 − 2 = 1 for a biallelic locus. A p-value below 0.05 suggests the population deviates significantly from H-W expectations. Common causes: assortative mating, inbreeding (excess homozygotes), heterozygote advantage (excess heterozygotes), recent migration, sampling bias.

Question 4

What are the most common mistakes people make with Hardy-Weinberg?

Accepted Answer

The first is forgetting that p + q must equal 1 for a two-allele locus; entering p = 0.6, q = 0.6 produces nonsense because the alleles can't each be 60% of the population. The second is confusing allele frequency (p, q) with genotype frequency (p², 2pq, q²); 30% of people having the disease genotype (q² = 0.3) is very different from 30% allele frequency (q = 0.3, q² = 0.09). The third is applying H-W expectations to small populations (less than ~1000 individuals) where genetic drift can produce substantial deviation even without selection. The fourth is assuming H-W applies across populations; combining genotype counts from two genetically distinct populations always produces an apparent excess of homozygotes (the Wahlund effect) even if each subpopulation is in H-W. The fifth is using H-W for sex-linked loci without adjustment; X-linked traits have different expected frequencies in males (hemizygous) vs females (homozygous or heterozygous).

Question 5

When should I not use this calculator?

Accepted Answer

Skip it for loci with more than two alleles (most actual genetic loci have multiple variants); use the multinomial expansion (p² + q² + r² + 2pq + 2pr + 2qr = 1 for three alleles) or specialised population-genetics software. Don't use it for X-linked or Y-linked traits without adjusting for hemizygous males; the expected frequencies differ between sexes for sex-linked loci. Avoid it for populations known to violate H-W assumptions strongly (small isolated populations, populations with strong recent migration or selection, founder populations); the equilibrium frequencies don't apply meaningfully. It's the wrong tool for analysing multi-locus haplotypes, linkage disequilibrium, or polygenic traits where multiple loci interact. Finally, don't use it to "prove" no selection is occurring; the H-W test has limited power to detect mild selection over a few generations, and absence of evidence isn't evidence of absence.

Hardy-Weinberg Equilibrium Calculator

Compare with similar

About this calculator

How to use

Frequently asked questions

What are the assumptions of Hardy-Weinberg equilibrium?

How do I calculate carrier frequency for a recessive genetic disease?

How do I test whether a population is in Hardy-Weinberg equilibrium?

What are the most common mistakes people make with Hardy-Weinberg?

When should I not use this calculator?

Sources & references