Question 1

Why does halving the margin of error roughly quadruple the required sample size?

Accepted Answer

Because the standard error of a proportion scales with 1/√n, the margin of error you can achieve at any given confidence level also scales with 1/√n. To halve the margin you must quadruple n — that is the unavoidable arithmetic of the inverse square-root relationship. This is one of the most important practical facts in survey design: precision is expensive, and small precision gains at the tight end of the spectrum cost dramatically more than equivalent gains at the loose end. Moving from a ±10% to a ±5% margin requires 4× the sample; moving from ±5% to ±2.5% requires another 4×, and so on. Always start by asking what precision your downstream decision actually requires, then design backwards — over-precise surveys are a common form of overspending.

Question 2

Why is sample size almost independent of population size for large populations?

Accepted Answer

The finite-population correction term in the denominator (z²·0.25 / N) becomes negligible when N is much larger than n, so the formula collapses to n ≈ z²·0.25 / (e/100)² — a function only of confidence level and margin, not of N. Intuitively, the precision of an estimate depends on the absolute number of independent observations, not on what fraction of the population they represent. This is why national polls of the US (330 M people) get reliable results from around 1000–2000 respondents, the same order of magnitude that would be needed to poll a city of 100 K. The correction matters only when n approaches a meaningful fraction of N (roughly >5%); for small populations like a single company's employees or a single school's students, the corrected formula saves a noticeable number of respondents.

Question 3

When should I use a different p value instead of the 0.5 worst case?

Accepted Answer

Replace 0.25 (= 0.5 × 0.5) with p·(1 − p) for a known or expected proportion p whenever you have prior evidence that the true p is far from 0.5. For example, if you are estimating a defect rate believed to be around 5%, p·(1 − p) = 0.05 × 0.95 = 0.0475 — about a fifth of the worst-case value — and required sample size drops by the same factor. The trade-off is that if your prior estimate is wrong and p turns out to be closer to 0.5, your achieved margin will be wider than planned. The 0.5 worst-case assumption is the conservative, defensive choice and is what this calculator uses. Custom-p formulas are common in industrial sampling (defect rates), medical screening (rare-disease prevalence), and market research where strong prior data exists.

Question 4

What are the most common mistakes people make in sample-size planning?

Accepted Answer

The first is conflating margin of error with effect size — sample-size formulas for proportions answer "how precisely can I estimate p?", not "what sample do I need to detect a difference of size Δ?" The latter is a power-analysis question and uses a different formula. The second is forgetting non-response: if you expect 30% non-response, you need to send the survey to 1/(1 − 0.30) ≈ 1.43× as many people as the formula's n. The third is assuming simple random sampling when the actual design uses clusters or stratification; clustered designs typically need 1.5–3× as many respondents to hit the same precision (the design-effect multiplier). The fourth is treating "95% confidence" as some sort of magic threshold — it is a convention, not a law, and other levels are equally valid if your decision context calls for them. Finally, people often forget that this calculator answers a precision question, not a statistical-power question; for hypothesis testing, use a power-analysis tool.

Question 5

When should I not use this calculator?

Accepted Answer

Skip it when you are planning a study to detect a specific effect size (a treatment difference, an A/B-test lift, a correlation) — that requires a power-analysis calculator (effect size, α, power), not a margin-of-error formula. Do not use it for surveys with multiple outcome variables of vastly different prevalences; design for the rarest outcome you care about. Avoid it for stratified or clustered sample designs without applying a design effect; the unmodified n understates the requirement and your real margin will be wider than planned. It is the wrong tool for estimating means (not proportions) of continuous variables — use a mean-based SE = σ/√n formula instead. Finally, do not use it for very small populations where you can simply survey everyone (a census); the formula will recommend a sample close to N and you may as well take the census and have zero sampling error.

Sample Size Calculator

Compare with similar

About this calculator

How to use

Frequently asked questions

Why does halving the margin of error roughly quadruple the required sample size?

Why is sample size almost independent of population size for large populations?

When should I use a different p value instead of the 0.5 worst case?

What are the most common mistakes people make in sample-size planning?

When should I not use this calculator?

Sources & references