Question 1

What does statistical significance mean in an A/B test?

Accepted Answer

Statistical significance means the probability that your observed difference in conversion rates occurred by random chance is below a threshold you set in advance, called the significance level (α). At 95% confidence, α = 0.05, meaning you accept a 5% chance of a false positive (concluding there is a real difference when there isn't). It does not tell you the size of the effect or whether it is practically meaningful — a statistically significant result can still represent a trivially small conversion lift. Always pair significance with effect size and business impact.

Question 2

How many visitors do I need for a valid A/B test?

Accepted Answer

Required sample size depends on three factors: your baseline conversion rate, the minimum detectable effect (MDE) you care about, and your desired confidence level. As a rough rule, detecting a 10% relative lift (e.g., from 5% to 5.5%) at 95% confidence typically requires thousands of visitors per variant. You can use a sample size calculator before starting your test to avoid under-powered experiments, which are the most common cause of misleading A/B test results. Running a test too early and stopping when you see a positive result inflates false-positive rates significantly.

Question 3

Why should I use 99% confidence instead of 95% confidence for some A/B tests?

Accepted Answer

Use 99% confidence when the cost of a false positive is high — for example, when rolling out a change that is expensive to reverse, affects a core revenue flow, or will be seen by all users permanently. The tradeoff is that 99% confidence requires roughly 70% more traffic to achieve the same statistical power as a 95% test. For low-stakes UI tweaks or early-stage exploration, 95% is usually sufficient. Many teams also use 90% confidence for initial screening tests and reserve 99% for final go/no-go decisions.

A/B Test Statistical Significance Calculator

About this calculator

How to use

Frequently asked questions

What does statistical significance mean in an A/B test?

How many visitors do I need for a valid A/B test?

Why should I use 99% confidence instead of 95% confidence for some A/B tests?