Question 1

How long should I run an A/B test?

Accepted Answer

Until you reach your required sample size — not a day sooner. Use the Sample Size Planner to calculate the number, then divide by your daily traffic to estimate how many days you need. A common rule of thumb is at least two full business cycles (typically two weeks) to account for day-of-week effects, even if you hit your sample size earlier.

Question 2

What's a good minimum detectable effect?

Accepted Answer

It depends on your traffic and how much lift would actually matter to your business. A 20% relative MDE is a common starting point — if your conversion rate is 5%, you're looking to detect a shift to at least 6%. Smaller effects require exponentially more traffic. If you'd need 500,000 visitors to detect a 2% lift and you only get 10,000 a month, that test isn't worth running.

Question 3

Can I stop a test early if one variant is winning?

Accepted Answer

No. Early stopping massively inflates your false positive rate. If you check your test every day and stop the first time you see significance, your actual false positive rate can be 20-30% instead of the 5% you planned for. Decide your sample size upfront and commit to it. If you need the ability to stop early, look into sequential testing methods, which use adjusted significance thresholds.

Question 4

What does "80% power" mean?

Accepted Answer

Power is your test's ability to detect a real effect when one exists. At 80% power, if Variation B truly converts better than Control A by at least your MDE, you have an 80% chance of detecting that difference. The remaining 20% is the false negative rate — the chance you'll miss a real winner and call the test inconclusive. Higher power (90%) requires more traffic but reduces the risk of missing real improvements.

Question 5

Why do I need so many visitors for small effects?

Accepted Answer

Small effects are harder to distinguish from random variation. If your conversion rate is 5% and you're trying to detect a 2% relative lift (5.0% to 5.1%), the signal is tiny compared to the noise in conversion data. You need a large enough sample for the math to confidently say that 0.1 percentage point difference isn't just luck. The sample size grows roughly with the square of the effect size — halve the effect you want to detect, and you need roughly four times the traffic.

A/B Test Calculator

Related Tools

Marketing ROI Calculator

Keyword Density Checker

UTM Link Builder

Subject Line Analyzer

How to Calculate A/B Test Sample Size

What Is Statistical Significance?

Understanding P-Values and Confidence Levels

Common A/B Testing Mistakes