Skip to content

A/B Testing Process Cheat Sheet

A one-page summary of the entire A/B testing lifecycle. Use this as a quick reference to ensure every experiment you run is rigorous, insightful, and impactful.

Print Me!

This page is designed to be a single-page reference. Use your browser's print function (Ctrl/Cmd + P) to save it as a PDF or print it out to keep on your desk.

The 4 Phases of High-Impact A/B Testing

Phase 1: Hypothesis & Ideation

GoalKey Steps
Find high-impact ideas.1.Start with Data: Use quantitative analytics (e.g., funnels) to find where a problem is. Use qualitative data (e.g., session recordings) to understand why.2. **Write a Strong Hypothesis:** Use the `Because [data], we will [change], and we predict [outcome] because [assumption]` structure. The "because" clause is your engine for learning.3. Prioritize Rigorously: Use a framework like ICE (Impact, Confidence, Ease) to score and rank ideas objectively. Avoid the "Ease" trap of only doing minor tests.

Phase 2: Design & Setup

GoalKey Steps
Ensure a valid, trustworthy test.1.Define Metrics: Select one Primary Metric to decide the winner. Define 1-3 Guardrail Metrics to prevent unintended harm.2. **Isolate Variables:** In a standard A/B test, change only **one core element** per variant to get a clean learning.3. Calculate Sample Size: Use a calculator to determine the required sample size and test duration before you launch. This is non-negotiable.``4. Run for Full Weeks: Run tests for at least one full weekly cycle (e.g., Tuesday to Tuesday) to account for natural variations in user behavior.

Phase 3: Analysis & Interpretation

GoalKey Steps
Make a data-informed decision.1.If Variant Wins: The change is statistically significant (>95% confidence) and no guardrails were harmed. Decision: Implement the change and use the learning to inform your next hypothesis.2. **If Control Wins ("Loss"):** The change had a negative impact. **Decision:** A success! You prevented a bad change. Document the invalidated assumption and celebrate the learning.3. If Inconclusive: No significant difference. Decision: The change didn't matter. Default to the control (don't ship). Segment the results to generate new, more targeted hypotheses for your next test.

Phase 4: Communication & Learning

GoalKey Steps
Drive action & build knowledge.1.Tell a Story: Frame results as a narrative (Setup, Confrontation, Resolution). Make the "learning" the hero.2. **Tailor the Message:** Speak in terms of **ROI** for executives, **user problems** for engineers, and **user experience** for designers.3. Document Everything: Maintain a central, searchable repository of all tests. This is your company's institutional memory and prevents duplicate work.

Critical Pitfalls to Avoid

PitfallWhy It's DangerousThe Solution
"Peeking" at ResultsStopping a test the moment it looks good. This capitalizes on random noise and is the**#1 cause of false positives**.Trust your pre-calculated sample size. Commit to letting the test run its full, pre-determined course. Discipline is your best defense.
Ignoring SignificanceMaking a decision on a result with low confidence (e.g., 70%). This is just acting on random chance.Adhere to a strict 95% confidence threshold. If it doesn't meet the bar, it's not a real result.
Forgetting GuardrailsA "win" on your primary metric that tanks a key guardrail metric (like user retention) is actually anet loss.Always define guardrail metrics. A true win improves the user experience without causing collateral damage.
Testing Trivial ChangesFocusing on minor tweaks (like shades of a color) that have no chance of producing a detectable impact.Prioritize bold, high-impact hypotheses. Focus on significant changes on high-traffic, high-value pages.
3 min read
Text-to-Speech is not supported in your browser.