Which Statistical Test Do I Use?
This is an interactive clickable wizard with immediate feedback, a live decision tree, and MFM-specific examples.
Interactive Test Selector
Answer the prompts. The recommended test updates instantly.
1. What type of outcome are you analyzing?
2. Are the groups independent or paired?
3. How many groups or levels are being compared?
4. Is the continuous data approximately normal, or is sample size small?
5. Do you need to adjust for confounders or model predictors?
Start the wizard
Choose the outcome type first. The tool will narrow to the best statistical test and explain why.
When to use it
Common
Decision Tree
The highlighted path shows how the tool is reasoning.
Side-by-Side Examples
Independent categorical
Paired categorical
Continuous, 2 groups
Continuous, skewed
Quick Reference
Categorical outcomes
Continuous outcomes
Nonparametric + models
Diagnostic Test Interpretation Mini-Module
1) Start with the 2×2 table
2) Rule-out vs rule-in
3) Predictive values depend on prevalence
4) Likelihood ratios are very high yield
5) Quick LR interpretation
6) ROC curve and AUC
7) Example: screening test
8) Example: positive screen in low prevalence setting
Interactive Ultimate Test Selection Flowchart
Click through the decision tree below. It highlights the path and can auto-fill the wizard above.
Free Review Links
Focused external review pages for the tests most commonly used in MFM research and board questions.
Choosing the right test
General free textbook
Choosing the correct statistical test
Likelihood ratios
Diagnostic test characteristics
Quiz Mode
Use these vignette-style questions after the wizard to lock in the pattern recognition.
How it works
Reference Page: Side-by-Side Calculations
More detailed worked examples for the most commonly tested biostatistics methods in MFM board-style questions.
Chi-square
| Preeclampsia | No Preeclampsia | Total | |
|---|---|---|---|
| Aspirin | 10 | 90 | 100 |
| Placebo | 20 | 80 | 100 |
| Total | 30 | 170 | 200 |
Aspirin + preeclampsia = (100 × 30) / 200 = 15
Aspirin + no preeclampsia = (100 × 170) / 200 = 85
Placebo + preeclampsia = 15
Placebo + no preeclampsia = 85
χ² = (10−15)²/15 + (90−85)²/85 + (20−15)²/15 + (80−85)²/85
= 25/15 + 25/85 + 25/15 + 25/85
≈ 1.67 + 0.29 + 1.67 + 0.29 = 3.92
Fisher exact
| Rare Outcome | No Rare Outcome | Total | |
|---|---|---|---|
| Exposure | 1 | 9 | 10 |
| No Exposure | 8 | 2 | 10 |
| Total | 9 | 11 | 20 |
Because an expected count is < 5, the Chi-square approximation is unreliable.
McNemar
| After Positive | After Negative | Total | |
|---|---|---|---|
| Before Positive | 30 | 20 | 50 |
| Before Negative | 10 | 40 | 50 |
| Total | 40 | 60 | 100 |
The concordant cells (30 and 40) do not drive the test.
= (20−10)² / (20+10)
= 100 / 30 ≈ 3.33
With continuity correction: χ² = (|b−c|−1)² / (b+c) = 9²/30 = 2.7
t-test
| Group | n | Mean | SD |
|---|---|---|---|
| Diabetic | 50 | 3200 | 400 |
| Non-diabetic | 50 | 3000 | 350 |
Here the question is whether the 200 g difference is large relative to the variability within groups.
Paired t-test
| Patient | Before | After | Difference |
|---|---|---|---|
| 1 | 9.0 | 10.1 | +1.1 |
| 2 | 9.4 | 10.2 | +0.8 |
| 3 | 8.9 | 10.3 | +1.4 |
| 4 | 9.5 | 10.1 | +0.6 |
| 5 | 9.1 | 10.1 | +1.0 |
ANOVA
| Group | n | Mean birthweight |
|---|---|---|
| BMI I | 40 | 3000 g |
| BMI II | 40 | 3150 g |
| BMI III | 40 | 3350 g |
Conceptually: F = variance between groups / variance within groups
Mann-Whitney U
| Group A | Group B |
|---|---|
| 4 | 3 |
| 6 | 5 |
| 8 | 7 |
| 30 | 9 |
Group B ranks = 1, 3, 5, 7 → rank sum = 16
The test asks whether one group tends to have systematically higher ranks than the other.
Wilcoxon signed-rank
| Patient | Difference | Absolute difference | Rank |
|---|---|---|---|
| 1 | +2 | 2 | 3.5 |
| 2 | +1 | 1 | 1.5 |
| 3 | −1 | 1 | 1.5 |
| 4 | +3 | 3 | 5 |
| 5 | +2 | 2 | 3.5 |
Negative signed ranks = 1.5
Kruskal-Wallis
| Group A | Group B | Group C |
|---|---|---|
| 1 | 2 | 4 |
| 2 | 3 | 5 |
| 10 | 12 | 15 |
This is the nonparametric analogue of asking whether the group means differ in ANOVA.
Logistic regression
| Predictor | β coefficient | Approximate interpretation |
|---|---|---|
| BMI | 0.08 | Higher BMI raises log odds |
| Age | 0.03 | Older age slightly raises log odds |
| Prior PE | 1.10 | Much higher odds |
Interpretation: for each 1-unit increase in BMI, the odds of preeclampsia increase by about 8%, holding the other covariates constant.
Linear regression
| Predictor | β coefficient | Interpretation |
|---|---|---|
| Gestational age (weeks) | 180 | +180 g per week |
| Smoking | −120 | 120 g lower if smoker |
| Male fetus | +90 | 90 g higher |
Cox proportional hazards
| Group | Median time to delivery | Hazard ratio |
|---|---|---|
| Abnormal UA Doppler | 10 days | 1.8 |
| Normal UA Doppler | 18 days | Reference |