Power Analysis Fundamentals
Statistical power analysis links four quantities:
• α (alpha) — significance level (Type I error rate), typically 0.05
• 1 − β (power) — probability of detecting a true effect, typically 0.80
• Effect size — magnitude of the difference you want to detect
• n (sample size) — number of observations per group
Given any three of these, the fourth can be computed. The most common use is to fix α, power, and effect size, then solve for the required sample size. This is a *prospective* (a priori) power analysis and should be done before data collection begins.
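As an illustration of "given any three, solve for the fourth," the sketch below solves for power given n, α, and effect size, using the normal approximation for a two-sample t-test. This is a minimal sketch with the standard library's statistics.NormalDist; the function name achieved_power is illustrative, not from any particular package.

```python
from math import sqrt
from statistics import NormalDist

def achieved_power(n, d, alpha=0.05):
    """Approximate power of a two-sided, two-sample t-test
    with n observations per group and effect size d (Cohen's d),
    using the normal approximation."""
    z = NormalDist()
    z_alpha = z.inv_cdf(1 - alpha / 2)   # two-sided critical value
    return z.cdf(d * sqrt(n / 2) - z_alpha)

# With 63 per group and d = 0.5, power is about 0.80:
print(round(achieved_power(63, 0.5), 2))  # → 0.8
```

Dedicated tools (e.g., G*Power, or power routines in statistical libraries) use the exact noncentral t distribution and will give slightly larger sample sizes than this normal approximation.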
For a two-sample t-test, the formula is:
n = 2 × ((z_{α/2} + z_β) / d)²
where d is Cohen’s d (standardized mean difference), z_{α/2} is the critical value for the two-sided significance level, and z_β is the z-value corresponding to the desired power.
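The formula above translates directly into code. A minimal sketch using the standard library's statistics.NormalDist (the function name n_per_group is illustrative); rounding up to the next whole observation is conventional:

```python
from math import ceil
from statistics import NormalDist

def n_per_group(d, alpha=0.05, power=0.80):
    """Per-group sample size for a two-sided, two-sample t-test,
    via the normal-approximation formula n = 2 * ((z_{a/2} + z_b) / d)^2."""
    z = NormalDist()
    z_alpha = z.inv_cdf(1 - alpha / 2)   # critical value for two-sided alpha
    z_beta = z.inv_cdf(power)            # z-value for the desired power
    return ceil(2 * ((z_alpha + z_beta) / d) ** 2)

# Medium effect (d = 0.5) at alpha = 0.05, power = 0.80:
print(n_per_group(0.5))  # → 63 per group
```

Because the normal approximation ignores the heavier tails of the t distribution, exact calculations typically add one or two observations per group.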
Choosing Effect Sizes
The effect size is the most difficult and most important input to a power analysis. Three approaches:
• Pilot data: Estimate from a small preliminary study. Be cautious — pilot studies typically overestimate effects.
• Literature review: Use effect sizes reported in similar published studies. Prefer meta-analyses over individual studies.
• Clinical significance: Define the smallest effect that would be practically meaningful. This is often the best approach for clinical trials.
Cohen’s conventions (small = 0.2, medium = 0.5, large = 0.8) should be used only as a last resort. A “medium” effect in one field may be unrealistically large in another. When in doubt, power for a smaller effect — the cost of over-sampling is usually less than the cost of an inconclusive study.
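The quadratic dependence of n on the effect size is worth seeing concretely: since d appears squared in the denominator, halving the target effect roughly quadruples the required sample size. A short sketch under the same normal approximation as above (n_per_group is an illustrative name):

```python
from math import ceil
from statistics import NormalDist

def n_per_group(d, alpha=0.05, power=0.80):
    # Normal-approximation sample size: n = 2 * ((z_{a/2} + z_b) / d)^2
    z = NormalDist()
    return ceil(2 * ((z.inv_cdf(1 - alpha / 2) + z.inv_cdf(power)) / d) ** 2)

for d in (0.8, 0.5, 0.2):
    print(f"d = {d}: about {n_per_group(d)} per group")
# d = 0.8 → 25, d = 0.5 → 63, d = 0.2 → 393
```

This is why powering for a smaller-than-expected effect is cheap insurance against a mildly optimistic estimate but expensive against a very optimistic one.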