Chapter 2 Sample Size Determination
Chapter Objectives
The objective of this chapter is to provide a practical, defensible, and regulator-ready framework for sample size determination in clinical trials.
Upon completing this chapter, the reader should be able to:
- Understand sample size as a design decision rather than a mechanical calculation
- Justify assumptions used in sample size determination
- Evaluate effect size sources critically
- Account for dropout and multiplicity in planning
- Produce clear, auditable outputs for Protocols and SAPs
2.1 The Role of Sample Size in Clinical Trial Design
2.1.1 Sample Size Is Designed, Not Calculated
In practice, sample size is not a purely mathematical result.
It is the consequence of multiple design assumptions:
Sample Size = Statistical Hypothesis × Effect Size × Variability × Operational Assumptions
The role of the statistician is not to apply formulas blindly, but to ensure that:
- The study question is answerable with the proposed sample size
- Assumptions are scientifically reasonable
- The design can withstand regulatory scrutiny
2.1.2 Two Dangerous Extremes
Statisticians should actively avoid:
- Over-optimism: assuming an unrealistically large effect size, leading to underpowered studies
- Over-conservatism: inflating sample size beyond operational feasibility
The true value of statistical leadership lies in balancing scientific rigor, feasibility, and risk.
2.2 Sample Size Calculation Framework
2.2.1 Core Components
Regardless of endpoint type, all sample size calculations must clearly define:
| Component | Description |
|---|---|
| Primary endpoint | Sample size must be based on the primary endpoint only |
| Statistical hypothesis | Null and alternative hypotheses |
| Significance level | Type I error rate (α) |
| Power | Probability of detecting the assumed effect |
| Effect size | Mean difference, rate difference, or hazard ratio |
| Variability | Standard deviation or event rate assumptions |
2.2.2 Endpoint-Specific Considerations
2.2.2.1 Continuous Endpoints
- Based on mean differences
- Requires standard deviation assumptions
- Highly sensitive to variance misspecification
2.2.2.2 Binary Endpoints
- Based on response rates or risk differences
- Requires control group event rate assumptions
- Low event rates rapidly increase sample size
2.3 Effect Size Justification
2.3.1 Why Effect Size Is the Highest-Risk Assumption
Effect size assumptions directly determine:
- Required sample size
- Probability of trial success
- Interpretability of results
An unrealistic effect size almost guarantees trial failure.
2.4 Dropout and Attrition Assumptions
2.4.1 Why Dropout Matters
Sample size calculations yield the number of evaluable subjects required for primary analysis, not the number to be randomized.
Dropout assumptions bridge this gap.
2.5 Impact of Multiplicity on Sample Size
2.5.1 Multiplicity Is Not Only an Analysis Issue
Sample size may be affected when there are:
- Multiple primary endpoints
- Multiple dose comparisons
- Multiple formal hypotheses