Module 4, Part 4: Response Surface Methodology

🎯 Learning Objectives

By the end of this section, you will be able to:

Select appropriate RSM designs (Central Composite vs Box-Behnken) for pharmaceutical optimization
Fit quadratic polynomial models to experimental data using step-by-step reasoning
Create and interpret contour plots and 3D response surfaces for formulation optimization
Apply desirability functions to optimize multiple pharmaceutical responses simultaneously
Validate optimization results through confirmation experiments

🔬 Section 4.1: RSM Design Selection (15 minutes)

Why Move Beyond Factorial Designs?

🤔 Let's Think Step-by-Step: When Do We Need RSM?

Step 1: Understanding the Limitation
Factorial designs (2^k) can only model linear relationships and interactions. But in pharmaceutical formulation, we often see:

📈 Curvature: Drug release may peak at medium polymer concentrations
🎯 Optimum regions: Best tablet hardness occurs within a narrow range
⚖️ Non-linear behavior: Dissolution rate changes exponentially with surface area

Step 2: The RSM Solution
Response Surface Methodology uses second-order polynomial models to capture these curved relationships.

Second-Order RSM Model:
Y = b₀ + b₁X₁ + b₂X₂ + b₁₁X₁² + b₂₂X₂² + b₁₂X₁X₂ + ε

Where the X² terms capture curvature, and X₁X₂ terms capture interactions.

Central Composite Designs (CCD)

🔧 Let's Build a CCD Step-by-Step

Step 1: Start with a 2^k Factorial Core
For 2 factors, we begin with our familiar 2² = 4 runs at the corners.

Step 2: Add Center Points
Multiple runs at (0,0) to estimate pure error and improve central prediction.

Step 3: Add Star Points
Axial points at (±α, 0) and (0, ±α) to estimate curvature.

Total CCD Runs = 2^k + 2k + n_center
For k=2: 4 + 4 + 5 = 13 runs

CCD Design Matrix Example (2 factors)

Run	X₁ (Drug Loading %)	X₂ (Polymer %)	Point Type	Response (% Released)
1	-1	-1	Factorial	[Measured]
2	+1	-1	Factorial	[Measured]
3	-1	+1	Factorial	[Measured]
4	+1	+1	Factorial	[Measured]
5	-1.414	0	Star	[Measured]
6	+1.414	0	Star	[Measured]
7	0	-1.414	Star	[Measured]
8	0	+1.414	Star	[Measured]
9-13	0	0	Center	[Measured]

🎯 Understanding Alpha (α) Values

Let's think about what α controls:

Rotatable Design (α = (2^k)^(1/4)):
For 2 factors: α = (2²)^(1/4) = 4^0.25 = 1.414

✅ Advantage: Prediction variance is constant at all points equidistant from center
⚠️ Consideration: Star points are outside the factorial space

Face-Centered Design (α = 1):

✅ Advantage: All points stay within the factorial cube (safer for extreme conditions)
⚠️ Consideration: Slightly less efficient for curvature estimation

💊 Pharmaceutical Example: Extended-Release Tablet Optimization

Scenario: Optimizing an extended-release tablet formulation

Factors:

X₁: HPMC concentration (2-8%)
X₂: Drug loading (10-30%)

Response: % Drug released at 12 hours

Why use CCD? We suspect there's an optimal HPMC level - too little gives burst release, too much prevents adequate release. The quadratic terms will capture this curvature.

Design choice: Face-centered CCD (α=1) because testing beyond 8% HPMC or 30% drug loading might create tablets that are too hard to manufacture.

Box-Behnken Designs (BBD)

🔄 Let's Understand Box-Behnken Step-by-Step

Step 1: The Spherical Concept
BBD places points on the edges of the design space, forming a sphere-like pattern.

Step 2: Three-Level Structure
Each factor has exactly 3 levels: -1, 0, +1 (low, medium, high).

Step 3: Edge-Center Strategy
Points are placed at the midpoints of edges, never at extreme corners.

BBD Runs for k factors = 2k(k-1) + n_center
For k=3: 2×3×2 + 3 = 15 runs

Box-Behnken Design Matrix (3 factors)

Run	X₁ (Binder %)	X₂ (Disintegrant %)	X₃ (Lubricant %)	Hardness (kp)
1	-1	-1	0	[Measured]
2	+1	-1	0	[Measured]
3	-1	+1	0	[Measured]
4	+1	+1	0	[Measured]
5	-1	0	-1	[Measured]
6	+1	0	-1	[Measured]
7	-1	0	+1	[Measured]
8	+1	0	+1	[Measured]
9	0	-1	-1	[Measured]
10	0	+1	-1	[Measured]
11	0	-1	+1	[Measured]
12	0	+1	+1	[Measured]
13-15	0	0	0	[Measured]

⚖️ CCD vs BBD: How to Choose?

Choose CCD when:

✅ You can safely test beyond the factorial limits
✅ You want maximum flexibility in model fitting
✅ Rotatability is important for your application

Choose BBD when:

✅ Extreme combinations are risky or impossible
✅ You want fewer total runs
✅ Three-level factors are natural for your process

💻 Excel Implementation:
• Use Data Analysis ToolPak → "Regression" for model fitting
• Create design matrices using formulas: =IF(A2="Star", SQRT(2), IF(A2="Factorial", 1, 0))
• Generate center points with =RAND() for randomization

📊 Section 4.2: Response Surface Analysis & Visualization (20 minutes)

Quadratic Model Fitting

🔢 Let's Fit a Quadratic Model Step-by-Step

Step 1: Understand the Model Structure
For 2 factors, our complete model is:

Y = b₀ + b₁X₁ + b₂X₂ + b₁₁X₁² + b₂₂X₂² + b₁₂X₁X₂ + ε

This gives us 6 coefficients to estimate:

b₀: Intercept (response at center point)
b₁, b₂: Linear effects of X₁ and X₂
b₁₁, b₂₂: Quadratic effects (curvature)
b₁₂: Interaction effect

Step 2: Prepare the Design Matrix
We need to create columns for X₁, X₂, X₁², X₂², and X₁X₂.

Step 3: Apply Multiple Linear Regression
Use Excel's LINEST function or Data Analysis ToolPak.

💊 Worked Example: Dissolution Optimization

Scenario: Optimizing immediate-release tablet dissolution

Factors (coded):

X₁: Disintegrant concentration (-1 = 2%, +1 = 6%)
X₂: Compression force (-1 = 5 kN, +1 = 15 kN)

Response: % Dissolved at 30 minutes

CCD Results (Simplified Dataset)

X₁	X₂	X₁²	X₂²	X₁X₂	% Dissolved
-1	-1	1	1	1	45
+1	-1	1	1	-1	75
-1	+1	1	1	-1	35
+1	+1	1	1	1	55
0	0	0	0	0	85
...additional star and center points...

📊 Let's Interpret the Results Step-by-Step

Fitted Model (hypothetical results):

% Dissolved = 85 + 10X₁ - 15X₂ - 20X₁² - 5X₂² + 8X₁X₂

Step 1: Interpret Each Coefficient

b₀ = 85: At center conditions (4% disintegrant, 10 kN), we expect 85% dissolution
b₁ = +10: Increasing disintegrant improves dissolution (positive linear effect)
b₂ = -15: Increasing compression force reduces dissolution (negative linear effect)
b₁₁ = -20: Negative quadratic - dissolution peaks at moderate disintegrant levels
b₂₂ = -5: Slight curvature in compression force effect
b₁₂ = +8: Positive interaction - disintegrant is more effective at lower compression forces

Step 2: Identify the Key Insights

🎯 Optimum exists: Negative quadratic terms suggest a maximum
⚖️ Balance required: Need enough disintegrant but not too much compression
🔄 Interaction present: Effects depend on each other

💻 Excel Implementation for Model Fitting:
=LINEST(Y_range, X_matrix_range, TRUE, TRUE)
Where X_matrix includes columns for X₁, X₂, X₁², X₂², X₁X₂
Use Ctrl+Shift+Enter for array formula

Surface Visualization Tools

🎮 Interactive Response Surface Demo

Adjust the model coefficients to see how they affect the response surface:

Linear Effect (b₁): 10

Quadratic Effect (b₁₁): -20

Response Surface

Optimum Region

Current Model:
Y = 85 + 10X₁ - 15X₂ - 20X₁² - 5X₂² + 8X₁X₂

🗺️ Let's Create Contour Plots Step-by-Step

Step 1: Generate a Grid of Predictions
Create X₁ and X₂ values from -2 to +2 in 0.2 increments (21×21 = 441 points).

Step 2: Calculate Predicted Responses
For each (X₁, X₂) combination, apply your fitted model equation.

Step 3: Create the Contour Plot
Lines connect points with equal predicted response values.

Interpretation Guide:
• Concentric circles/ellipses = Maximum or minimum
• Saddle shape = Stationary point (not optimum)
• Parallel lines = Ridge or valley

💊 Reading Contour Plots for Formulation Development

What to Look For:

🎯 Sweet Spots: Regions where multiple responses meet targets
⚠️ Flat Regions: Areas where changes have minimal impact (robust formulation)
🔥 Steep Gradients: Areas requiring tight process control
🚫 Constraint Boundaries: Physical or regulatory limits

Pharmaceutical Decision Framework:

Identify feasible region (all specifications met)
Look for robust areas (flat contours)
Consider manufacturing constraints
Select operating point with safety margin

🎯 Section 4.3: Optimization & The Desirability Function (10 minutes)

Defining Optimization Goals

🎯 Let's Set Up Multi-Response Optimization Step-by-Step

Step 1: Define Individual Response Goals
For each response, specify your objective:

Maximize: Dissolution rate, bioavailability
Minimize: Friability, cost, variability
Target: Specific hardness, pH, potency
Range: Within USP limits

Step 2: Set Acceptable Limits
Define the minimum acceptable, target, and maximum acceptable values.

Step 3: Assign Relative Importance
Weight each response based on criticality (equal weights = 1.0 each).

💊 Multi-Response Optimization Example

Formulation Goals:

Response	Goal	Lower Limit	Target	Upper Limit	Weight
Dissolution (%)	Maximize	80	95	-	2.0
Hardness (kp)	Target	8	10	12	1.5
Friability (%)	Minimize	-	0.2	1.0	1.0
Disintegration (min)	Minimize	-	2	5	1.0

The Desirability Function Approach

📊 Let's Calculate Desirability Step-by-Step

Step 1: Transform Each Response to Desirability (0-1 scale)

For "Maximize" Goals:

d_i = 0 if Y_i ≤ Lower
d_i = ((Y_i - Lower) / (Target - Lower))^r if Lower < Y_i < Target
d_i = 1 if Y_i ≥ Target

For "Minimize" Goals:

d_i = 1 if Y_i ≤ Target
d_i = ((Upper - Y_i) / (Upper - Target))^r if Target < Y_i < Upper
d_i = 0 if Y_i ≥ Upper

For "Target" Goals:

d_i = ((Y_i - Lower) / (Target - Lower))^r if Lower ≤ Y_i ≤ Target
d_i = ((Upper - Y_i) / (Upper - Target))^s if Target ≤ Y_i ≤ Upper
d_i = 0 otherwise

Where r and s are shape parameters (r=s=1 for linear, r>1 for emphasis on target).

Step 2: Calculate Overall Desirability

D = (d₁^w₁ × d₂^w₂ × d₃^w₃ × d₄^w₄)^(1/(w₁+w₂+w₃+w₄))

This is the weighted geometric mean of individual desirabilities.

💊 Worked Desirability Calculation

Candidate Formulation Results:

Dissolution: 90%
Hardness: 9.5 kp
Friability: 0.4%
Disintegration: 3 minutes

📊 Step-by-Step Desirability Calculation

Step 1: Calculate Individual Desirabilities

d₁ (Dissolution - Maximize):
90% is between 80% (lower) and 95% (target)
d₁ = (90-80)/(95-80) = 10/15 = 0.67

d₂ (Hardness - Target 10 kp):
9.5 is between 8 (lower) and 10 (target)
d₂ = (9.5-8)/(10-8) = 1.5/2 = 0.75

d₃ (Friability - Minimize):
0.4% is between 0.2% (target) and 1.0% (upper)
d₃ = (1.0-0.4)/(1.0-0.2) = 0.6/0.8 = 0.75

d₄ (Disintegration - Minimize):
3 min is between 2 (target) and 5 (upper)
d₄ = (5-3)/(5-2) = 2/3 = 0.67

Step 2: Calculate Overall Desirability
D = (0.67² × 0.75^1.5 × 0.75¹ × 0.67¹)^(1/(2+1.5+1+1))
D = (0.45 × 0.65 × 0.75 × 0.67)^(1/5.5)
D = (0.147)^0.182 = 0.70

Interpretation: This formulation achieves 70% of the ideal desirability. Look for combinations that give D > 0.8 for excellent formulations.

💻 Excel Desirability Functions:
=IF(Y<=Lower,0,IF(Y>=Target,1,(Y-Lower)/(Target-Lower))) for maximize
=IF(Y<=Target,1,IF(Y>=Upper,0,(Upper-Y)/(Upper-Target))) for minimize
=GEOMEAN(d1^w1, d2^w2, d3^w3, d4^w4)^(1/SUM(weights)) for overall D

Optimization Strategy

🔍 Let's Find the Optimum Step-by-Step

Step 1: Generate Candidate Points
Create a grid of factor combinations across the design space.

Step 2: Predict All Responses
Use your fitted models to predict each response at every candidate point.

Step 3: Calculate Desirability at Each Point
Apply the desirability function to convert predictions to D values.

Step 4: Identify Maximum Desirability
Find the factor combination that gives the highest D value.

Step 5: Validate with Confirmation Experiments
Run experiments at the predicted optimum to verify model accuracy.

⚠️ Critical Validation Step:
Always run confirmation experiments at your predicted optimum! Models are approximations - real experiments verify that your optimization actually works in practice.

💊 Optimization Success Story

Case Study: Immediate-release tablet optimization

Challenge: Balance dissolution, hardness, and manufacturing efficiency

RSM Results:

Predicted Optimum: 4.2% disintegrant, 8.5 kN compression
Predicted Desirability: D = 0.89

Confirmation Experiments (n=3):

Dissolution: 94.2% ± 1.1% (predicted: 93.8%)
Hardness: 9.8 ± 0.3 kp (predicted: 10.1 kp)
Friability: 0.25% ± 0.03% (predicted: 0.22%)

Outcome: ✅ Model predictions within 2% of actual results. Formulation moved to scale-up with confidence!

📋 Section Summary & Key Takeaways

🎯 What We've Learned

Response Surface Methodology gives us the power to:

📈 Capture curvature in pharmaceutical responses using quadratic models
🎯 Find true optima rather than just "better" conditions
⚖️ Balance multiple objectives using desirability functions
🗺️ Visualize design spaces through contour plots and 3D surfaces
✅ Make confident decisions backed by statistical validation

💊 When to Apply RSM in Pharmaceutical Development

Formulation optimization: Finding optimal ingredient levels
Process optimization: Optimizing manufacturing parameters
Stability studies: Understanding degradation kinetics
Quality by Design: Defining design spaces for regulatory submission
Method development: Optimizing analytical procedures

🚨 Common RSM Pitfalls to Avoid:
• Don't extrapolate beyond your design space
• Always check model adequacy (residuals, R²)
• Validate predictions with confirmation experiments
• Consider practical constraints in optimization
• Remember that correlation ≠ causation

📈 Response Surface Methodology (RSM)