Two Sample Proportions Test – Effect of Sample Size

Amelie has developed a new drug against the flu and wanted to test its effectiveness. She recruited 50 people and divided them into two groups. She gave the medicine to the first group (34 people) and provided a placebo to the second (16). Here are the results.

In the treatment group, 15 out of 34 (44%) recovered in 1 day.
In the control group, 4 out of 16 (25%) recovered in 1 day.

Amelie thinks her medicine is effective because of the big defence (44 – 25 = 19%) in performance and the larger sample size for the treatment group. Do you agree?

We must do a statistical test to conclude. The data is:

SampleEventsTrials
Treatment1534
Control416

The Null Hypothesis, H0: No impact of medicine; recovery proportion on drug equals that of placebo.
The Alternative Hypothesis, H1: Medicine improves the condition; recovery proportion on drug greater than placebo.

We use the ‘prop.test()’ function in R.

prop.test(x = c(15, 4), n = c(34, 16), alternative = "greater")

We used the one-tailed test to determine whether the treatment has improved the condition compared to the control.


	2-sample test for equality of proportions with continuity correction

data:  c(15, 4) out of c(34, 16)
X-squared = 0.97389, df = 1, p-value = 0.1619
alternative hypothesis: greater
95 percent confidence interval:
 -0.0813273  1.0000000
sample estimates:
   prop 1    prop 2 
0.4411765 0.2500000 

The p-value of 0.1619 (more than the significance value of 0.05) is not enough to reject the null hypothesis.

What would have happened if she had recruited more people in the placebo group and found results at the same proportions?

SampleEventsTrials
Treatment1534
Control1664
prop.test(x = c(15, 16), n = c(34, 64), alternative = "greater")
	2-sample test for equality of proportions with continuity correction

data:  c(15, 16) out of c(34, 64)
X-squared = 2.9205, df = 1, p-value = 0.04373
alternative hypothesis: greater
95 percent confidence interval:
 0.002691967 1.000000000
sample estimates:
   prop 1    prop 2 
0.4411765 0.2500000 

Here, the results are significant (p-value = 0.04373 < 0.05) to reject the null hypothesis in favour of the alternate (that the drug works).

What about recruiting more to the treatment group?

SampleEventsTrials
Treatment150340
Control416
prop.test(x = c(150, 4), n = c(340, 16), alternative = "greater")
	2-sample test for equality of proportions with continuity correction

data:  c(150, 4) out of c(340, 16)
X-squared = 1.5631, df = 1, p-value = 0.1056
alternative hypothesis: greater
95 percent confidence interval:
 -0.02503096  1.00000000
sample estimates:
   prop 1    prop 2 
0.4411765 0.2500000 

p-value = 0.1056; the null hypothesis stays!

Or, had many individuals in both groups?

SampleEventsTrials
Treatment150340
Control40160
prop.test(x = c(150, 40), n = c(340, 160), alternative = "greater")
	2-sample test for equality of proportions with continuity correction

data:  c(150, 40) out of c(340, 160)
X-squared = 16.076, df = 1, p-value = 3.042e-05
alternative hypothesis: greater
95 percent confidence interval:
 0.1149402 1.0000000
sample estimates:
   prop 1    prop 2 
0.4411765 0.2500000 

Both the groups have plenty of samples, and now the difference (44 – 25 = 22%) overwhelmingly supports the impact of the medicine.