![]() |
![]() |
|
|
How Many Animals Do I Need? Size of animal experiments is a delicate balance. Use too many animals and you are wasting resources and needlessly exposing animals to potential harm. If too few are used, experimental results will not be clear cut, again wasting animal resources unless the experiment can be enlarged by collecting more data. Size of an experiment involves five quantities:
Typically scientists will choose N to give an 80% chance of detecting the difference D (if it truly exists) with no more than a 5% chance of error, assuming V variability. Note this statement uses all five quantities. Experiment size will be smaller if:
These considerations show that controlling variance is the best option for reducing sample size. Festing also mentions experimental approaches that use fewer animals than the traditional comparison of groups of animals on different treatments. These five quantities are connected by complex formulas that change depending on the type of data (continuous, binary, etc.) and the type of question (differences in means is just one of many). A very rough approximation can be obtained from N = 25*V/ (D*D). Suppose a researcher wants to detect a difference in mouse body weight, and anticipates the control group will weigh 40g, the treatment group will weigh 50g (D=50-40), and the CV will be 20%. CV, the coefficient of variation, is std. deviation (s) divided by the mean, so s = (20%)*40g = 8g. V is simply s squared, giving N = 25*8*8/(10*10) = 16 animals per treatment.
For more accuracy, however, use of a sample size computer program is recommended. Web-based versions are convenient, and use of http://www.stat.uiowa.edu/~rlenth/Power/index.html is now illustrated.
A web search for "power sample size" will provide many other calculators. In a complex experiment, with many sub-experiments and treatments, does sample size need to be calculated for every combination? The calculations above theoretically only need to be done once for the worst-case scenario, where variability is highest and treatment difference is lowest. But this would produce excessive use of animals in some treatments, so a design that allows unequal samples sizes for different treatments might be considered. Then sample size calculations would have to be repeated for each unequal sample size allowed. If all sub-experiments are connected through the use of common animals or tissues, then only the "weakest-link" needs to be considered. If stage 3 of the experiment needs tissue from 10 animals, obviously stage 1 and 2 that lead to stage 3 will need 10 animals, even if sample size calculations suggest 4 animals are sufficient in stage 1 and 2. Again, identify the situation that has smallest treatment difference and largest variance, and that will dictate sample size of the experiment. As a final example, suppose researchers intend to use Fisher's Exact Test to compare two percentages. They want to be 90% sure of detecting a true difference between percentages of 70% and 80%, at the 5% significance level. Go to http://calculators.stat.ucla.edu/powercalc/ and choose Fisher's, and "Sample size for a given power." Fill in the form with 0.70, 0.80, 2 sided test, 0.05 and 0.90 power, and the sample size required is about 400 animals. Percentage data generally require large experiments.
|
|||||||
UT
Research · UT
Health Center · UT
Institute of Agriculture · UT
Medical Center · UT
Space Institute
|
|
| Copyright ©2003-2007 The University of Tennessee · Institutional Animal Care and Use Committee · 2431 Joe Johnson Drive · Knoxville Tennessee 37996-4564 · Telephone 865-976-1020 · Fax 865-974-9462 · cgreenac@utk.edu · Web site comments: etipton@utk.edu · Disclaimer | |