This means that large sample sizes are needed to detect differences in mean developmental test scores between control and intervention groups, particularly in studies that involve heterogeneous groups of infants, or when variance is increased in multicenter settings.