Compute the effect size for t-test. T-test conventional effect sizes, poposed by Cohen, are: 0.2 (small effect), 0.5 (moderate effect) and 0.8 (large effect).

Cohen's d is calculated as the difference between means or mean minus mu divided by the estimated standardized deviation.

For independent samples t-test, there are two possibilities implemented. If the t-test did not make a homogeneity of variance assumption, (the Welch test), the variance term will mirror the Welch test, otherwise a pooled estimate is used.

If a paired samples t-test was requested, then effect size desired is based on the standard deviation of the differences.

It can also returns confidence intervals by bootstap.

cohens_d(data, formula, comparisons = NULL, ref.group = NULL,
paired = FALSE, mu = 0, var.equal = FALSE, ci = FALSE,
conf.level = 0.95, ci.type = "perc", nboot = 1000)

## Arguments

data a data.frame containing the variables in the formula. a formula of the form x ~ group where x is a numeric variable giving the data values and group is a factor with one or multiple levels giving the corresponding groups. For example, formula = TP53 ~ cancer_group. A list of length-2 vectors specifying the groups of interest to be compared. For example to compare groups "A" vs "B" and "B" vs "C", the argument is as follow: comparisons = list(c("A", "B"), c("B", "C")) a character string specifying the reference group. If specified, for a given grouping variable, each of the group levels will be compared to the reference group (i.e. control group). If ref.group = "all", pairwise two sample tests are performed for comparing each grouping variable levels against all (i.e. basemean). a logical indicating whether you want a paired test. theoretical mean, use for one-sample t-test. Default is 0. a logical variable indicating whether to treat the two variances as being equal. If TRUE then the pooled variance is used to estimate the variance otherwise the Welch (or Satterthwaite) approximation to the degrees of freedom is used. If TRUE, returns confidence intervals by bootstrap. May be slow. The level for the confidence interval. The type of confidence interval to use. Can be any of "norm", "basic", "perc", or "bca". Passed to boot::boot.ci. The number of replications to use for bootstrap.

## Value

return a data frame with some of the following columns:

• .y.: the y variable used in the test.

• group1,group2: the compared groups in the pairwise tests.

• n,n1,n2: Sample counts.

• effsize: estimate of the effect size (d value).

• magnitude: magnitude of effect size.

• conf.low,conf.high: lower and upper bound of the effect size confidence interval.

## Details

Quantification of the effect size magnitude is performed using the thresholds defined in Cohen (1992). The magnitude is assessed using the thresholds provided in (Cohen 1992), i.e. |d| < 0.2 "negligible", |d| < 0.5 "small", |d| < 0.8 "medium", otherwise "large".

## References

• Cohen, J. (1988). Statistical power analysis for the behavioral sciences (2nd ed.). New York:Academic Press.

• Cohen, J. (1992). A power primer. Psychological Bulletin, 112, 155-159.

## Examples

# One-sample t test effect size
ToothGrowth %>% cohens_d(len ~ 1, mu = 0)#> # A tibble: 1 x 6
#>   .y.   group1 group2     effsize     n magnitude
#> * <chr> <chr>  <chr>        <dbl> <int> <ord>
#> 1 len   1      null model    2.46    60 large
# Two indepedent samples t-test effect size
ToothGrowth %>% cohens_d(len ~ supp, var.equal = TRUE)#> # A tibble: 1 x 7
#>   .y.   group1 group2 effsize    n1    n2 magnitude
#> * <chr> <chr>  <chr>    <dbl> <int> <int> <ord>
#> 1 len   OJ     VC       0.495    30    30 small
# Paired samples effect size
df <- data.frame(
id = 1:5,
pre  = c(110, 122, 101, 120, 140),
post = c(150, 160, 110, 140, 155)
)
df <- df %>% gather(key = "treatment", value = "value", -id)
#> 1 value post   pre       1.75     5     5 large