Di Rienzo, Guzman and Casanoves (DGC) test for multiple comparisons. Implements a cluster-based method for identifying groups of nonhomogeneous means. Average linkage clustering is applied to a distance matrix obtained from the sample means. The distribution of \(Q\) (distance between the source and the root node of the tree) is used to build a test with a significance level of \(\alpha\). Groups whose means join above \(c\) (the \(\alpha\)-level cut-off criterion) are statistically different.
Arguments
- y
Either a model (created with
lm()
oraov()
) or a numerical vector with the values of the response variable for each unit.- trt
If
y
is a model, a string with the name of the column containing the treatments. Ify
is a vector, a vector of the same length asy
with the treatments for each unit.- alpha
Value equivalent to 0.05 or 0.01, corresponding to the significance level of the test. The default value is 0.05.
- show_plot
Logical value indicating whether the constructed dendrogram should be plotted or not.
- console
Logical value indicating whether the results should be printed on the console or not.
- abline_options
list
with optional arguments for the line in the dendrogram.- ...
Optional arguments for the
plot()
function.
Value
A list with three data.frame
and one hclust
:
- stats
data.frame
containing summary statistics by treatment.- groups
data.frame
indicating the group to which each treatment is assigned.- parameters
data.frame
with the values used for the test.treatments
is the total number of treatments,alpha
is the significance level used,c
is the cut-off criterion for the dendrogram (the height of the horizontal line on the dendrogram),q
is the \(1 - \alpha\) quantile of the distribution of \(Q\) (distance from the root node) under the null hypothesis andSEM
is an estimate of the standard error of the mean.- dendrogram_data
object of class
hclust
with data used to build the dendrogram.
References
Di Rienzo, J. A., Guzman, A. W., & Casanoves, F. (2002). A Multiple-Comparisons Method Based on the Distribution of the Root Node Distance of a Binary Tree. Journal of Agricultural, Biological, and Environmental Statistics, 7(2), 129-142. <jstor.org/stable/1400690>
Examples
data("PlantGrowth")
# Using vectors -------------------------------------------------------
weights <- PlantGrowth$weight
treatments <- PlantGrowth$group
dgc_test(y = weights, trt = treatments, show_plot = FALSE)
#> group
#> ctrl 1
#> trt1 1
#> trt2 2
#> Treatments within the same group are not significantly different
# Using a model -------------------------------------------------------
model <- lm(weights ~ treatments)
dgc_test(y = model, trt = "treatments", show_plot = FALSE)
#> group
#> ctrl 1
#> trt1 1
#> trt2 2
#> Treatments within the same group are not significantly different