-
Chapter 1: Introduction
-
Chapter 2: Background
-
Chapter 3: Administration and Scoring
-
Chapter 4: Interpretation
-
Chapter 5: Case Studies
-
Chapter 6: Development
-
Chapter 7: Standardization
-
Chapter 8: Reliability
-
Chapter 9: Validity
-
Chapter 10: Fairness
-
Chapter 11: CAARS 2–Short
-
Chapter 12: CAARS 2–ADHD Index
-
Chapter 13: Translations
-
Appendices
CAARS 2 ManualChapter 9: Key Findings |
Key Findings |
Internal Structure. Results from confirmatory factor analyses (CFA) provided empirical evidence to support the theoretical structure of the CAARS 2 scales.
-
Content Scales. For Self-Report and Observer, after testing various models, a 5-factor model performed
best
(1. Inattention/Executive Dysfunction, 2. Hyperactivity, 3. Impulsivity, 4. Emotional Dysregulation, 5. Negative
Self-Concept). This 5-factor model had strong fit statistics (across both rater forms, CFI and TLI ≥ .943; SRMR
≤ .042; RMSEA ≤ .047), and all factor loadings were positive, statistically significant, and above a minimum
threshold of .40. The five identified factors were all distinct from each other.
Relation to Conceptually Related Constructs. Evidence to support the convergent validity of the CAARS 2 was supported by moderate to strong correlations between the CAARS 2 and established tests of related constructs.
- CAARS 2 and CAARS (median r): Self-Report = .76, Observer = .83
- CAARS 2 and CEFI Adult (median |r|): Self-Report = .76, Observer = .74
-
CAARS 2 and WFIRS (median r): Self-Report = .48
Relation to Criterion Variables. The CAARS 2 demonstrated a high degree of criterion-related validity as various clinical groups had distinctly different profiles of scores.
-
Meaningful differences between average scores were found between clinical groups, such that ratings of
individuals with ADHD yielded higher scores than ratings of individuals from the General Population
(median
Cohen’s d for significant differences: Self-Report = 2.24 and Observer = 1.29) as well as higher scores
than
ratings of individuals with Depression and/or Anxiety (median Cohen’s d for significant differences:
Self-Report
= 1.26 and Observer = 1.06).
-
Scores on the CAARS 2 differentiated between ADHD Inattentive and ADHD Combined presentations
(e.g., scores on
scales related to hyperactivity and impulsivity tended to be higher in the ADHD Combined groups than in the ADHD
Inattentive group; Self-Report median Cohen’s d = 0.93 and Observer median Cohen’s d = 0.47).
Classification Accuracy. Scores from the CAARS 2 were able to correctly classify individuals from the General Population and those from clinical samples into their respective groups.
-
Moderate to high levels of classification accuracy were demonstrated when distinguishing between individuals
from the General Population and those diagnosed with ADHD (all presentations). Across all analyses,
overall
correct classification rates ranged from 89.7% to 92.5% for Self-Report and 84.7% to 91.7% for Observer.
Interpreting Correlations and Effect Sizes
Throughout this chapter, common statistical methods are used to report results, such as correlation coefficients and effect sizes. In addition to tests of statistical significance, correlations and effect sizes help communicate the magnitude of an observed effect. Correlation coefficients provide us with a statistical measure of the degree of association between two variables. The correlation coefficients presented in this chapter are Pearson’s correlations, ranging from -1 to 1, with higher values indicating greater consistency or agreement between ratings. Although there are several approaches to interpretation, correlation coefficients in this manual are categorized as follows: absolute values lower than .20 are classified as very weak; values of .20 to .39 are weak; values of .40 to .59 are moderate; values of .60 to .79 are strong; and absolute values greater than or equal to .80 are very strong (Evans, 1996).
Effect sizes are another common statistical method used to report results, communicating the magnitude of an observed effect, throughout this chapter. A variety of effect sizes are presented, including Cohen’s d, Cliff’s d, and eta-squared (η2; see Cohen, 1973). Cohen’s d absolute values are measures of the size of the standardized difference between groups and are typically quantified as negligible effects if they are less than .20, small effects if they are .20 to .49, medium effects if they are .50 to .79, and large effects if they are greater than or equal to .80 (Cohen, 1973). Guidelines for interpreting Cliff’s d absolute values (non-parametric effect size metric values) are often quantified as negligible effects if they are less than .15, a small effect if they are .15 to .32, medium effects if they are .33 to .44, and large effects if they are greater than or equal to .45 (Romano et al., 2006). Effect sizes as measured by eta-squared (η2; see Cohen, 1973), a statistic that communicates the percent of variance explained, are also provided. Commonly used guidelines for interpreting η2 include negligible effect sizes for values less than .01, small effect sizes for values of .01 to .05, medium effect sizes for values of .06 to .13, and large effect size for values greater than or equal to .14. Additional statistical methods and guidelines for interpretation are explained within the relevant sections of this chapter.
< Back | Next > |