Practical Meta-Analysis David B. Wilson Evaluators’ Institute July 16-17, 2010 Practical Meta-Analysis -- D.
Download ReportTranscript Practical Meta-Analysis David B. Wilson Evaluators’ Institute July 16-17, 2010 Practical Meta-Analysis -- D.
Practical Meta-Analysis David B. Wilson Evaluators’ Institute July 16-17, 2010 Practical Meta-Analysis -- D. B. Wilson 1 Overview of the Workshop Topics covered will include Review of the basic methods Problem definition Document Retrieval Coding Effect sizes and computation Analysis of effect sizes Publication Bias Cutting edge issues Interpretation of results Evaluating the quality of a meta-analysis Practical MetaAnalysis -- D. B. Wilson 2 Forest Plot from a Meta-Analysis of Correctional Boot-Camps Fa vo rs C om pa rison Fa vo rs B oo tca m p H a rer &K lein-S a ffran ,1 99 6 Jon es &R o ss, 1 997 Fl. D e pt. of JJ (S tu art C o.), 199 7 Fl. D e pt. of JJ (P olk C o., B oys), 19 97 Jon es (FY 9 7), 1 99 8 Jon es (FY 9 4-95 ), 1 99 8 M acke nzie&S o uryal (Illin ois), 1 99 4 M acke nzie&S o uryal (Lo uisia na ), 1 994 Jon es (FY 9 1-93 ), 1 99 8 M acke nzie&S o uryal (Flo rid a), 1 99 4 Jon es (FY 9 6), 1 99 8 M arcus-M end oza (M e n), 1 99 5 M acke nzie ,e t al. 1 99 7 P e nn. D ep t. o fC o rrectio ns, 2 001 Flow e rs, C arr, &R ub ack 199 1 B u reauo fD ata a ndR esea rch , 19 96 M acke nzie&S o uryal (O klaho m a ), 1 99 4 T3A sso ciate s, 20 00 M acke nzie&S o uryal (N ewY o rk), 19 94 P e ters, 1 99 6b C a m p &S a ndh u, 199 5 M acke nzie&S o uryal (S .C ., N ew ), 1 99 4 Jon es, 1 996 N YD C S(88 -96R ele ases), 2 00 0 M arcus-M end oza (W om en ), 1 995 A u stin, Jo ne s, &B olya rd, 199 3 B u rns &V ito, 199 5 P e ters, 1 99 6a Fl. D e pt. of JJ (B ay C o .), 1 99 7 N YD C S(96 -97R ele ases), 2 00 0 N YD C S(97 -98R ele ases), 2 00 0 Fl. D e pt. of JJ (P in ella sC o .), 1 99 6 Fl. D e pt. of JJ (M ana te eC o .), 1 996 C AD e pt. of theY ou thA uth ority, 19 97 B o yles, B o ke nkam p, &M a du ra, 1 99 6 M acke nzie&S o uryal (S .C ., O ld), 19 94 Fl. D e pt. of JJ (P olk C o., G irls), 1 99 7 Jon es, 1 997 Th om as &P e ters, 1 99 6 W righ t &M a ys, 1 998 M acke nzie&S o uryal (G eorg ia ), 19 94 Practical MetaAnalysis -- D. B. Wilson O verall M e anO d ds-R atio 3 The Great Debate 1952: Hans J. Eysenck concluded that there were no favorable effects of psychotherapy, starting a raging debate 20 years of evaluation research and hundreds of studies failed to resolve the debate 1978: To proved Eysenck wrong, Gene V. Glass statistically aggregate the findings of 375 psychotherapy outcome studies Glass (and colleague Smith) concluded that psychotherapy did indeed work Practical“meta-analysis” MetaGlass called his method Analysis -- D. B. 4 Wilson The Emergence of Meta-analysis Ideas behind meta-analysis predate Glass’ work by several decades Karl Pearson (1904) averaged correlations for studies of the effectiveness of inoculation for typhoid fever R. A. Fisher (1944) “When a number of quite independent tests of significance have been made, it sometimes happens that although few or none can be claimed individually as significant, yet the aggregate gives an impression that the probabilities are on the whole lower than would often have been obtained by chance” (p. 99). Practical Meta Source of the idea of cumulating probability values Analysis -- D. B. Wilson 5 The Emergence of Meta-analysis Ideas behind meta-analysis predate Glass’ work by several decades W. G. Cochran (1953) Discusses a method of averaging means across independent studies Laid-out much of the statistical foundation that modern metaanalysis is built upon (e.g., Inverse variance weighting and homogeneity testing) Practical MetaAnalysis -- D. B. Wilson 6 The Logic of Meta-analysis Traditional methods of review focus on statistical significance testing Significance testing is not well suited to this task Highly dependent on sample size Null finding does not carry the same “weight” as a significant finding significant effect is a strong conclusion nonsignificant effect is a weak conclusion Meta-analysis focuses on the direction and magnitude of the effects across studies, not statistical significance Isn’t this what we are interested in anyway? Direction and magnitude areMetarepresented by the effect size Practical Analysis -- D. B. Wilson 7 Illustration Table 1 21 Validity Studies, N = 68 for Each Observed validity Study coefficient 1 0.04 2 0.14 3 0.31 * 4 0.12 5 0.38 * 6 0.27 * 7 0.15 8 0.36 * 9 0.20 10 0.02 11 0.23 12 0.11 13 0.21 14 0.37 * 15 0.14 16 0.29 * 17 0.26 * 18 0.17 19 0.39 * 20 0.22 21 0.21 * p < .05 (two tailed). Simulated data from 21 validity studies. Taken from: Schimdt, F. L. (1996). Statistical significance testing and cumulative knowledge in psychology: implications for training of researchers. Psychological Practical MetaMethods, 1, 115-129. Analysis -- D. B. Wilson 8 Illustration (Continued) Table 2 95% Confidence Intervals for Correlations From Table 1, N = 68 for Each Observed 95% confidence validity interval Study coefficient Lower Upper 1 0.39 0.19 0.59 2 0.38 0.18 0.58 3 0.37 0.16 0.58 4 0.36 0.15 0.57 5 0.31 0.09 0.53 6 0.29 0.07 0.51 7 0.27 0.05 0.49 8 0.26 0.04 0.48 9 0.23 0.00 0.46 10 0.22 -0.01 0.45 11 0.21 -0.02 0.44 12 0.21 -0.02 0.44 13 0.20 -0.03 0.43 14 0.17 -0.06 0.40 15 0.15 -0.08 0.38 16 0.14 -0.09 0.37 17 0.14 -0.09 0.37 18 0.12 -0.12 0.36 19 0.11 -0.13 0.35 20 0.04 -0.20 0.28 21 0.02 -0.22 0.26 Practical MetaAnalysis -- D. B. Wilson 9 When Can You Do Meta-analysis? Meta-analysis is applicable to collections of research that Are empirical, rather than theoretical Produce quantitative results, rather than qualitative findings Examine the same constructs and relationships Have findings that can be configured in a comparable statistical form (e.g., as effect sizes, correlation coefficients, odds-ratios, proportions) Are “comparable” given the question at hand Practical MetaAnalysis -- D. B. Wilson 10 Forms of Research Findings Suitable to Metaanalysis Central tendency research Pre-post contrasts Prevalence rates Growth rates Group contrasts Experimentally created groups Comparison of outcomes between treatment and comparison groups Naturally occurring groups Comparison of spatial abilities between boys and girls Rates of morbidity among high and low risk groups Practical MetaAnalysis -- D. B. Wilson 11 Forms of Research Findings Suitable to Metaanalysis Association between variables Measurement research Validity generalization Individual differences research Correlation between personality constructs Practical MetaAnalysis -- D. B. Wilson 12 Effect Size: The Key to Meta-analysis The effect size makes meta-analysis possible It is the “dependent variable” It standardizes findings across studies such that they can be directly compared Practical MetaAnalysis -- D. B. Wilson 13 Effect Size: The Key to Meta-analysis Any standardized index can be an “effect size” (e.g., standardized mean difference, correlation coefficient, odds-ratio) as long as it meets the following Is comparable across studies (generally requires standardization) Represents the magnitude and direction of the relationship of interest Is independent of sample size Different meta-analyses may use different effect size indices Practical MetaAnalysis -- D. B. Wilson 14 The Replication Continuum Conceptual Replications Pure Replications You must be able to argue that the collection of studies you are meta-analyzing examine the same relationship. This may be at a broad level of abstraction, such as the relationship between criminal justice interventions and recidivism or between schoolbased prevention programs and problem behavior. Alternatively it may be at a narrow level of abstraction and represent pure replications. The closer to pure replications your collection of studies, the easier it is to argue comparability. Practical MetaAnalysis -- D. B. Wilson 15 Which Studies to Include? It is critical to have an explicit inclusion and exclusion criteria (see pages 20-21) The broader the research domain, the more detailed they tend to become Refine criteria as you interact with the literature Components of a detailed criteria distinguishing features research respondents key variables research methods cultural and linguistic range time frame publication types Practical Meta- Analysis -- D. B. Wilson 16 Methodological Quality Dilemma Include or exclude low quality studies? The findings of all studies are potentially in error (methodological quality is a continuum, not a dichotomy) Being too restrictive may restrict ability to generalize Being too inclusive may weaken the confidence that can be placed in the findings Methodological quality is often in the “eye-of-the-beholder” You must strike a balance that is appropriate to your research question Practical MetaAnalysis -- D. B. Wilson 17 Searching Far and Wide The “we only included published studies because they have been peer-reviewed” argument Significant findings are more likely to be published than nonsignificant findings Critical to try to identify and retrieve all studies that meet your eligibility criteria Practical MetaAnalysis -- D. B. Wilson 18 Searching Far and Wide (continued) Potential sources for identification of documents Computerized bibliographic databases “Google” internet search engine Authors working in the research domain (email a relevant Listserv?) Conference programs Dissertations Review articles Hand searching relevant journal Government reports, bibliographies, clearinghouses Practical MetaAnalysis -- D. B. Wilson 19 A Note About Computerized Bibliographies Rapidly changing area Get to know your local librarian! Searching one or two databases is generally inadequate Use “wild cards” (e.g., random? will find random, randomization, and randomize) Throw a wide net; filter down with a manual reading of the abstracts Practical MetaAnalysis -- D. B. Wilson 20 Strengths of Meta-analysis Imposes a discipline on the process of summing up research findings Represents findings in a more differentiated and sophisticated manner than conventional reviews Capable of finding relationships across studies that are obscured in other approaches Protects against over-interpreting differences across studies Can handle a large numbers of studies (this would overwhelm traditional approaches to review) Practical MetaAnalysis -- D. B. 21 Wilson Weaknesses of Meta-analysis Requires a good deal of effort Mechanical aspects don’t lend themselves to capturing more qualitative distinctions between studies “Apples and oranges” criticism Most meta-analyses include “blemished” studies to one degree or another (e.g., a randomized design with attrition) Selection bias posses a continual threat Negative and null finding studies that you were unable to find Outcomes for which there were negative or null findings that were not reported Analysis of between studyMetadifferences is fundamentally Practical correlational Analysis -- D. B. 22 Wilson