About the authors |
|
xi | |
Acknowledgements |
|
xiii | |
List of tables and figures |
|
xv | |
Foreword |
|
xvii | |
Preface |
|
xix | |
1 Validity and validation |
|
1 | (26) |
|
What do we mean by validity? |
|
|
2 | (5) |
|
Validity across disciplines |
|
|
2 | (1) |
|
|
3 | (2) |
|
Validity for measurement: attributes and decisions |
|
|
5 | (2) |
|
Kinds of validity for measurement |
|
|
7 | (3) |
|
Conventions used in the book |
|
|
10 | (4) |
|
Educational and psychological measurement |
|
|
10 | (1) |
|
Attributes or constructs? |
|
|
11 | (1) |
|
Particular kinds of attribute |
|
|
12 | (1) |
|
|
13 | (1) |
|
An outline of the history of validity |
|
|
14 | (13) |
|
The genesis of validity (mid-1800s-1951) |
|
|
16 | (3) |
|
A gestational period (mid-1800s-1920) |
|
|
17 | (1) |
|
A period of crystallization (1921-1951) |
|
|
18 | (1) |
|
The fragmentation of validity (1952-1974) |
|
|
19 | (2) |
|
The (re)unification of validity (1974-1999) |
|
|
21 | (1) |
|
The deconstruction of validity (2000-2012) |
|
|
22 | (2) |
|
Twenty-first-century evaluation |
|
|
24 | (3) |
2 The genesis of validity: mid-1800s-1951 |
|
27 | (36) |
|
A gestational period (pre-1921) |
|
|
27 | (3) |
|
A period of crystallization (post-1921) |
|
|
30 | (8) |
|
Existing accounts of the early years |
|
|
32 | (4) |
|
Explaining the caricature |
|
|
36 | (2) |
|
From quality of measurement to degree of correlation |
|
|
36 | (1) |
|
From degree of correlation to coefficient of validity |
|
|
37 | (1) |
|
Validating tests for different purposes |
|
|
38 | (16) |
|
Validity and school achievement tests |
|
|
39 | (7) |
|
Sampling theory versus test construction practice |
|
|
41 | (1) |
|
Measurement versus evaluation |
|
|
42 | (1) |
|
Sampling theory versus test construction theory |
|
|
43 | (2) |
|
The ongoing tension between empirical and logical |
|
|
45 | (1) |
|
Validity and general intelligence tests |
|
|
46 | (3) |
|
A logical approach to test development |
|
|
46 | (1) |
|
An empirical approach to validation |
|
|
47 | (2) |
|
Beyond correlation with criterion measures |
|
|
49 | (1) |
|
Validity and special aptitude tests |
|
|
49 | (3) |
|
An empirical approach to validation |
|
|
50 | (1) |
|
An empirical approach to test development |
|
|
50 | (1) |
|
|
51 | (1) |
|
Validity and personality tests |
|
|
52 | (2) |
|
|
53 | (1) |
|
|
53 | (1) |
|
|
54 | (1) |
|
Validity and validation by the end of the 1940s |
|
|
54 | (9) |
|
|
55 | (2) |
|
A more interesting and nuanced story |
|
|
57 | (6) |
3 The fragmentation of validity: 1952-1974 |
|
63 | (36) |
|
Professional standards: edition 1 |
|
|
64 | (7) |
|
Different approaches to validation |
|
|
65 | (4) |
|
|
66 | (1) |
|
|
66 | (1) |
|
|
67 | (1) |
|
|
68 | (1) |
|
Different kinds of validity |
|
|
69 | (2) |
|
The invention of construct validity |
|
|
71 | (3) |
|
Professional standards: editions 2 and 3 |
|
|
74 | (3) |
|
|
77 | (3) |
|
The entrenchment of fragmented thinking |
|
|
80 | (8) |
|
|
84 | (4) |
|
|
88 | (3) |
|
Validity and validation by the mid-1970s |
|
|
91 | (8) |
4 The (re)unification of validity: 1975-1999 |
|
99 | (36) |
|
The Messick years: triumph and tribulation |
|
|
100 | (16) |
|
The Messick years prefigured |
|
|
100 | (2) |
|
The new science of validity |
|
|
102 | (14) |
|
Deeply entrenched fragmentation |
|
|
102 | (1) |
|
|
103 | (1) |
|
Why 'content validity' is insufficient |
|
|
103 | (3) |
|
Why 'criterion validity' is insufficient |
|
|
106 | (2) |
|
The general relevance of nomological networks |
|
|
108 | (1) |
|
The ethical imperative underpinning construct validity |
|
|
109 | (1) |
|
All validity is construct validity |
|
|
110 | (4) |
|
The new practice of validation |
|
|
114 | (2) |
|
|
116 | (13) |
|
|
117 | (1) |
|
The illogic of the matrix |
|
|
118 | (3) |
|
The confusing role of social consequences |
|
|
121 | (15) |
|
Consequences and the scope of validity theory |
|
|
122 | (1) |
|
|
122 | (1) |
|
|
123 | (2) |
|
|
125 | (2) |
|
|
127 | (2) |
|
Professional standards: editions 4 and 5 |
|
|
129 | (2) |
|
Validity and validation by the end of the 1990s |
|
|
131 | (4) |
5 The deconstruction of validity: 2000-2012 |
|
135 | (48) |
|
The desire to simplify validation practice |
|
|
136 | (10) |
|
The methodology in principle |
|
|
137 | (4) |
|
The methodology in practice |
|
|
141 | (5) |
|
The desire to simplify validity theory |
|
|
146 | (20) |
|
The nature and significance of construct validity |
|
|
146 | (20) |
|
The construction of construct validity theory |
|
|
146 | (2) |
|
Deconstruction and reconstructions of construct validity |
|
|
148 | (19) |
|
Borsboom (part 1): tests are valid, not interpretations |
|
|
148 | (1) |
|
Lissitz and Samuelsen: no need for nomological networks |
|
|
149 | (1) |
|
Embretson, Pellegrino and Gorin: the cognitive approach |
|
|
150 | (3) |
|
Kane: not all attributes are hypothetical |
|
|
153 | (2) |
|
Borsboom (part 2): validity is ontological, not epistemological |
|
|
155 | (10) |
|
Michell and Maraun: doubts about measurement |
|
|
165 | (1) |
|
Moss: situated validation |
|
|
165 | (1) |
|
Construct validity in the balance |
|
|
166 | (6) |
|
The nature of validation research |
|
|
167 | (2) |
|
The nature of constructs/attributes |
|
|
168 | (1) |
|
|
169 | (1) |
|
The label 'construct validity' |
|
|
170 | (1) |
|
|
171 | (1) |
|
|
172 | (7) |
|
|
172 | (2) |
|
|
174 | (1) |
|
|
175 | (1) |
|
|
176 | (2) |
|
|
178 | (1) |
|
Validity and validation into the 21st century |
|
|
179 | (4) |
6 Twenty-first-century evaluation |
|
183 | (44) |
|
A framework for the evaluation of testing policy |
|
|
184 | (5) |
|
|
189 | (3) |
|
Evaluation of technical quality: cells 1 to 3 |
|
|
192 | (2) |
|
Evaluation of social value: cells 4a to 4c and the Overall Judgement |
|
|
194 | (4) |
|
Illustrating the framework |
|
|
198 | (26) |
|
|
198 | (4) |
|
|
198 | (1) |
|
|
199 | (1) |
|
Policy owner values and purposes |
|
|
200 | (1) |
|
Reconciling values and the 'real world' |
|
|
201 | (1) |
|
|
202 | (8) |
|
Decision-making versus intervention |
|
|
203 | (1) |
|
|
204 | (1) |
|
|
205 | (2) |
|
Different ways of constructing a theory of the decision |
|
|
207 | (3) |
|
|
210 | (2) |
|
|
210 | (1) |
|
|
211 | (1) |
|
|
212 | (6) |
|
|
213 | (1) |
|
Cell 4b primary expenses, pay-offs, impacts and side-effects |
|
|
213 | (3) |
|
Cell 4c secondary expenses, pay-offs, impacts and side-effects |
|
|
216 | (2) |
|
|
218 | (6) |
|
Acceptable to the policy owner |
|
|
221 | (1) |
|
Acceptable to the academy |
|
|
221 | (1) |
|
|
222 | (1) |
|
|
223 | (1) |
|
The context of the Overall Judgement |
|
|
224 | (1) |
|
|
224 | (3) |
References |
|
227 | (20) |
Index |
|
247 | |