| Preface |
|
xiii | |
|
|
|
1 | (20) |
|
1.1 What Is Regression Analysis? |
|
|
1 | (1) |
|
1.2 Publicly Available Data Sets |
|
|
2 | (1) |
|
1.3 Selected Applications of Regression Analysis |
|
|
3 | (4) |
|
1.3.1 Agricultural Sciences |
|
|
3 | (1) |
|
1.3.2 Industrial and Labor Relations |
|
|
3 | (1) |
|
|
|
4 | (2) |
|
|
|
6 | (1) |
|
1.3.5 Environmental Sciences |
|
|
6 | (1) |
|
1.4 Steps in Regression Analysis |
|
|
7 | (10) |
|
1.4.1 Statement of the Problem |
|
|
11 | (1) |
|
1.4.2 Selection of Potentially Relevant Variables |
|
|
11 | (1) |
|
|
|
11 | (1) |
|
1.4.4 Model Specification |
|
|
12 | (2) |
|
|
|
14 | (1) |
|
|
|
14 | (2) |
|
1.4.7 Model Criticism and Selection |
|
|
16 | (1) |
|
1.4.8 Objectives of Regression Analysis |
|
|
16 | (1) |
|
1.5 Scope and Organization of the Book |
|
|
17 | (1) |
|
|
|
18 | (3) |
|
2 Simple Linear Regression |
|
|
21 | (32) |
|
|
|
21 | (1) |
|
2.2 Covariance and Correlation Coefficient |
|
|
21 | (5) |
|
2.3 Example: Computer Repair Data |
|
|
26 | (2) |
|
2.4 The Simple Linear Regression Model |
|
|
28 | (1) |
|
|
|
29 | (3) |
|
|
|
32 | (5) |
|
|
|
37 | (1) |
|
|
|
37 | (2) |
|
2.9 Measuring the Quality of Fit |
|
|
39 | (3) |
|
2.10 Regression Line Through the Origin |
|
|
42 | (2) |
|
2.11 Trivial Regression Models |
|
|
44 | (1) |
|
|
|
45 | (1) |
|
|
|
45 | (8) |
|
3 Multiple Linear Regression |
|
|
53 | (32) |
|
|
|
53 | (1) |
|
3.2 Description of the Data and Model |
|
|
53 | (1) |
|
3.3 Example: Supervisor Performance Data |
|
|
54 | (3) |
|
|
|
57 | (1) |
|
3.5 Interpretations of Regression Coefficients |
|
|
58 | (2) |
|
3.6 Properties of the Least Squares Estimators |
|
|
60 | (1) |
|
3.7 Multiple Correlation Coefficient |
|
|
61 | (1) |
|
3.8 Inference for Individual Regression Coefficients |
|
|
62 | (2) |
|
3.9 Tests of Hypotheses in a Linear Model |
|
|
64 | (10) |
|
3.9.1 Testing All Regression Coefficients Equal to Zero |
|
|
66 | (3) |
|
3.9.2 Testing a Subset of Regression Coefficients Equal to Zero |
|
|
69 | (2) |
|
3.9.3 Testing the Equality of Regression Coefficients |
|
|
71 | (2) |
|
3.9.4 Estimating and Testing of Regression Parameters Under Constraints |
|
|
73 | (1) |
|
|
|
74 | (1) |
|
|
|
75 | (1) |
|
|
|
75 | (7) |
|
Appendix: Multiple Regression in Matrix Notation |
|
|
82 | (3) |
|
4 Regression Diagnostics: Detection of Model Violations |
|
|
85 | (36) |
|
|
|
85 | (1) |
|
4.2 The Standard Regression Assumptions |
|
|
86 | (2) |
|
4.3 Various Types of Residuals |
|
|
88 | (2) |
|
|
|
90 | (3) |
|
4.5 Graphs Before Fitting a Model |
|
|
93 | (4) |
|
4.5.1 One-Dimensional Graphs |
|
|
93 | (1) |
|
4.5.2 Two-Dimensional Graphs |
|
|
93 | (3) |
|
|
|
96 | (1) |
|
|
|
96 | (1) |
|
4.6 Graphs After Fitting a Model |
|
|
97 | (1) |
|
4.7 Checking Linearity and Normality Assumptions |
|
|
97 | (1) |
|
4.8 Leverage, Influence, and Outliers |
|
|
98 | (5) |
|
4.8.1 Outliers in the Response Variable |
|
|
100 | (1) |
|
4.8.2 Outliers in the Predictors |
|
|
100 | (1) |
|
4.8.3 Masking and Swamping Problems |
|
|
100 | (3) |
|
4.9 Measures of Influence |
|
|
103 | (4) |
|
|
|
103 | (1) |
|
4.9.2 Welsch and Kuh Measure |
|
|
104 | (1) |
|
4.9.3 Hadi's Influence Measure |
|
|
105 | (2) |
|
4.10 The Potential-Residual Plot |
|
|
107 | (1) |
|
4.11 What to Do with the Outliers? |
|
|
108 | (1) |
|
4.12 Role of Variables in a Regression Equation |
|
|
109 | (5) |
|
4.12.1 Added-Variable Plot |
|
|
109 | (1) |
|
4.12.2 Residual Plus Component Plot |
|
|
110 | (4) |
|
4.13 Effects of an Additional Predictor |
|
|
114 | (1) |
|
|
|
115 | (1) |
|
|
|
115 | (6) |
|
5 Qualitative Variables as Predictors |
|
|
121 | (30) |
|
|
|
121 | (1) |
|
|
|
122 | (3) |
|
5.3 Interaction Variables |
|
|
125 | (3) |
|
5.4 Systems of Regression Equations |
|
|
128 | (11) |
|
5.4.1 Models with Different Slopes and Different Intercepts |
|
|
130 | (7) |
|
5.4.2 Models with Same Slope and Different Intercepts |
|
|
137 | (1) |
|
5.4.3 Models with Same Intercept and Different Slopes |
|
|
138 | (1) |
|
5.5 Other Applications of Indicator Variables |
|
|
139 | (1) |
|
|
|
140 | (1) |
|
5.7 Stability of Regression Parameters Over Time |
|
|
141 | (2) |
|
|
|
143 | (8) |
|
6 Transformation of Variables |
|
|
151 | (28) |
|
|
|
151 | (2) |
|
6.2 Transformations to Achieve Linearity |
|
|
153 | (2) |
|
6.3 Bacteria Deaths Due to X-Ray Radiation |
|
|
155 | (4) |
|
6.3.1 Inadequacy of a Linear Model |
|
|
156 | (2) |
|
6.3.2 Logarithmic Transformation for Achieving Linearity |
|
|
158 | (1) |
|
6.4 Transformations to Stabilize Variance |
|
|
159 | (5) |
|
6.5 Detection of Heteroscedastic Errors |
|
|
164 | (2) |
|
6.6 Removal of Heteroscedasticity |
|
|
166 | (1) |
|
6.7 Weighted Least Squares |
|
|
167 | (1) |
|
6.8 Logarithmic Transformation of Data |
|
|
168 | (1) |
|
|
|
169 | (4) |
|
|
|
173 | (1) |
|
|
|
174 | (5) |
|
|
|
179 | (18) |
|
|
|
179 | (1) |
|
7.2 Heteroscedastic Models |
|
|
180 | (3) |
|
|
|
180 | (2) |
|
7.2.2 College Expense Data |
|
|
182 | (1) |
|
|
|
183 | (2) |
|
7.4 Education Expenditure Data |
|
|
185 | (9) |
|
7.5 Fitting a Dose-Response Relationship Curve |
|
|
194 | (2) |
|
|
|
196 | (1) |
|
8 The Problem of Correlated Errors |
|
|
197 | (24) |
|
8.1 Introduction: Autocorrelation |
|
|
197 | (1) |
|
8.2 Consumer Expenditure and Money Stock |
|
|
198 | (2) |
|
8.3 Durbin-Watson Statistic |
|
|
200 | (2) |
|
8.4 Removal of Autocorrelation by Transformation |
|
|
202 | (2) |
|
8.5 Iterative Estimation With Autocorrelated Errors |
|
|
204 | (1) |
|
8.6 Autocorrelation and Missing Variables |
|
|
205 | (1) |
|
8.7 Analysis of Housing Starts |
|
|
206 | (4) |
|
8.8 Limitations of Durbin-Watson Statistic |
|
|
210 | (1) |
|
8.9 Indicator Variables to Remove Seasonality |
|
|
211 | (3) |
|
8.10 Regressing Two Time Series |
|
|
214 | (2) |
|
|
|
216 | (5) |
|
9 Analysis of Collinear Data |
|
|
221 | (38) |
|
|
|
221 | (1) |
|
|
|
222 | (6) |
|
9.3 Effects on Forecasting |
|
|
228 | (5) |
|
9.4 Detection of Multicollinearity |
|
|
233 | (6) |
|
9.5 Centering and Scaling |
|
|
239 | (4) |
|
9.5.1 Centering and Scaling in Intercept Models |
|
|
240 | (1) |
|
9.5.2 Scaling in No-Intercept Models |
|
|
241 | (2) |
|
9.6 Principal Components Approach |
|
|
243 | (3) |
|
|
|
246 | (2) |
|
9.8 Searching for Linear Functions of the 3's |
|
|
248 | (4) |
|
9.9 Computations Using Principal Components |
|
|
252 | (2) |
|
|
|
254 | (1) |
|
|
|
254 | (1) |
|
Appendix: Principal Components |
|
|
255 | (4) |
| 10 Biased Estimation of Regression Coefficients |
|
259 | (22) |
|
|
|
259 | (1) |
|
10.2 Principal Components Regression |
|
|
260 | (2) |
|
10.3 Removing Dependence Among the Predictors |
|
|
262 | (2) |
|
10.4 Constraints on the Regression Coefficients |
|
|
264 | (1) |
|
10.5 Principal Components Regression: A Caution |
|
|
265 | (3) |
|
|
|
268 | (1) |
|
10.7 Estimation by the Ridge Method |
|
|
269 | (3) |
|
10.8 Ridge Regression: Some Remarks |
|
|
272 | (3) |
|
|
|
275 | (1) |
|
|
|
275 | (2) |
|
Appendix: Ridge Regression |
|
|
277 | (4) |
| 11 Variable Selection Procedures |
|
281 | (36) |
|
|
|
281 | (1) |
|
11.2 Formulation of the Problem |
|
|
282 | (1) |
|
11.3 Consequences of Variables Deletion |
|
|
282 | (2) |
|
11.4 Uses of Regression Equations |
|
|
284 | (1) |
|
11.4.1 Description and Model Building |
|
|
284 | (1) |
|
11.4.2 Estimation and Prediction |
|
|
284 | (1) |
|
|
|
284 | (1) |
|
11.5 Criteria for Evaluating Equations |
|
|
285 | (3) |
|
11.5.1 Residual Mean Square |
|
|
285 | (1) |
|
|
|
286 | (1) |
|
11.5.3 Information Criteria: Akaike and Other Modified Forms |
|
|
287 | (1) |
|
11.6 Multicollinearity and Variable Selection |
|
|
288 | (1) |
|
11.7 Evaluating All Possible Equations |
|
|
288 | (1) |
|
11.8 Variable Selection Procedures |
|
|
289 | (2) |
|
11.8.1 Forward Selection Procedure |
|
|
289 | (1) |
|
11.8.2 Backward Elimination Procedure |
|
|
290 | (1) |
|
|
|
290 | (1) |
|
11.9 General Remarks on Variable Selection Methods |
|
|
291 | (1) |
|
11.10 A Study of Supervisor Performance |
|
|
292 | (4) |
|
11.11 Variable Selection With Collinear Data |
|
|
296 | (1) |
|
|
|
296 | (3) |
|
11.13 Variable Selection Using Ridge Regression |
|
|
299 | (1) |
|
11.14 Selection of Variables in an Air Pollution Study |
|
|
300 | (7) |
|
11.15 A Possible Strategy for Fitting Regression Models |
|
|
307 | (1) |
|
11.16 Bibliographic Notes |
|
|
308 | (1) |
|
|
|
308 | (5) |
|
Appendix: Effects of Incorrect Model Specifications |
|
|
313 | (4) |
| 12 Logistic Regression |
|
317 | (24) |
|
|
|
317 | (1) |
|
12.2 Modeling Qualitative Data |
|
|
318 | (1) |
|
|
|
318 | (2) |
|
12.4 Example: Estimating Probability of Bankruptcies |
|
|
320 | (3) |
|
12.5 Logistic Regression Diagnostics |
|
|
323 | (1) |
|
12.6 Determination of Variables to Retain |
|
|
324 | (3) |
|
12.7 Judging the Fit of a Logistic Regression |
|
|
327 | (2) |
|
12.8 The Multinomial Logit Model |
|
|
329 | (7) |
|
12.8.1 Multinomial Logistic Regression |
|
|
329 | (1) |
|
12.8.2 Example: Determining Chemical Diabetes |
|
|
330 | (4) |
|
12.8.3 Ordered Response Category: Ordinal Logistic Regression |
|
|
334 | (1) |
|
12.8.4 Example: Determining Chemical Diabetes Revisited |
|
|
335 | (1) |
|
12.9 Classification Problem: Another Approach |
|
|
336 | (1) |
|
|
|
337 | (4) |
| 13 Further Topics |
|
341 | (12) |
|
|
|
341 | (1) |
|
13.2 Generalized Linear Model |
|
|
341 | (1) |
|
13.3 Poisson Regression Model |
|
|
342 | (1) |
|
13.4 Introduction of New Drugs |
|
|
343 | (2) |
|
|
|
345 | (1) |
|
13.6 Fitting a Quadratic Model |
|
|
346 | (2) |
|
13.7 Distribution of PCB in U.S. Bays |
|
|
348 | (4) |
|
|
|
352 | (1) |
| Appendix A: Statistical Tables |
|
353 | (10) |
| References |
|
363 | (8) |
| Index |
|
371 | |