1. Consider two structural models given by the following system of equations (Note: These are two independent models):
Model 1
Model 2
For each system:
a. Evaluate which equations are under-identified, just-identified, and over-identified.
b. Illustrate how you could estimate the identified equations.
2. Using a sample of 545 full-time workers in the USA, a researcher is interested in whether women are systematically underpaid compared to men. First, she evaluates the average hourly wages in the sample for men and women, which are $5.91 and $5.09, respectively.
The researcher also runs a simple regression of an individual's wage on a male dummy, equal to 1 for males and 0 for females. This gives the results reported in Table below:
Table: Hourly wages explained from gender: OLS results
Variable Estimate Standard Error t-ratio
Constant 5.09 0.58 8.78
Male 0.82 0.15 5.47
N=545 s=2.17 R2=26%
a. How will you interpret the coefficient estimate of 0.82? How do you interpret the estimated intercept of 5.09?
b. How do you interpret the R2 value?
c. Illustrate the relationship between the coefficient estimates in the table and the average wage rates of females and males.
e. A student is unhappy with this model because "a female dummy is omitted from the model." Comment upon this criticism.
f. Using the results in Table 1, test the hypothesis that men and women have, on average, the same wage rate, against the one-sided alternative that women earn less. State the assumptions required for this test to be valid.
3. Given the model , answer the following questions.
a. Assume that εt ~ N(0, σ2) What type of time series model is this?
b. Graph the value of y against t for 10 periods when ε1= 0.2, φ1= 0.8, and a0= 0.
c. Draw an appropriate ACF and PACF plot for the model given in this question.
4. Consider the following OLS regression between the 1975 Wages for 428 married women versus their actual experience in the labor market and their years of education (1976 Panel Study of Income Dynamics, Mroz(1987).
log(wage) = - 0.400 + 0.0160 x Exper + 0.1095 x Educ
The data set was analyzed using SAS. Partial output in tabular form is presented below
Analysis of Variance
Source
|
DF
|
Sum of Squares
|
Mean Square
|
F Value
|
Pr>F
|
Model
|
A
|
D
|
16.56623
|
G
|
H
|
Error
|
B
|
E
|
0.44752
|
Corrected Total
|
C
|
F
|
|
Root MSE: I
R-Square: J
Parameter Estimates
Variable
|
DF
|
Parameter Estimate
|
Standard Error
|
t value
|
Pr>|t|
|
Intercept
|
1
|
-0.40017
|
0.19037
|
|
|
Exper
|
1
|
0.01567
|
0.00402
|
K
|
M
|
Educ
|
1
|
0.10949
|
0.01417
|
L
|
N
|
a. Calculate the values associated with the letters A through N.
b. Interpret the coefficients associated with Exper and Educ.
5. A criminologist is interested in studying the following question: "Is the death penalty applied in a racially discriminatory fashion?" To answer this question, data were collected for 100 death penalty cases in the State of Georgia. Logistic regression was used with the binary dependent variable death penalty against a number of independent variables. The analysis is set up to obtain the predicted probability of getting the death penalty (death penalty = 1). The independent variables were defined as follows:
blkdef = 1 if black defendant; 0 otherwise.
whtvict = 1 if white victim; 0 otherwise.
aggcirc = number of aggravating circumstances.
fevict = 1 if female victim; 0 otherwise.
stranger =1 if stranger victim; 0 otherwise.
multvic = 1 if 2 or more victims; 0 otherwise.
multstab = 1 if multiple stabs; 0 otherwise.
yngvict = 1 if victim 12 or younger; 0 otherwise.
A partial output table is given below:
Parameter
|
DF
|
Estimate
|
Standard Error
|
Wald Chi Square
|
P-Value
|
Intercept
|
1
|
-3.5675
|
1.1243
|
10.0682
|
0.0015
|
blkdef
|
1
|
-0.5308
|
0.5439
|
0.9526
|
0.3291
|
whtvict
|
1
|
1.5563
|
0.6161
|
6.382
|
0.0115
|
aggcirc
|
1
|
0.373
|
0.1963
|
3.6096
|
0.0574
|
fevict
|
1
|
0.3707
|
0.5405
|
0.4703
|
0.4928
|
stranger
|
1
|
1.7911
|
0.5386
|
11.0577
|
0.0009
|
multvic
|
1
|
0.1999
|
0.745
|
0.072
|
0.7885
|
multstab
|
1
|
1.4429
|
0.7938
|
3.3047
|
0.0691
|
yngvict
|
1
|
0.1232
|
0.9526
|
0.0167
|
0.8971
|
a. Holding all other variables constant and using a type I error rate of 5%, are black defendants more likely to get the death penalty than white defendants? Why or why not? Interpret the coefficient for blkdef.
b. Calculate the odds ratio of getting the death penalty for a defendant whose crime was against a white defendant. Is this odds ratio statistically significant using a type 1 error rate of 5%. Interpret the odds ratio.
c. What is the predicted probability of getting the death penalty for a black defendant who kills a white (female) victim who is a stranger with two aggravating circumstances, multiple victims, multiple stabs, and a victim younger than 12 years of age? What would the prediction be if all that changed was that the defendant was not black?
d. The regression coefficients for multvic and yngvict are not statistically significant. Make an argument for why we would include these independent variables in the logistic regression model even though their regression coefficients are nonsignificant.