This lab will walk through some basic SSIV analyses using data from Autor, Dorn, and Hanson (ADH, 2013). As discussed in lecture, ADH use a shift-share instrument aggregating Chinese import shocks across 397 manufacturing instruments with exposure weights calculated as the economy-wide share of (baseline) industry employment. We will use three cleaned datasets from their setup:
-
adh_shocks.dta
: an industry-by-year dataset of the shocks -
adh_shares.dta
: a location-by-industry-by-year dataset of the shares -
adh_noIV.dta
: a location-by-year dataset of the main outcome (manufacturing employment growth,y
), treatment (local growth of China import exposure,x
), and other useful variables -- excluding the ADH instrument
- Construct the ADH (location-by-year) instrument by appropriately combining the data on shocks and shares. Merge this into the
adh_noIV
dataset, and estimate an IV regression of the outcome onto the treatment which controls for year (i.e. thepost
variable) and weights by baseline total employment (theweight
variable), clustering bystate
. Then estimate the exact same IV regression replacing the outcomey
with the lagged outcomey_lag
, capturing growth in manufacturing employment that took place before the ADH "China Shock" quasi-experiment. How does the latter IV regression help build support for the former IV regression?
Main IV Estimate:
Standard Error:
Lag Outcome IV Estimate:
Standard Error:
Comments:
- Construct the "sum-of-shares" control from the
adh_shares
dataset and add this control to both of the previous IV regressions. How does the main IV estimate change? Why, intuitively, is this control important to include?
Main IV Estimate:
Standard Error:
Lag Outcome IV Estimate:
Standard Error:
Comments:
- Interact the "sum-of-shares" control with year and add this control to both of the previous IV regressions. How do both IV estimates change? Can you see why, intuitively, the interaction control shifts the main IV estimate so much?
Main IV Estimate:
Standard Error:
Lag Outcome IV Estimate:
Standard Error:
Comments:
- Use the ssaggregate command to run both of the previous IV regressions at the shock level. You should control for year fixed effects in the shock-level IV regressions. The coefficients should be identical to the previous estimates, but the standard errors will be different. Comment on the change.
Main IV Estimate:
Standard Error:
Lag Outcome IV Estimate:
Standard Error: