Jason Blevins’s Notebook
Graham Imbens and Ridder (2006)

Complementarity and Aggregate Implications of Assortative Matching

These slides are based on the paper ”Complementarity and Aggregate Implications of Assortative Matching” by Bryan S. Graham, Guido W. Imbens, and Geert Ridder, May 14, 2006.

Presentation by Jason Blevins, Duke Applied Microeconometrics Reading Group, March 6, 2007

Basic Model

Reallocations

Examples

Estimation

Comparison

Contributions

Model

Notes

  • Holding the marginal distribution of W fixed is appropriate for situations where the input is indivisible (e.g., Teachers, Managers) and when the aggregate stock of the input is hard to augment.

  • X is a scalar because the paper focuses on rank ordered matching. There is no clear natural ordering for vector valued covariates.

Identifying Assumption

That is, the average output we would see if all firms were assigned W=w equals the average output among firms that actually have W=w. The distribution of potential outcomes must be the same in the subpopulation of firms that were assigned W=w as that in the overall population. This is the analogous assumption to that of the binary treatment effect model of Rosenbaum and Rubin (1983).

Production Function

Unconfoundedness implies that among firms with identical X and Z, the (counterfactual) average output of firms if we assigned W=w to all firms is equal to the actual average output of firms that are in fact assigned W=w.

Quantities of Interest

Positive Matching

  • We would expect this redistribution to perform well if there is complementarity between X and W.

  • F WZ 1 (qZ) is a conditional quantile function.

Graphical Example

Graphical example of Positive Assortative Matching

Interpretation

  • The authors focus on redistributions within subpopulations defined by Z because they reflect solely the complementarity or substitutability between W and X. Population-wide redistributions confound these effects by altering the joint distribution of W and Z.

  • However, there is a distinction to be made between what redistribution might help us learn about complementarity and which might be optimal socially.

An Example

An Example

  • Thus, focusing on redistributions across classrooms with similar gender mixes allows us to learn about complementarity.

Negative Matching

Estimation

  • Note that although the model was developed for Z subpopulations, only population-wide estimators are presented.

Estimation of g

ĝ(w,x,z)= iY iK(vV ib) iK(vV ib)
\hat g(w,x,z) = \frac{\sum_i Y_i K\left( \frac{v - V_i}{b} \right)} {\sum_i K\left( \frac{v - V_i}{b} \right)}

Support problems: we are trying to learn about a counterfactual allocation that may involve areas of the support for which we have few observations to estimate g.

Estimation of CDFs

F̂ X(x)=N 1 i1 (X ix)
\hat F_X(x) = N^{-1} \sum_i 1\left( X_i \leq x \right)
F̂ W(w)=N 1 i1 (W iw)
\hat F_W(w) = N^{-1} \sum_i 1\left( W_i \leq w \right)
F̂ W 1 (q)=inf w𝒲1 {F̂ W(w)q}
\hat F_W^{-1}(q) = \inf_{w \in \mathcal{W}} 1 \left\lbrace \hat F_W(w) \geq q \right\rbrace

This is the inverse of the empirical CDF of W.

Estimation

β̂ pam=1 N i=1 Nĝ[F̂ W 1 (F̂ X(X i)),X i,Z i]
\hat \beta^{\text{pam}} = \frac{1}{N} \sum_{i=1}^N \hat g\left[ \hat F_W^{-1}\left( \hat F_X(X_i) \right),\,X_i,\,Z_i \right]
β̂ nam=1 N i=1 Nĝ[F̂ W 1 (1 F̂ X(X i)),X i,Z i]
\hat \beta^{\text{nam}} = \frac{1}{N} \sum_{i=1}^N \hat g\left[ \hat F_W^{-1}\left( 1 - \hat F_X(X_i) \right),\,X_i,\,Z_i \right]
  • Note that here we are averaging over both X and Z.

  • The rate of convergence of β̂ pam and β̂ nam is slower than the parametric rate.

  • Loosely speaking, this is because we estimate a nonparametric function g(w,x,z) with more parameters than we then average over.

Correlated Matching

Correlated Matching

Correlated Matching

Normal Copula

ϕ(x 1 ,x 2 ,ρ)=1 2 π1 ρ 2 exp[1 2 (1 ρ 2 )(x 1 2 2 ρx 1 x 2 +x 2 2 )],
\phi(x_1, x_2, \rho) = \frac{1}{2\pi\sqrt{1 - \rho^2}}\exp\left[-\frac{1}{2(1-\rho^2)}(x_1^2 - 2 \rho x_1 x_2 + x_2^2) \right],
ϕ c(x 1 ,x 2 ,ρ)=ϕ(x 1 ,x 2 ,ρ)Φ(c,c,ρ)Φ(c,c,ρ)[Φ(c,c,ρ)Φ(c,c,ρ)].
\phi_c(x_1, x_2, \rho) = \frac{\phi(x_1, x_2, \rho)}{\Phi(c,c,\rho) - \Phi(c, -c, \rho) - [ \Phi(-c, c, \rho) - \Phi(-c, -c, \rho)]}.
H W,X(w,x)=Φ c[Φ c 1 (F W(w)),Φ c 1 (F X(x));ρ].
H_{W,X}(w,x) = \Phi_c\left[ \Phi_c^{-1}\left( F_W(w) \right), \Phi_c^{-1}\left( F_X(x)\right); \rho \right].

Normal Copula

h W,X(w,z)=ϕ c[Φ c 1 (F W(w)),Φ c 1 (F X(x));ρ]f W(w)f W(x)ϕ c[Φ c 1 (F W(w))]ϕ c[Φ c 1 (F X(x))]
h_{W,X}(w,z) = \phi_c\left[ \Phi_c^{-1}\left( F_W(w) \right), \Phi_c^{-1}\left( F_X(x) \right); \rho \right] \frac{f_W(w)f_W(x)}{ \phi_c\left[ \Phi_c^{-1}\left( F_W(w) \right) \right] \phi_c\left[ \Phi_c^{-1}\left( F_X(x) \right) \right] }

Correlated Matching

β cm(ρ,τ) =τE[Y] +(1 τ)g(w,x,z)dΦ(Φ 1 (F WZ(wz)),Φ 1 (F XZ(xz));ρ)F Z(z).
\begin{split} \beta^{\text{cm}}(\rho, \tau) &= \tau \E[Y] \\ &+ (1-\tau) \int g(w,x,z) d\Phi\left( \Phi^{-1}(F_{W \vert Z}(w \vert z)), \Phi^{-1}(F_{X \vert Z}(x \vert z)); \rho \right) F_Z(z). \end{split}
β cm(ρ,0 )=g(w,x,z)ϕ c[Φ c 1 (F W(w)),Φ c 1 (F X(x));ρ]ϕ c[Φ c 1 (F W(w))]ϕ c[Φ c 1 (F X(x))]f W(w)f X,Z(x,z)dwdxdz
\beta^{\text{cm}}(\rho, 0) = \int \int \int g(w,x,z) \frac{ \phi_c\left[ \Phi_c^{-1}\left( F_W(w) \right), \Phi_c^{-1}\left( F_X(x) \right); \rho \right] }{ \phi_c\left[ \Phi_c^{-1}\left( F_W(w) \right) \right] \phi_c\left[ \Phi_c^{-1}\left( F_X(x) \right) \right] } f_W(w) f_{X,Z}(x,z)\,dw\,dx\,dz
β rm=β cm(0,0 )=[g(w,z,z)dF WZ(wz)dF XZ(xz)]dF Z(z).
\beta^{\text{rm}} = \beta^{\text{cm}}(0,0) = \int\Biggl[ \int \int g(w,z,z)\,dF_{W \vert Z}(w \vert z)\, dF_{X \vert Z} (x \vert z)\Biggr]\, dF_{Z}(z).

Estimation

β̂ cm(ρ,0 )=1 N 2 i=1 N j=1 Nĝ(W i,X j,Z j)ϕ c[Φ c 1 (F̂ W(W i)),Φ c 1 (F̂ X(X j));ρ]ϕ c[Φ c 1 (F̂ W(W i))]ϕ c[Φ c 1 (F̂ X(X i))]
\hat \beta^{\text{cm}}(\rho, 0) = \frac{1}{N^2} \sum_{i=1}^N \sum_{j=1}^N \hat g(W_i,X_j,Z_j) \frac{ \phi_c\left[ \Phi_c^{-1}\left( \hat F_W(W_i) \right), \Phi_c^{-1}\left( \hat F_X(X_j) \right); \rho \right] }{ \phi_c\left[ \Phi_c^{-1}\left( \hat F_W(W_i) \right) \right] \phi_c\left[ \Phi_c^{-1}\left( \hat F_X(X_i) \right) \right] }
β̂ cm(ρ,τ)=τβ̂ sq+(1 τ)β̂ cm(ρ,0 ).
\hat \beta^{\text{cm}}(\rho, \tau) = \tau \hat \beta^{\text{sq}} + (1 - \tau) \hat \beta^{\text{cm}}(\rho, 0).
  • This is linear in the nonparametric regression function ĝ and nonlinear in the empirical CDFs of X and W.

  • The authors claim that under certain conditions, this estimator is consistent and asymptotically normal, but the proofs are omitted in the latest available version (May 2006).

Application

Summary Statistics

VariableMeanStd. dev.
Ed. child13.062.38
Ed. mother11.202.87
Ed. father11.203.64

Regression

VariableCoefficientStd. Err.
Constant11.27000.1900
Ed. mother-0.04100.0360
Ed. father-0.07700.0290
Ed. mother 2 0.01100.0023
Ed. father 2 0.01100.0015
Ed. mother×Ed. father0.00140.0029
  • Nonlinearity of the relationship suggests that education of the child might be sensitive to reallocations.

  • Inspection of the data reveal that there is an asymmetry: it is better to have a mother with high education and a father with low education than vice-versa. The interaction term here doesn’t capture that.

Estimates

ρβ̂ csstd(β̂ cs)
-0.9911.5.069
-0.8011.7.048
-0.6011.9.040
-0.4012.1.037
-0.2012.4.034
0.0012.6.033
0.2012.8.031
0.4012.9.030
0.6013.0.029
0.8013.0.029
0.9913.1.039