Aggregate Productivity Growth Decomposition (part 1 of 2)

11 minutes to read

Part 1 introduces the theory behind Aggregate Productivity Growth decomposition. This model is vital to know how much technological progress and input reallocation contribute to productivity growth.

The theory is derived from the original paper: Measuring aggregate productivity growth using plant-level data by Petrin & Levinsohn (2013). I learned about this model from the Applied Macroeconomics course by Professor Suzuki course.

What is APG?

Conventionally, Solow growth accounting provides a tractable framework for us to see the contribution of capital, labor, and TFP (or Solow residual). However, the problem with this method is that it assumes the perfect efficient allocation of inputs. That is, there is no distortion in the economy, capital and labor are allocated perfectly so that the sole contribution to TFP is simply technological progress. In the long-term, this assumption may be reasonable, however, in the short term, it becomes too restrictive. For instance, the Lehman shock in 2008 may have caused large frictions in the financial sector, causing capital and labor to move to other sectors, which could have improved aggregate productivity. The simple Solow accounting cannot capture this effect.

Petrin & Levinsohn (2013) provide a method to further decompose Solow residual growth into 2 portions: TE (technological efficiency, a.k.a technological progress) and RE (reallocation efficiency of inputs). The inputs include capital, labor, and intermediate goods. The Solow residual is now referred to as APG (aggregate productivity growth). We decompose TE and RE and then use the EUKLEMS sector-specific micro-level data to quantify such effects and examine how much TE and RE contribute to APG.

The economy in this growth accounting exercise is Belgium, but the model can be extended to any country with sufficient data similar to EUKLEMS.

This article is Part 1 in a 2-Part Series.

Production of a representative firm

The gross output production function for firm i:

\[Q_i = Q_i (L_i, X_i, \omega_i)\]


$Q_i = \text{firm i’s output}$

$L_i = \text{firm i’s primary inputs} = {L_{ik}} = (L_{i1}, L_{i2}, …, L_{ik})$ => firm i uses inputs from sector k. Primary inputs include labor and capital

$X_i = \text{firm i’s intermediate inputs} = {X_{ij}} = (X_{i1}, …, X{ij})$ => firm i uses output of firm j

$\omega_i = \text{firm i’s technical efficiency}$

Measure the change in $Q_i$:

\[d Q_i = \frac{\partial Q_i}{\partial L_i}d L_i + \frac{\partial Q_i}{\partial X_i}d X_i + \frac{\partial Q_i}{\partial \omega_i}d \omega_i\]

Notice that:

$d L_i = \frac{\partial L_i}{\partial L_k}dL_{ik}$, $d X_i = \frac{\partial X_i}{\partial X_j}d X_{ij}$


\[d Q_i = \frac{\partial Q_i}{\partial L_k}d L_{ik} + \frac{\partial Q_i}{\partial X_j}d X_{ij} + \frac{\partial Q_i}{\partial \omega_i}d\omega_i\]


Final production that goes to consumers should be:

\[Y_i = Q_i - \sum_j X_{ji}\]


$Q_i = \text{firm i’s total production}$

$\sum_j X_{ji} = \text{total output from firm i that is used as intermediate goods}$

$\sum_i Y_i$ should be the GDP (production goes to consumers, after eliminating all intermediate goods)


\[dY_i = dQ_i - \sum_j dX_{ji}\]

which means:

\[\Rightarrow d Y_i = \frac{\partial Q_i}{\partial L_k}d L_{ik} + \frac{\partial Q_i}{\partial X_j}d X_{ij} + \frac{\partial Q_i}{\partial \omega_i}d\omega_i - \sum_j dX_{ji}\]

Determination of APG

By definition, APG measures the change in aggregate final demand and the change in aggregate inputs.

We want to know how much the economy gains from such changes in input allocation.

\[APG \equiv \color{red}{\sum_i P_i \ dY_i} - \color{blue}{\sum_i\sum_k W_{ik} \ dL_{ik}}\]

where the first term is the gain from output itself (namely by technological innovations) minus the second term, which is the “gain” from the changes in the cost of inputs, which are labor and intermediate goods.

Intermediate steps:

\[\begin{aligned} &\color{red}{\sum_i P_i \ dY_i} = \sum_i P_i \left(\frac{\partial Q_i}{\partial L_k}d L_{ik} + \frac{\partial Q_i}{\partial X_j}d X_{ij} + \frac{\partial Q_i}{\partial \omega_i}d\omega_i - \sum_j dX_{ji}\right) \\ &= \sum_i P_i\frac{\partial Q_i}{\partial \omega_i}d\omega_i + \sum_i\sum_k P_i\frac{\partial Q_i}{\partial L_k}dL_{ik} + \sum_i\sum_jP_i\frac{\partial Q_i}{\partial X_j}dX_{ij} - \sum_i P_i\left(\sum_j dX_{ji}\right) \\ &= \sum_i P_i\frac{\partial Q_i}{\partial \omega_i}d\omega_i + \sum_i\sum_k P_i\frac{\partial Q_i}{\partial L_k}dL_{ik} + \sum_i\sum_jP_i\frac{\partial Q_i}{\partial X_j}dX_{ij} - \sum_i\sum_j P_j dX_{ij} \end{aligned}\]

(Note (*) : The last terms is obtained because $\sum_i\sum_j P_i X_{ij} = \sum_i\sum_j P_j X_{ji}$)

so we rewrite the APG formula as follows.

\[\begin{aligned} &APG = \color{red}{\sum_i P_i\frac{\partial Q_i}{\partial \omega_i}d\omega_i + \sum_i\sum_k P_i\frac{\partial Q_i}{\partial L_k}dL_{ik} + \sum_i\sum_j(P_i\frac{\partial Q_i}{\partial X_j} - P_j)\ dX_{ij}} - \color{blue}{\sum_i\sum_k W_{ik} \ dL_{ik}} \\ &= \color{red}{\sum_i P_i\frac{\partial Q_i}{\partial \omega_i}d\omega_i} + \color{blue}{\sum_i\sum_k ( P_i\frac{\partial Q_i}{\partial L_k} - W_{ik})\ dL_{ik}} + \color{green}{\sum_i\sum_j(P_i\frac{\partial Q_i}{\partial X_j} - P_j)\ dX_{ij}} \end{aligned}\]


$\color{red}{red}$: gain from technical efficiency changes (TE)

$\color{blue}{blue}$: gain from reallocation of pirimary nput factors such as Labor and Capital (RE_lab, RE_cap)

$\color{green}{green}$: gain from reallocation of intermediate goods (RE_ii)

$\Rightarrow$ We can see that the change in final demand (output) comes from 3 sources: tech growth, RE in inputs, and RE of intermediate goods.

This is one of the most important equations to decompose APG. The next thing we want to do is to measure and decompose its rate of change $g_{APG}$.

The rate of change of APG

Value Added (VA)

To find the formula for $g_{APG}$, we introduce the term VA.

Basically, VA is the additional value a firm i adds to the value of intermediate goods. When firm i buys intermediate goods to produce stuff and then sells it, the difference is the value-added. By definition:

\[{VA}_i \equiv P_iQ_i - \sum_j P_j X_{ij} = P_i(Y_i + \sum_j X_{ji}) - \sum_j P_j X_{ij}\]

Total value added:

\(\sum_i VA_i = \sum_i P_i Y_i + \sum_i\sum_jP_iX_{ji} - \sum_i\sum_jP_jX_{ij} = \sum_i P_iY_i\) (the last 2 terms cancel out, see Note(*))

Since Y is the final output, $\sum_i P_i Y_i$ is equivalent to the nominal GDP.

Furthermore, since APG measures the total changes in production, i.e., APG by itself indicates the change from the baseline output (nominal GDP $\equiv \sum_i VA_i$).

APG Growth

The Aggregate Productivity Growth is defined as:

$g_{APG} = \frac{APG}{\sum_i VA_i}$

There are 2 ways to estimate $g_{APG}$

(1) Use the Solow residual as a proxy

Decomposition of $g_{APG}$ gives:

\[\begin{aligned} &g_{APG} = \frac{APG}{\sum_i VA_i} = \frac{1}{\sum_i VA_i} \left(\sum_i P_i d Y_i - \sum_i\sum_k W_{ik} dL_{ik}\right) \\ &= \sum_i \frac{P_i Y_i}{\sum_i VA_i}\frac{dY_i}{Y_i} - \sum_k\sum_i \frac{W_{ik}L_{ik}}{\sum_i VA_i}\frac{d L_{ik}}{L_{ik}} \end{aligned}\]

This is actually the Solow residual:

\[\frac{\dot{Y}}{Y} - \alpha_K\frac{\dot{K}}{K} - \alpha_L\frac{\dot{L}}{L}\]


\[g_{APG} = \color{red}{\sum_i \frac{P_i Y_i}{\sum_i VA_i}\frac{dY_i}{Y_i}} - \color{blue}{\sum_k\sum_i \frac{W_{ik}L_{ik}}{\sum_i VA_i}\frac{d L_{ik}}{L_{ik}}}\]

since $\sum_i VA_i = \sum_i P_iY_i$ so

\[\color{red}{\sum_i \frac{P_i Y_i}{\sum_i VA_i}\frac{dY_i}{Y_i}} = \sum_i \frac{dY_i}{Y_i} = \frac{dY}{Y}\]

This is the total change in GDP.


\[\color{blue}{\sum_k\sum_i \frac{W_{ik}L_{ik}}{\sum_i VA_i}\frac{d L_{ik}}{L_{ik}}} = \sum_k \left( \frac{\sum_i (W_{ik}L_{ik})}{\sum_i VA_i}\right)\sum_i\left(\frac{W_{ik}L_{ik}}{(W_{ik}L_{ik})}\frac{dL_{ik}}{L_{ik}}\right)\]

The first term $\sum_k$ is the factor income share a la Solow $\equiv \alpha_k$

The second term $\sum_i$ is the total change in input factors

Thus, $g_{APG} = \frac{dY}{Y} - \sum_k \alpha_k \frac{dL_k}{L_k}$

Comment: This estimation might be overestimated because it includes several sectors that are not productive or of foreign entities.

(2) Use VA

Define the change in $VA_i$ as:

\[dVA_i \equiv P_i\ dQ_i - \sum_j P_j \ dX_{ij}\]

Aggregation gives:

\[\sum_i dVA_i = \sum_i P_i dQ_i - \sum_i\sum_j P_j dX_{ij}\]

Plugging it into the following equation gives:

\[\begin{aligned} &APG = \sum_i P_i dY_i - \sum\sum W_{ik}dL_{ik} = \sum_i P_i dQ_i - \sum\sum P_j dX_{ij} - \sum\sum W_{ik}dL_{ik} \\ &\equiv \sum_i dVA_i - \sum\sum W_{ik}dL_{ik} \end{aligned}\]

Using the definition of $g_{APG}$ , we have:

\[g_{APG} = \frac{APG}{\sum_i VA_i} = \frac{\sum_i d VA_i - \sum\sum W_{ik}dL_{ik}}{\sum_i VA_i}\]

Note that, by taking logs and differentiating wrt time, we have:

Let $y = \ln(VA)$

\(\begin{aligned} &\Rightarrow \frac{dy}{dt} = \frac{dy}{d\ VA}\frac{dVA}{dt} = \frac{1}{VA}\frac{dVA}{dt} \\ &\Rightarrow \frac{dVA_i}{dt} = VA_i \frac{dy}{dt} \\ &\Rightarrow d\ VA_i = VA_i d \ln(VA_i) \end{aligned}\) Likewise for the other term:

\[\frac{dL_{ik}}{dt} = W_{ik}L_{ik} \frac{d\ln(L_{ik})}{dt} \Rightarrow W_{ik}dL_{ik} = \frac{W_{ik}L_{ik}}{VA_i} \ VA_i d\ln(L_{ik})\]

So $g_{APG}$ can be rewritten as:

\[\begin{aligned} &g_{APG} = \frac{1}{\sum VA_i} \left(\sum_i dVA_i- \sum\sum W_{ik} dL_{ik}\right) \\ &\Rightarrow g_{APG} = \frac{1}{\sum VA_i} \left(VA_i \sum_i d \ln(VA_i) - VA_i \frac{W_{ik}L_{ik}}{VA_i} d\ln(L_{ik})\right) \\ &\Rightarrow \color{red}{g_{APG}= \sum_i D_i^\nu d \ln(VA_i) - \sum_i D_i^\nu \sum_k s^\nu_{ik} d \ln(L_{ik})} \end{aligned}\]

where: $D_i^\nu = \frac{VA_i}{\sum_{i} VA_i}$, also known as the Domar Weight, $s_{ik}^\nu = \frac{W_{ik}L_{ik}}{VA_i}$

Referring to EUKLEMS dataset

$VA_i$: dataNA / VA (GVA, current prices)

$W_{ik}L_{ik}$: dataGA / LAB (Labor compensation) & CAP (capital compensation)

$\ln(L)$ : take logs of dataGA / LAB_QI (labor services) & CAP_QI (capital services)

$\ln(VA_i)$: take logs of dataNA / VA_Q

Thus, from the formula in red, we can estimate APG growth.

Decomposition of $g_{APG}$


\[APG = \color{red}{\sum_i P_i\frac{\partial Q_i}{\partial \omega_i}d\omega_i} + \color{blue}{\sum_i\sum_k ( P_i\frac{\partial Q_i}{\partial L_k} - W_{ik})\ dL_{ik}} + \color{green}{\sum_i\sum_j(P_i\frac{\partial Q_i}{\partial X_j} - P_j)\ dX_{ij}}\]

Using the same technique as before, it is not hard to derive that:

$\sum_i P_i\frac{\partial Q_i}{\partial \omega_i}d\omega_i = \sum_i P_i \frac{\partial Q_i}{\partial \omega_i}\omega_i d \ln(\omega_i) = \sum_i P_iQ_i\frac{\partial Q_i}{\partial \omega_i}\frac{\omega_i}{Q_i} d \ln(\omega_i)$

$\sum_i\sum_k ( P_i\frac{\partial Q_i}{\partial L_k} - W_{ik})\ dL_{ik} = \sum_i P_iQ_i\sum_k\left( \frac{\partial Q_i}{\partial L_{ik}} \frac{L_{ik}}{Q_i} - \frac{W_{ik}L_{ik}}{P_iQ_i}\right) d\ln(L_{ik})$

$\sum_i\sum_k ( P_i\frac{\partial Q_i}{\partial X_j} - P_{j})\ dX_{ij} = \sum_i P_iQ_i\sum_j \left( \frac{\partial Q_i}{\partial X_{ij}} \frac{X_{ij}}{Q_i} - \frac{P_{ij}X_{ij}}{P_iQ_i}\right) d\ln(X_{ij})$

Using the formula of $g_{APG} = \frac{APG}{\sum_i VA_i}$ , we get:

\[g_{APG} = \frac{APG}{\sum VA_i} = \color{blue}{\sum_i D_i \sum_k (\beta_{ik} - s_{ik})d \ln(L_{ik})} + \color{green}{\sum_i D_i \sum_j (\beta_{ij} - s_{ij})d \ln(X_{ij})} + \color{red}{\sum_i D_i \beta_w d\ln(\omega_i)}\]

where $\frac{P_iQ_i}{\sum_i VA_i} \equiv D_i$ <- Gross Output Domar Weight

$\beta$ term is the Elasticity of output wrt input $\beta_{ik} = \frac{\partial Q_i}{\partial L_{ik}}\frac{L_{ik}}{Q_i}$, $\beta_{ij} = \frac{\partial Q_i}{\partial X_{ij}}\frac{X_{ij}}{Q_i}$

$s$ as factor income share: $s_{ik} = \frac{W_{ik}L_{ik}}{P_iQ_i}, s_{ij} = \frac{P_{ij}X_{ij}}{P_iQ_i}$

From here, Reallocation term (blue and green) can be dervied explicitly while the red term is derived as the residual

Referring to the EUKLEMS dataset

$P_iQ_i$ : dataNA / GO (Gross Output, current prices

$W_{ik}L_{ik}$: dataGA/ LAB & CAP (labor and capital compensation) -> for calculation of $s_{ik}$

$P_{ij}X_{ij}$: dataNA / II (intermediate inputs) -> for calculation of $s_{ij}$

$d \ln{L_{ik}}$ : the change in the logs of real inputs (take logs of dataGA/ LAB_QI, CAP_QI )

$d \ln(X_{ij})$: the change in the logs of real inputs (take logs of dataNA/ II_Q )

Estimation for $\beta$ is trickier. In the long run, elasticity $\beta$ should converge towards factor income share $s$, but not neccessarily in the short-run. When we examine its dynamics over long period of time, taking the average of factor income share can also be a good approximation for $\beta$ during that period.

Now we have all the necessary ingredients (formulas and data) to conduct APG decomposition. The implementation in R will be introduced in part 2.

Written on June 6, 2021