Skip to main content

Reviewing the Diversification Issues

Reviewing the Diversification Issues

When a national or global aggregate time series is constructed from the contributions of thousands or even millions of agents, a natural question arises: Why don't their idiosyncratic shocks cancel out?

The intuition suggests that the volatility of such a time series should decrease as σ1/N\sigma \sim 1/\sqrt{N}, where NN is the number of agents. This decay rate is typical in a population of agents with additive Gaussian fluctuations. This concept is often referenced, as seen in the observation by Lucas (1977): "... in a complex modern economy, there will be a large number of such shifts in any given period, each small in importance relative to total output. There will be much 'averaging out' of such effects across markets."

Aggregate Standard Deviation

Is the magnitude of the aggregate standard deviation approximately 1/N1/\sqrt{N}? If not, why?

A quick response is that it is not, because all agents can experience growth or decline due to an 'aggregate shock' that exceeds 1/N1/\sqrt{N}. Lucas (1977) also noted: "... there have been many instances of shocks to supply which affect all, or many, sectors of the economy simultaneously. Such shocks will not cancel in the way I have described, and they will induce output fluctuations in the aggregate."

The question then becomes whether the idiosyncratic part of aggregate volatility follows a 1/N1/\sqrt{N} rule. The answer is that it declines more mildly, but this issue requires a careful approach.

Variance and Covariance in Sectoral Fluctuations

Consider a partition of firms into sectors with mild fluctuations. From previous equations, the variance of the log of XX is approximately:

var[log(X)]var[X](ln(10)E[X])2i,jSˉiSˉjXˉ2cov(Fi,Fj)var [\log(X)] \approx \frac{var[X]}{ \left( \ln(10) E[X] \right)^2} \approx \sum_{i, j} \frac{\bar S_i \bar S_j}{\bar X^2} \operatorname{cov} (F_i, F_j)

where i,ji, j denote pairs of parts pp. Sectoral log fluctuations can be expressed as Fpt=mpt+σpϵptF_{pt} = m_{pt} + \sigma_p \epsilon_{pt}. Thus, the covariance is:

cov(Fi,Fj)=cov(mit+σiϵit,mjt+σjϵit)=cov(mit,mjt)+cov(mit,σjϵjt)+cov(σiϵit,mjt)+cov(σiϵit,σjϵjt)\begin{split} \operatorname{cov} (F_i, F_j) = & cov(m_{it} + \sigma_i \epsilon_{it}, m_{jt} + \sigma_j \epsilon_{it}) \\ = & cov(m_{it}, m_{jt}) + cov(m_{it}, \sigma_j\epsilon_{jt}) \\ & + cov( \sigma_i \epsilon_{it}, m_{jt} )+ cov( \sigma_i \epsilon_{it}, \sigma_j \epsilon_{jt}) \end{split}

This equation contains P2P^2 elements to be summed for aggregate variance. In special cases, this expression simplifies, providing insights into real-life settings.

Uncorrelated Shocks and Unit Variance

For uncorrelated shocks of unit variance, var(ϵit,ϵjt)=δijvar(\epsilon_{it}, \epsilon_{jt}) = \delta_{ij} (Kronecker's delta), the equation simplifies to:

var[log(X)]i,jSˉiSˉjXˉ2[cov(mit,mjt)+σjcov(mit,ϵjt)+σicov(ϵit,mjt)+σiσjδij]var [\log(X)] \approx \sum_{i, j} \frac{\bar S_i \bar S_j}{\bar X^2} \left[ cov(m_{it}, m_{jt}) + \sigma_j cov(m_{it}, \epsilon_{jt}) + \sigma_i cov(\epsilon_{it}, m_{jt} )+ \sigma_i \sigma_j \delta_{ij} \right]

If time series mptm_{pt} are uncorrelated with idiosyncratic shocks, then:

var[log(X)]i,jSˉiSˉjXˉ2[cov(mit,mjt)+σiσjδij]var [\log(X)] \approx \sum_{i, j} \frac{\bar S_i \bar S_j}{\bar X^2} \left[ cov(m_{it}, m_{jt}) +\sigma_i \sigma_j \delta_{ij} \right]

Aggregate and Idiosyncratic Shocks

Aggregate shocks and idiosyncratic shocks both contribute to aggregate volatility, and either can dominate. Early solutions assumed idiosyncratic variance vanishes at a rate σ21/N\sigma^2 \sim 1/N, yet empirical evidence shows a milder decay.

If all parts pp share a single mptm_{pt} term:

var[log(X)]i,jSˉiSˉjXˉ2[var(mpt)+σiσjδij]var [\log(X)] \approx \sum_{i, j} \frac{\bar S_i \bar S_j}{\bar X^2} \left[ var(m_{pt}) +\sigma_i \sigma_j \delta_{ij} \right]

For parts of nearly equal size, Sˉp/Xˉ1/P\bar S_p / \bar X \approx 1/P, the sum yields:

var[log(X)]Sˉp2Xˉ2 P2var(mpt)σm2+pSˉp2Xˉ2σp2σϵ2=var(mpt)σm2+1P2pσp2σϵ2var [\log(X)] \approx \underbrace{\frac{\bar S_p^2}{\bar X^2} \ P^2 var(m_{pt})}_{\text{$\sigma^2_m$}} + \underbrace{ \sum_{p} \frac{\bar S_p^2}{\bar X^2} \sigma_p^2}_{\text{$\sigma^2_\epsilon $}} = \underbrace{var(m_{pt})}_{\text{$\sigma^2_m$}} + \underbrace{ \frac{1}{P^2} \sum_{p} \sigma_p^2 }_{\text{$\sigma^2_\epsilon $}}

Linear variance is:

var[X]ln2(10)(Xˉ2var(mpt)+pSˉp2σp2)var [X] \approx \ln^2(10) \left( \bar X^2 var(m_{pt}) + \sum_p \bar S^2_p \sigma^2_p\right)

Effective Number of Agents

The effective number of agents NeffN_{eff} may be less than the actual number NN, due to the concentration of contributions among a few large agents. For example, 100 companies might account for half the exports, and 10,000 for the other half, leading to Neff330N_{eff} \approx 330, not N=10100N = 10100.

Idiosyncratic Variance Decay

Gabaix (2011) suggests a power law size distribution leads to a milder decay σϵ2Nα\sigma_\epsilon^2 \sim N^{-\alpha}. However, this is derived for the Herfindahl index and not directly applicable to aggregate idiosyncratic variance. The assumption that σk\sigma_k is constant across agents is unrealistic.

The proposition in Gabaix (2011) should be limited to Herfindahl index decay, not aggregate idiosyncratic variance. Further investigation is needed to understand the decay of idiosyncratic variance with population size.

In subsequent sections, tools are provided for studying the σϵ2Nα\sigma_\epsilon^2 \sim N^{-\alpha} dependence, showing how nonlinearities contribute to comovement terms among agents.