Skip to main content

Dependence of Variance with Population Size

Understanding the Dependence of Variance with Population Size

The framework in this section is useful for approaching the problem of dependence with population size (NN). We denote a reference population size as N0N_0 and consider a different population size N1=k N0N_1 = k\ N_0.

Scheme for analysing the decay of volatility with population size implied by a power law

A power law of variance with NN implies a linear relation between log(σ2)\log(\sigma^2) (equivalent to 2log(σ)2 \log(\sigma)) and log(N)\log(N). The slope of this line is α-\alpha, which determines the ratio Dy/DxDy/Dx. If we consider two populations, where the second is a multiple kk of the first, then Dx=log(k)Dx = \log(k), and the difference in variance between these two populations is given by αDx=αlog(k)-\alpha Dx = -\alpha \log(k). These relationships apply not only to the total population but also to its parts.

To link changes in total population to changes in parts' population, consider: if we sample N1=k N0N_1 = k\ N_0 agents from a population, on average, the population of each part pp is k np(N0)k\ n_p(N_0), where np(N0)n_p(N_0) is the expected population at part pp when the total population size is N0N_0. In logarithmic scale, this means that if log(N1)=log(N0)+log(k)\log(N_1) = \log(N_0) + \log(k), then log(np(N1))=log(np(N0))+log(k)\log(n_p(N_1)) = \log(n_p(N_0)) + \log(k) for all parts pPp \in P.

Empirically, the dependence of a part's log variance with changes in the part's log population can be approximated qualitatively by a line of slope α-\alpha:

log(σp2(np))=cαplog(np)σp2(np)=10cnpαp\log(\sigma^2_p(n_p)) = c - \alpha_p \log(n_p) \Leftrightarrow \sigma^2_p(n_p) = \frac{10^c}{n_p^{\alpha_p}}

The accuracy of this model can be tested a posteriori. When changing npn_p for knpk n_p, the levels of log(σp2)\log(\sigma^2_p) change as:

log(σp2(np(N0)))αplog(k)=log(σp2(k np(N0)))1kαpσp2(np(N0))=σp2(k np(N0))\log(\sigma^2_p(n_p(N_0))) - \alpha_p \log(k) = \log(\sigma^2_p(k\ n_p(N_0))) \hspace{.5cm} \Leftrightarrow \hspace{.5cm} \frac{1}{k^{\alpha_p}} \sigma^2_p(n_p(N_0)) = \sigma^2_p(k\ n_p(N_0))

If all parts pp present a common αpα\alpha_p \equiv \alpha exponent, then when replacing this value in the expression of the idiosyncratic term of aggregate variance, the dependence with α\alpha comes out as a common factor:

1kα1P2pσp2(np(N0))=1P2pσp2(k np(N0))\frac{1}{k^\alpha} \frac{1}{P^2} \sum_p \sigma^2_p(n_p(N_0)) = \frac{1}{P^2} \sum_p \sigma^2_p(k\ n_p(N_0))

Thus, the relation shown by the parts is itself valid for the aggregate:

log(σϵ2(N0))α log(k)=log(σϵ2(k N0))1kασϵ2(N0)=σϵ2(k N0)\log(\sigma^2_\epsilon(N_0)) - \alpha\ \log(k) = \log(\sigma^2_\epsilon(k\ N_0))\hspace{.5cm} \Leftrightarrow \hspace{.5cm} \frac{1}{k^\alpha} \sigma^2_\epsilon(N_0) = \sigma^2_\epsilon(k\ N_0)

This leads to the equation:

log(σϵ2(N))=cαlog(N)σϵ2(N)=CNα\log(\sigma^2_\epsilon(N)) = c' - \alpha \log(N) \Leftrightarrow \sigma^2_\epsilon (N) = C' N^{-\alpha}

This equation indicates that if we plot the idiosyncratic term of var(X)var(X) as a function of population sampling size NN in a log-log scale, it will show a slope α-\alpha.

Decay of idiosyncratic volatility with population size

In the special case where all parts have the same variance σp2\sigma^2_p, the idiosyncratic part of aggregate variance fulfills σϵ2=Pσp2/P2=σp2/P\sigma^2_\epsilon = P \sigma^2_p/P^2 = \sigma^2_p/P. Therefore, log(σϵ2)=log(σp2)log(P)\log(\sigma^2_\epsilon) = \log(\sigma^2_p) - \log(P). In our case, P=10P = 10, so log(P)=1\log(P) = 1. This determines the 1-1 variance drop when comparing parts to aggregate.

We have expressed the idiosyncratic part of aggregate variance both as a function of total population NN and as a function of parts' population npn_p. They both should show a common α\alpha. Empirically, the observed slope α\alpha of variance decay with population size is αX=0.48\alpha_X = -0.48 for exports data, and αM=0.49\alpha_M = -0.49 for imports data. These values were computed from parts' variances (blue lines) and can be extended to describe aggregate idiosyncratic variance (yellow lines).

So far, we can measure the rate of decay of aggregate variance with population size (α\alpha). We know that the rate of decay of parts is related to the rate of decay in the aggregate. However, we have yet to understand why this slope has its particular value. To explore this, we need to examine what occurs within the parts themselves, which will be the focus of the following section.