Size Distribution of Firms
Size Distribution of Firms
The size distribution of firms in the French traders dataset is illustrated in the figure below, with exporters on the left and importers on the right. The distributions for multiple years are superimposed, with blue and yellow dots representing the distribution of the number of agents by size bin and traded value by size bin, respectively. The plots use a log-log scale, where log-normal size distributions appear as quadratic parabolas.
It is evident that the size distribution of French traders is compatible with a log-normal distribution (truncated at the minimum value ). This aligns with previous evidence regarding populations of economic agents. However, there is some drift and widening of the fitted log-normal parameters over the years, suggesting that the size distribution may not be stationary. This widening would be more pronounced if firms experienced random log jumps akin to Brownian motion, indicating that negative autocorrelation of shocks may stabilize agents' time series.
A log-normal size distribution implies a concentration of significant value among the largest agents. For French traders, 90% of exports are concentrated among firms exporting more than , and 90% of imports are concentrated among firms importing more than .
If we focus on these largest agents, a Pareto (power law) rule is also partly compatible with the observed size distribution. Thus, a log-normal and a Pareto rule are not contradictory descriptions of the size distribution of firms. The Pareto model provides a linear approximation to the upper tail of the distribution, making it a useful model for studying the consequences of size distributions, even if not completely accurate empirically.
Both log-normal and Pareto size distributions imply concentration, leading to the familiar 80%-20% concentration rule. This feature highlights the high concentration of value among a small portion of agents, given the empirically observed distribution parameters.
Formalizing Size Distributions
Consider a histogram that shows the number of agents with total sales in each interval of a partition of the real numbers. The total number of firms is the sum of the population of all bins . If is small enough, we can approximate firm sizes by . The total value associated with firms in the bin is . The total value disaggregated by firm size is:
Let us denote firm sizes in linear scale as and the normalized probability density function (PDF) of firm sizes as . Similarly, there is a PDF of value: . In the continuous limit, equation becomes:
Here, value and population are linked by the fact that the first moment of the value of the population distribution is the zeroth moment of the value distribution. The average firm sales allows us to express . For each size bin: .
The integrands in the equation tell us:
X p_{val}(s) = N p_{cnt}(s)\ s\For a given value , the amount is how much the holders of such sum hold altogether. For example, if people hold 100 dollars () each, this group has dollars.
From the equation, we see that the distribution of firm sizes determines the distribution of value . Using the log variable in place of , i.e., , the size distribution becomes: . The product of and results in a shifted parabola , making the distribution of value log-normal if the distribution of agent sizes is log-normal. This shift causes the concentration of value among large agents.
The most crucial characteristic of empirical distributions of firm-level sales is their definition on a log scale. They can be expressed as with a normal distribution or an exponentially decreasing distribution (Pareto rule). This has been consistently observed in various populations of economic agents and is key to our study. World trade, split by different categories, also shows log-normal distributions and fluctuations, suggesting broader applicability of these tools.