In practice, it is beneficial to set the coexistence of an industry with itself to be equal to 1. This requires a normalization in the definition of the dot product and the joint probability. We rescale the joint probability so that when computed for a function on itself, the result is 1. This normalization ensures that the joint probability remains independent of proportional scaling of the density function of some industries (e.g., changing Fy to 2Fy).
The expression for the normalized joint probability is:
R∬Fx2dRR∬Fy2dRR∬FxFydR
An analogous requirement applied to the dot product of areal vectors leads to an expression of cosine similarity:
a∑Ex,a2a∑Ey,a2a∑Ex,a.Ey,a
We aim to minimize the summations B and C, ensuring the amplitudes in both continuous and discrete cases are linked:
For exponential decay density functions at Δ=0, we have:
R∬fx,i(x)fy,j(x)dR=2π(bi+bj)2titj
To summarize, for self-overlap of an industry (Δ=0, i=j), these results are expressed as:
R∬hx,i2(x)dR=2πsi2ti2
where si≡2σi2 for Gaussian and si=4bi2 for Laplace.
For an industry’s similarity with itself, Eq. disc_cont_self links overlaps in continuous space with observed employee counts. By replacing self_overlap_gen into disc_cont_self, we derive the intensity of the density function in terms of observed employment:
ti=2πsi∣∣Ex,a∣∣∫AFx2dREx,i
This equation indicates that the intensity of the probability density function is proportional to the number of employees, with the proportionality factor determined by the ratio of norms in discrete and continuous space and the width of influence.