Skip to main content

Introduction to Location Quotients

Introduction

Location quotients (LQ) are a transformation of cross-sectional data frequently used in various contexts in empirical economics. The LQ transformation gained prominence in studies involving international trade flows, notably after Balassa's introduction of the revealed comparative advantage index (RCA) (Balassa, 1965). Beyond international trade, location quotients have been applied to measure economic intensity, such as employment or sales by place and industry, and the number of patents by place and technological field (Florence, 1939; de Solla Price, 1981).

Purpose and Challenges

The primary goal of the LQ transformation is to account for the size of entities when evaluating data magnitudes. For instance, a billion USD is significant for a small country but minor for larger countries. Thus, it is reasonable to weigh raw data by the size of the entities they belong to.

However, a critical issue with location quotients is that LQ values cannot be compared across entities, time periods, or datasets. There is no absolute measure of intensity or scale for LQ values due to the influence of entity size and table total on LQ value spread. Smaller entities tend to exhibit wider fluctuations in LQ levels, a phenomenon observed in business sales data (Stanley, 1996). These effects, stemming from entity sizes and observations, are termed size effects.

Notation and LQ Transformation

Let us define some useful notation. Typically, we have a total SWS_W (e.g., world exports) disaggregated into at least two independent categories, C and P (e.g., countries, products). This notation applies to various empirical settings, like patent counts by country and technology class, or employment levels by city and industry. The value associated with entities cCc \in C and pPp \in P is denoted as scps_{cp}, with all values being non-negative. The table total is SW=cpscpS_W = \sum_{cp} s_{cp}. The entity sizes are Sc=pscpS_c = \sum_p s_{cp} and Sp=cscpS_p = \sum_c s_{cp}. Applying the location quotient transformation to raw data scps_{cp} in a contingency table involves replacing entries scps_{cp} with:

LQcp=scpSWScSpLQ_{cp} = \frac{s_{cp} S_W}{S_c S_p}

When LQ_cp=1LQ\_{cp} = 1, the corresponding scps_{cp} value aligns with the expected distribution from independent marginal distributions ScS_c and SpS_p. Also, LQ_cp=0LQ\_{cp} = 0 if and only if scp=0s_{cp} = 0. All other positive LQ values relate to this relation, but complete identification between scps_{cp} and marginal probabilities only occurs if LQ=1LQ = 1. Thus, LQ=1LQ = 1 is generally considered the only LQ level with a clear interpretation.

Size Effects and Probabilistic LQ

Consider the magnitude of size effects on LQ. For instance, for observations where LQ0.8LQ \approx 0.8 in export data, the probability of shifting to LQ>1LQ > 1 within a year is 23%23\% for values between 10410^4 USD and 10510^5 USD, 8.4%8.4\% for values between 10610^6 USD and 10710^7 USD, and 1.0%1.0\% for values between 10810^8 USD and 10910^9 USD. This indicates that a single LQ level implies a more optimistic scenario for small observations than large ones.

This paper leverages these patterns to measure size distortions. Correctly characterizing these distortions is crucial for reliable results using LQ indices.

Solutions and Framework

A quick solution might involve taking z-scores within entities, as suggested by O'Donoghue and Zheng. However, deciding the scope of these z-scores—whether within rows, columns, across time, or combinations—is essential. Alternatively, some have tweaked the LQ expression or abandoned it for other indices.

What is needed is an organized understanding of size effects in location quotients and their measurement. Examining the width of LQ level jumps over time is useful, as the local width of these jumps relates to the local width of the LQ metric. When LQ varies widely, the chances of crossing the LQ=1LQ = 1 threshold in any time step are larger for equal LQ0LQ_0 starting levels.

This paper proposes a formal framework using the difference between the log of observed values (x1x_1) and the log of expected values from uncorrelated margins (x2x_2), expressed as log(LQ)=x1x2\log(LQ) = x_1 - x_2. This creates a two-dimensional space (x1x_1, x2x_2) for LQ observations from entities of different sizes. After describing empirical information, I present results from computing probabilities of surpassing the LQ=1LQ = 1 threshold based on sizes x1x_1, x2x_2, termed probabilistic LQ or pLQ. This method reveals equivalent distances to the LQ=1LQ = 1 level, addressing size distortion characterization.

Conclusion and Structure

I test a model clarifying the relationship between size volatility and LQ volatility, translating these into jump distributions and estimating chances of surpassing the LQ=1LQ = 1 threshold. This exercise formally closes the discussed elements.

Overall, this paper offers formal paths and practical solutions for robustly assessing location quotient issues. The tools presented enable new techniques, such as consistently defining LQ level categories for all entities, enriching studies using 'revealed comparative advantage' as a regression variable.

The following sections are organized as follows: Section 2 reviews literature on location quotient indices, Section 3 discusses the dataset, Section 4 presents a mathematical framework for LQ indices, Section 5 introduces the probabilistic LQ index, and Section 6 offers models describing size effects. Section 7 concludes.