Who invented epidemic modeling?

Published:
Updated:
Who invented epidemic modeling?

The quest to pinpoint the single inventor of epidemic modeling often leads down a winding path, much like the spread of the very diseases these models seek to describe. There isn't a single “eureka” moment or one solitary genius who first scribbled down an equation for disease spread; rather, the field coalesced from centuries of statistical observation into rigorous mathematical structures. However, the discipline as we recognize it today—using differential equations to track populations through stages of infection—has clear foundational pillars established in the early 20th century. [8]

# Early Observations

Who invented epidemic modeling?, Early Observations

Long before complex mathematical frameworks were developed, people observed, recorded, and tried to make sense of outbreaks. Early efforts focused on simple counting and charting, attempting to understand the speed and scope of plagues. These initial steps were observational, noting patterns like the time it took for a disease to move from one town to the next, or the relationship between population density and case counts. Think of early epidemiological efforts, which laid the groundwork by showing that epidemics were measurable phenomena, not just random divine acts. [7] While these statistical snapshots were crucial for public health responses—like implementing quarantines—they lacked the predictive power that mathematical models later provided. They described what happened, but not precisely how the system worked dynamically over time.

# Mathematical Foundation

Who invented epidemic modeling?, Mathematical Foundation

The true shift into mathematical modeling occurred when researchers began to treat the transmission process as a continuous, dynamic system. The landmark contribution here belongs to W. O. Kermack and A. G. McKendrick. In 1927, they published seminal work that introduced the SIR model. [6][8] This was a game-changer because it moved the study from static counts to dynamic rates of change, using differential equations to describe the flow of individuals between compartments. [6]

The SIR model divides the population into three groups:

  1. Susceptible: Individuals who can catch the disease.
  2. Infected: Individuals who currently have the disease and can transmit it.
  3. Removed: Individuals who have recovered (and are immune) or have died, meaning they can no longer catch or transmit the disease. [6][9]

Kermack and McKendrick’s work established key epidemiological concepts, most notably the basic reproduction number, denoted as R0R_0 (R-naught). [6] This number, derived from their equations, estimates the average number of secondary infections produced by a single typical infected individual in a completely susceptible population. [6] The framework they built was so elegant in its simplicity that it remains the conceptual scaffolding for nearly all subsequent, more complex compartmental models in mathematical epidemiology today. [8]

# Discrete Modeling

While Kermack and McKendrick focused on continuous change, another significant line of thought emerged focusing on transmission in discrete steps, which is often easier to visualize for localized, small-scale interactions. This leads us to the Reed-Frost model, developed by Wade Hampton Frost and Lowell J. Reed in the 1920s. [5]

The Reed-Frost model is fundamentally different because it operates in discrete time steps rather than continuous flow. [5] Instead of using differential equations, it relies on probability: if an infected person comes into contact with a susceptible person, there is a certain probability that transmission occurs during that specific contact interval. [5] This approach is particularly useful for understanding how a disease might spread through a closed population, such as an institution or a small, isolated community, where events happen in clear, sequential steps rather than a smooth continuum. [5]

Comparing the two, the SIR model excels at modeling large populations where the aggregate behavior smooths out individual stochastic (random) events, allowing for the predictive power of calculus. [6] The Reed-Frost model, conversely, preserves the randomness inherent in contact, making it more suitable for examining the initial seeding and propagation within small, defined groups. [5] Both models, emerging around the same era, demonstrate that the invention of modeling was not singular but involved parallel development of continuous and discrete mathematical thinking applied to health crises.

# Model Evolution

The core SIR structure provided the necessary vocabulary—S, I, R—but real-world pathogens rarely fit so neatly. Over the decades following Kermack and McKendrick's work, the modeling discipline exploded as researchers added complexity to account for biological reality. [1][3]

For instance, the simple assumption of immediate recovery or removal was often inadequate. This led to models like the SEIR structure, which adds an Exposed (E) compartment for individuals who have been infected but are not yet infectious (the latent period). [9] The progression would be SEIRS \to E \to I \to R. [9] This refinement became particularly important for diseases like tuberculosis or diseases with long incubation periods. [6]

The ability to test and refine these models was historically constrained by computational limits. Imagine trying to solve the complex differential equations for thousands of individuals manually; it was impossible. [3] The advent of electronic computers transformed modeling from a theoretical exercise into a practical tool for policy. When an outbreak hits, like the recent COVID-19 pandemic, the need to rapidly estimate key parameters—like the current reproduction number, RtR_t—demands significant computing power to process real-time case data against established mathematical structures. [3]

It is fascinating to consider how the intention behind the modeling changed over time. Early models, like SIR, were often developed to understand the natural history of a disease in the absence of interventions, establishing a baseline for comparison. [4] Today, modeling is almost always intervention-focused: asking what happens if we implement a mask mandate, close schools, or rapidly vaccinate 40% of the population. [1]


When applying these foundational concepts to practical public health decisions, a crucial, often overlooked, step involves translating model outputs back into actionable policy parameters. For example, the simple SIR model might calculate an R0R_0 of 2.5. This number, on its own, doesn't tell a mayor what capacity their hospital ICU needs to handle the surge. A policy analyst must then take that theoretical R0R_0, factor in local demographic data (like age distribution, which affects severity), existing healthcare capacity, and the expected time lag between infection and hospitalization. This translation step—moving from mathematical abstraction to concrete resource allocation—is where the expertise of applied epidemiology truly shines, a bridge not explicitly built by Kermack or Reed and Frost, but necessary for modern governance. [7]


Furthermore, the choice between a continuous model (like SIR) and a stochastic, discrete model (like Reed-Frost) often boils down to the scale and stage of the epidemic being studied. For a massive, ongoing epidemic like seasonal influenza across a continent, the continuous SIR model provides an excellent, stable approximation of the system dynamics. [6] However, if you are tracking the very first few introductions of a novel virus into a cluster of college dorms—say, only 10 people are initially infected—the noise and randomness inherent in discrete steps become dominant. In that early, low-count scenario, the continuous assumption breaks down, and a discrete, probability-based approach might offer a more truthful picture of whether the outbreak will die out immediately or take hold. [5] This is an analytical choice driven by the data availability and the specific question being posed to the mathematical structure.

# Enduring Principles

While computational models today incorporate intricate elements like age stratification, network structures, mobility patterns, and viral evolution (making them vastly more complicated than the originals), [1] they all still rely on the core principle established by the pioneers: compartmentalization and rate change. [8]

The enduring genius of the early work lies not just in the equations themselves, but in establishing the language—Susceptible, Infected, Removed—that allows epidemiologists across different fields, from virology to public health administration, to communicate about disease dynamics using a shared, mathematical dialect. [4] Whether one credits the continuous mathematics of Kermack and McKendrick or the probabilistic rigor of Reed and Frost, the invention of epidemic modeling was the moment disease spread moved from being merely observed history to being mathematically predictable science.

#Citations

  1. Mathematical modeling applied to epidemics: an overview - PMC
  2. Reed-Frost Epidemic Model - Ohio Supercomputer Center
  3. Pandemic Puts Mathematical Modeling Through Its Paces
  4. the first epidemic model: - a historical note on pd en'ko
  5. Reed–Frost model - Wikipedia
  6. SIR models of epidemics – Theoretical Biology | ETH Zurich
  7. Infectious Disease Modeling — A brief history of disease modelers
  8. Computational Modeling and Simulation of Epidemic Infectious ...
  9. [PDF] John Cross, epidemic theory, and mathematically modeling the ...

Written by

Donna Edwards
inventionHistorymodelepidemic