The geography of the covid-19 pandemic: A data-driven approach to exploring geographical driving forces

Frederik Seerup Hass, Jamal Jokar Arsanjani*

*Corresponding author for this work

Research output: Contribution to journal › Journal article › Research › peer-review



The Covid-19 pandemic emerged and evolved so quickly that societies were unable to respond in time, mainly because of the virus's rapid rate of spread and the largely open societies that we live in. While we have willingly been moving towards open societies and reducing barriers to movement, we need to be prepared to restrict that openness during events such as large pandemics, which are low-probability events with massive impacts. Like many phenomena, the Covid-19 pandemic has shown its own geography: it has distinct patterns of emergence and evolution, and it takes advantage of our geographical settings to escalate its spread. Hence, this study presents a data-driven approach for exploring the spatio-temporal patterns of the pandemic at a regional scale, i.e., Europe, and a country scale, i.e., Denmark, and for identifying which geographical variables potentially contribute to expediting its spread. We used official regional infection rates, points of interest, temperature and air pollution data to monitor the pandemic's spread across Europe, and applied geospatial methods such as spatial autocorrelation and space-time autocorrelation to extract relevant indicators that could explain the dynamics of the pandemic. Furthermore, we applied statistical methods, e.g., ordinary least squares and geographically weighted regression, as well as machine learning methods, e.g., random forest, to explore the potential correlation between the chosen underlying factors and the pandemic's spread. Our findings indicate that population density, amenities such as cafes and bars, and pollution levels are the most influential explanatory variables, while pollution levels can be explicitly used to monitor lockdown measures and infection rates at country level.
The choice of data and methods used in this study, along with the achieved results and presented discussions, can empower health authorities and decision makers with an interactive decision support tool, which can be useful for imposing geographically varying lockdowns and protective measures based on historical data.
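
As a rough illustration of the spatial autocorrelation mentioned in the abstract, the sketch below computes global Moran's I, one common measure of it. The abstract does not state which statistic or software the authors used, so this is an assumption for illustration only. The function name `morans_i`, the four-region chain layout and the input values are all hypothetical and are not data from the study.

```python
def morans_i(values, weights):
    """Global Moran's I for one value per region.

    values  -- list of numbers, one per region (e.g. infection rates)
    weights -- square matrix (list of lists); weights[i][j] > 0 when
               regions i and j are treated as neighbours, else 0
    """
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]            # deviations from the mean
    s0 = sum(sum(row) for row in weights)       # sum of all spatial weights
    # Cross-products of deviations, counted only for neighbouring pairs.
    num = sum(weights[i][j] * dev[i] * dev[j]
              for i in range(n) for j in range(n))
    den = sum(d * d for d in dev)               # total squared deviation
    return (n / s0) * (num / den)


# Hypothetical layout: four regions in a row, each adjacent to the next.
chain_weights = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]

# Made-up values that rise smoothly along the chain (similar neighbours).
print(morans_i([1, 2, 3, 4], chain_weights))     # positive, about 0.33

# Made-up values that alternate (dissimilar neighbours).
print(morans_i([1, -1, 1, -1], chain_weights))   # -1.0
```

Positive values indicate that neighbouring regions tend to have similar rates, and negative values indicate that they tend to differ. A real analysis would also need a significance test and a weights matrix derived from actual regional boundaries.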

Original language: English
Article number: 2803
Journal: International Journal of Environmental Research and Public Health
Issue number: 6
Pages (from-to): 1-19
Number of pages: 19
Publication status: Published - 2 Mar 2021

Bibliographical note

Funding Information:
Funding: has received funding from the European Union’s Horizon Programme call H2020-INFRAEOSC-05-2018-2019, grant Agreement number 831644.

Publisher Copyright:
© 2021 by the authors. Licensee MDPI, Basel, Switzerland.


  • Covid-19 pandemic
  • Machine learning
  • Public health
  • Spatial autocorrelation
  • Spatio-temporal analysis
  • COVID-19
  • Geography
  • SARS-CoV-2
  • Pandemics
  • Europe/epidemiology
  • Humans
  • Communicable Disease Control

