I am a researcher in outbreak response analytics, with a background in biostatistics, population genetics, and R programming. My research focusses on developing new methodologies and tools for understanding how infectious diseases spread, and how we can control them.

I wear several hats, including:

  • Senior Lecturer in Genetic Analysis at Imperial College London
  • Consultant in Outbreak Analytics and Epidemics Modelling
  • Extreme metal vocalist and mandolin player
  • Since 2023, volunteering in various non-profit organizations takes about half of my time

Other hats I used to wear include:

  • Software Design and Implementation Lead for Epiverse, at Data.org
  • Associate Professor in Outbreak Analytics at the London School of Hygiene and Tropical Medicine, 2018-2022
  • Member of the advisory group for COVID-19 modelling, SPI-M, in the UK
  • Founder and President of the R Epidemics Consortium (RECON), an NGO for the development of free data analytics resources for health emergencies, 2016-2021
  • Member of the UK Public Health Rapid Support Team, 2018-2021
  • WHO consultant and member of the WHO COVID-19 analytics team

You can find my CV here, and my PhD thesis (in French) there.

Outbreak response analytics

I am interested in developing a holistic approach to outbreak data analysis, with a strong focus on emergency outbreak response context, in which analytics directly inform public health decision making. Beyond infectious disease modelling techniques used in academia, I focus on the development of operational analysis tools, including reproducible and auditable data cleaning, interactive data visualisation tools, and automated report generation systems. On a more theoretical side, I am also interested in the estimation of key delay distributions (e.g. incubation period, serial interval distribution), and in robust estimations of transmissibility and the use of branching processes for short term incidence forecasting.

I regularly deploy to outbreak responses in the field, or close to it. In 2019, I spent a total of 6 months in North Kivu, DRC, for the response to the Ebola outbreak. I set up the analytics pipelines used first in Béni, then in Goma for informing the leadership of the response on various aspects of the outbreak in real time. From February 2020 to late 2021, I was working full-time on COVID-19, setting up data pipelines for the CMMID group at LSHTM, developing statistical approaches for informing the response in the UK alongside many other members of SPI-M, as well as for the COVID-19 analytics team at the WHO.

Evidence synthesis approaches for epidemics analysis

Part of my research focusses on integrating epidemiological and genomic data for analysing epidemics. I have pioneered the field of statistical outbreak reconstruction by publishing outbreaker in 2014, the first tool integrating epidemiological and genomic data for inferring who infects whom during an epidemic. I am supervising a PhD student on this topic, who carries further the integration of multiple data sources for outbreak reconstruction through the development of outbreaker2. I am also developing fast, scalable algorithms for outbreak detection by combining various type of data including spatial, temporal, and genetic information on reported cases.

RECON: the R Epidemics Consortium

In September 2016, I have create the R Epidemics Consortium (https://www.repidemicsconsortium.org), an international network of experts in infectious disease modelling, public health, and software developers interested in creating the next generation of tools for disease outbreak analysis using the R software.

RECON learn

In December 2017, I have created RECON learn a platform for sharing free, open training material for epidemics analysis. This includes a collection of lectures, practicals and case studies, most of which are distributed under CC-BY license.

Statistical genetics

My earlier research was mostly dedicated to developing multivariate approaches (factorial methods, clustering algorithms), for exploring genetic data. I am still involved in some of these aspects. I am the author of adegenet, a popular R package for genetic/genomic data analysis. With Zhian Kamvar, I am also running a course on data science for population genetic with PR statistics.