Data Science Intermediate

Using administrative data for research: a brief introduction
3 JULY 2023
pre-recorded lectures available from 26 June 2023

Pia Hardelid and Linda Wijlaars

Administrative data, sometimes referred to as routinely collected data, provide large and rich datasets for research. However, they require careful cleaning, management and interpretation. This online course is for those who are considering using administrative data for research and would like a short introduction to the topic. The course will use administrative health data (national hospital inpatient data – the Hospital Episode Statistics database) as an example, but the principles apply to all administrative data.

The aim is to provide an introduction to the following topics:

  • What is the difference between administrative data and other research data?
  • Who collects administrative health data and why?
  • What are some of the benefits and challenges of using administrative health data for research?
  • A short introduction to applying for access to administrative health data (UK perspective)

By the end of this course you will be able to:

  • Understand what special considerations need to be taken into account when analysing and interpreting administrative health data
  • Describe the strengths and weaknesses of administrative health data for research
  • Elaborate on the key steps required for accessing administrative health data in the UK

There will be three pre-recorded lecture sessions and one live computer practical session in R or Stata.




From 26th June 2023

Pre-recorded lectures available

Lecture 1

Introduction: what is so special about administrative health data?

Lecture 2

Strengths and weaknesses of administrative health data

Lecture 3

How to apply for access to administrative data

Monday 3rd July 2023

10:30am - 1:30pm

Live session

RADIANCE appetisers on Information Governance, Trusted Research Environment, Ethical considerations, Reproducible and Open Data Science.


This is a UKRI-funded project offering rigorous training in longitudinal data science. Please note that this training is NOT available to undergraduate or master's students.