Data Linkage in R

Open to
All staff in the Government Statistical Service
Training category
Analytical, Data science
Type of training
Online
Length
12 hours
Organiser
GSS Capability
Provider
GSS Capability
Location
Online

Performing data linkage is the process of joining multiple datasets together and linking records. It ensures that the resources spent collecting data are most effectively used by increasing the ways each dataset can be used for various research needs.

This course aims to cover the practical application of linking data in R. A similar course will be available for those who prefer working in Python.

Learning outcomes

By the end of the session, participants will be able to conduct:

  • pre-linkage (preparing data)
  • exact matching
  • rule-based matching
  • score-based matching
  • Fellegi-Sunter probabilistic matching
  • post-linkage and quality evaluation

How to book

Please use your UKSA Learning Hub account to access the course online. Alternatively, please email gss.capability@statistics.gov.uk.

Related courses