Introduction to data linkage

Open to
Government analysts
Training category
Type of training
Face to face
One day
GSS Capability
GSS Capability
London, Newport, Titchfield, Leeds, Other locations available on request


This course is a classroom-based session aimed at providing participants with an introduction to the principles, theory and practice of data linking. This may be taken as a one-off module, or in combination with other modules as part of the Statistical Analyst (SA) Scheme.

Data linking is playing an increasing role in producing statistics, to support a wide range of users and uses, through integration of data from multiple sources.

There are no pre-requisites for this course, unless being taken as part of the SA scheme.

Learning outcomes

By the end of the course, participants will be able to understand:

  • the difficulties involved in data linkage
  • desirable qualities of matching variables
  • how to prepare datasets before matching
  • different types of linkage methods
  • how to link very large datasets
  • how to evaluate the quality of your matches

Participants will also be shown some matching methods using Python code.

How to book

These courses are provided depending on demand. If you would like access to this course, please contact



Related courses

Awareness in data linkage