Skip to content
GSS > Training courses > Face to face > Introduction to data linkage

Introduction to data linkage

Open to
All staff in the Government Statistical Service
Training category
Type of training
Face to face
One day
GSS Capability
GSS Capability
London, Newport, Titchfield, Leeds, Other locations available on request


This course is a classroom-based session aimed at providing participants with an introduction to the principles, theory and practice of data linking. This may be taken as a one-off module, or in combination with other modules as part of the Statistical Analyst (SA) Scheme.

Data linking is playing an increasing role in producing statistics, to support a wide range of users and uses, through integration of data from multiple sources.

There are no pre-requisites for this course, unless being taken as part of the SA scheme.

Learning outcomes:

By the end of the course, participants will be able to understand:

  • the difficulties involved in data linkage
  • desirable qualities of matching variables
  • how to prepare datasets before matching
  • different types of linkage methods
  • how to link very large datasets
  • how to evaluate the quality of your matches

Participants will also be shown some matching methods using Python code.

How to book:

Please email:



Related courses:

Awareness in data linkage