Probabilistic record linkage

These pages present introductory training material on probabilistic record linkage using the Fellegi-Sunter model. Many of the articles are interactive.

This material presents a simplified version of the model used by Splink, a piece of probabilistic linkage software for which I'm the lead developer.

Many of the graphics reuse Splink's graphical output, and model parameters are represented in the same way as in Splink's settings object.

Training materials on probabilistic linkage

Further reading (external links)

  1. Splink: MoJ’s open source library for probabilistic record linkage at scale
  2. Splink homepage
  3. Try Splink live in your browser
  4. Interactive settings editor