Splink: a software package for probabilistic record linkage and deduplication at scale

Splink: a software package for probabilistic record linkage and deduplication at scale

16.710 Lượt nghe
Splink: a software package for probabilistic record linkage and deduplication at scale
In this seminar, we will introduce Splink, a software package developed for probabilistic record linkage at scale. This is free software provides a toolkit for record linkage of datasets of tens or even hundreds of millions of records, guiding the user through the various stages of linkage