Nov 2022

PyData NYC 2022: Customizable probabilistic record linkage with Name Match

Melissa McNeill

Linking individuals across records or datasets is often a critical prerequisite for building useful data tools and answering interesting research or business questions. But doing it right is difficult and time-consuming, in part because current off-the-shelf tools do not provide a measure of linking accuracy and are too rigid to incorporate the user’s domain knowledge. In this talk, we’ll 1) define high-quality record linkage and discuss why it matters, 2) show how record linkage can be boiled down to a simple prediction problem, and 3) introduce Name Match, a new open source tool for customizable probabilistic record linkage.

Latest Updates

The best way to cut gun violence, and it’s almost free
Op-Ed
Crain's Chicago Business
Jul 2025

The best way to cut gun violence, and it’s almost free

In an op-ed for Crain’s Chicago Business, Crime Lab Pritzker Director Jens Ludwig highlights the importance of using data-informed practices to improve public safety and shares key insights from behavioral economics that provide a playbook for addressing gun violence that is both effective and low-cost.

Research on cognitive behavioral therapy for at-risk youth
Podcast
Probable Causation
Jul 2025

Research on cognitive behavioral therapy for at-risk youth

Dr. Nour Abdul-Razzak joins host Jennifer Doleac on the Probable Causation podcast to discuss the Choose to Change program—an intervention that integrates trauma-informed therapy with comprehensive support to reduce youth violence and improve educational outcomes.

Deaths of decision-making are killing American teens. Schools can fix it.
Op-Ed
Brookings
Jul 2025

Deaths of decision-making are killing American teens. Schools can fix it.

Crime Lab executive director Katie Hill pens an op-ed for Brookings about how cognitive behavioral programs can teach teens decision-making skills that can dramatically reduce violence and save lives – often at little or no additional cost.