Nov 2022

Customizable probabilistic record linkage with Name Match | PyData NYC 2022

Melissa McNeill

Linking individuals across records or datasets is often a critical prerequisite for building useful data tools and answering interesting research or business questions. But doing it right is difficult and time-consuming, in part because current off-the-shelf tools do not provide a measure of linking accuracy and are too rigid to incorporate the user’s domain knowledge. In this talk, we’ll 1) define high-quality record linkage and discuss why it matters, 2) show how record linkage can be boiled down to a simple prediction problem, and 3) introduce Name Match, a new open source tool for customizable probabilistic record linkage.
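To make the "simple prediction problem" framing concrete, here is a minimal sketch of the general idea: each pair of records is turned into similarity features, and a model scores how likely the pair is to refer to the same person. This is not Name Match's API. The field names, weights, and bias below are hypothetical and hand-picked for illustration; in a real setting the weights would be learned from labeled pairs.

```python
from difflib import SequenceMatcher
from math import exp


def similarity(a: str, b: str) -> float:
    """Case-insensitive string similarity between 0.0 and 1.0."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def pair_features(r1: dict, r2: dict) -> list:
    """Turn a pair of records into numeric features for a classifier."""
    return [
        similarity(r1["first_name"], r2["first_name"]),
        similarity(r1["last_name"], r2["last_name"]),
        1.0 if r1["dob"] == r2["dob"] else 0.0,  # exact date-of-birth agreement
    ]


# Hypothetical, hand-set logistic weights (assumption: a real model would
# estimate these from labeled matching / non-matching pairs).
WEIGHTS = [3.0, 3.0, 4.0]
BIAS = -6.0


def match_probability(r1: dict, r2: dict) -> float:
    """Logistic score: estimated probability that two records are one person."""
    z = BIAS + sum(w * x for w, x in zip(WEIGHTS, pair_features(r1, r2)))
    return 1.0 / (1.0 + exp(-z))


# Toy records with hypothetical fields.
rec_a = {"first_name": "Jonathan", "last_name": "Smith", "dob": "1990-01-01"}
rec_b = {"first_name": "Jon", "last_name": "Smith", "dob": "1990-01-01"}
rec_c = {"first_name": "Maria", "last_name": "Lopez", "dob": "1985-06-30"}

print(match_probability(rec_a, rec_b))  # high score: likely the same person
print(match_probability(rec_a, rec_c))  # low score: likely different people
```

Because the output is a probability rather than a hard yes/no, the linking threshold can be tuned, and accuracy can be measured against a held-out set of labeled pairs.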

Latest Updates

More spending won’t be enough to reduce Chicago’s gun violence
Op-Ed
Chicago Tribune
May 2024

Read Crime Lab Faculty Director Jens Ludwig’s latest op-ed arguing that to reduce violence, we should leverage data science to figure out how to get more social good out of what the city is already spending on evidence-based strategies such as community violence intervention programs.

Reset with Sasha-Ann Simons: Can police misconduct be stopped before it starts?
Podcast
WBEZ
May 2024

Crime Lab Senior Research Director Greg Stoddard joins Patrick Smith on WBEZ Reset to discuss results from a new study of an algorithm that can help identify which officers are likely to commit misconduct.

U. of C. study shows cops at high risk of misconduct also at elevated risk for off-duty trouble
Media Mention
Chicago Tribune
May 2024

The Chicago Tribune’s Caroline Kubzansky speaks with Crime Lab Senior Research Director Greg Stoddard to discuss results from a new study of an officer support system.