No video

25 April 2024: Brendan Murphy (University College Dublin)

  Рет қаралды 60

UCL Statistical Science seminars

UCL Statistical Science seminars

Күн бұрын

Title: Model-Based Approach to Enhance Individual Matching Across Different Databases by Using Household Information
The field of record linkage is focused on matching information from the same entity across diverse sources without unique identifiers. Record linkage is gaining importance in applications ranging from medical record enhancement to the study of population mobility between censuses or surveys. Conventional record linkage models primarily concentrate on direct individual matching, often disregarding valuable group-level information inherent in the data. Motivated by recent research indicating enhanced performance when incorporating group information into the matching process, we propose a novel model-based approach that facilitates the joint estimation of individual and household match status, while also estimating the feature matching probabilities, given the match status of both individuals and their households. To illustrate the methodology we use the Italian Survey of Household Income and Wealth from 2014 and 2016. Our results, which account for different initialization methods, demonstrate a notable improvement in the $F_1$ score, with values around 80% when household information is considered, compared to approximately 46% for methods directly matching individuals without leveraging group information. Additionally, our findings underscore the model's robustness, as it consistently yields favourable outcomes across various initialization methods and in the presence of implemented blocking strategies. This work is in collaboration with Thais Pacheco Menezes and Michael Fop from the School of Mathematics and Statistics, University College Dublin.

Пікірлер
01 March 2024 - Yiyong Luo (PhD seminar) - Econometric Modelling with High Dimensional Data
44:12
21 February 2024 - James Briant (PhD seminar) - Bayesian Calibration of Computer Models
56:04
managed to catch #tiktok
00:16
Анастасия Тарасова
Рет қаралды 45 МЛН
а ты любишь париться?
00:41
KATYA KLON LIFE
Рет қаралды 3,4 МЛН
Мы сделали гигантские сухарики!  #большаяеда
00:44
AI, Machine Learning, Deep Learning and Generative AI Explained
10:01
IBM Technology
Рет қаралды 101 М.
18 April 2024: Nina Deliu (Sapienza University of Rome)
54:41
UCL Statistical Science seminars
Рет қаралды 70
Bikram Karmakar: A new paradigm for causal inference in the presence of unmeasured confounders
1:07:10
ASA Statistical Learning and Data Science
Рет қаралды 98
9 May 2024: David Rossell (Universitat Pompeu Fabra in Barcelona)
54:28
UCL Statistical Science seminars
Рет қаралды 67
Scottish accent vs Irish accent (funny)
3:55
Lifey
Рет қаралды 12 МЛН
Study Geography and Geoscience at Trinity College Dublin
19:01
Trinity College Dublin
Рет қаралды 1,8 М.
A Day in the Life of David Cameron
11:33
The Sun
Рет қаралды 3,2 МЛН
managed to catch #tiktok
00:16
Анастасия Тарасова
Рет қаралды 45 МЛН