Пікірлер
@BarneyLawrence
@BarneyLawrence 25 күн бұрын
Good work Calum and Tom. That was a really good Databricks overview!
@Caldimus
@Caldimus 25 күн бұрын
Thanks Barney! Fun day recording in the office :)
@simpsonassociates5948
@simpsonassociates5948 22 күн бұрын
Thanks so much, Barney. Completely agree.
@mfayed19
@mfayed19 Ай бұрын
Thank you
@Caldimus
@Caldimus 3 ай бұрын
Great chat about Splink! How do you go about chaining the deterministic match to the Splink match in a pipeline?
@BarneyLawrence
@BarneyLawrence 3 ай бұрын
Hi @Caldimus, I'm glad you enjoyed the video. The Splink code usually runs in a notebook which can be scheduled as a step inside a pipeline. Typically it gets sandwiched between steps that extract the data we need to match from source and following ones to process the results and create a combined person record from the matched sources. If we need a deterministic element to the matching this is handled by a set of overrides we store in a table which take priority over any probabilistic results or exclude bad matches. Those overrides can be pushed directly to the table or can be managed through a governance app we have developed which sits on top of the probabilistic results allowing someone to approve or reject them.
@katherinetarbitt3139
@katherinetarbitt3139 3 ай бұрын
Super interesting, thanks for the insight Barney!
@PKAsh7
@PKAsh7 3 ай бұрын
Great job, really enjoyed this one.
@YusufKhan-x4x
@YusufKhan-x4x 4 ай бұрын
How do you add the floors ?
@KayDobson
@KayDobson 8 ай бұрын
Great interview!
@simpsonassociates5948
@simpsonassociates5948 8 ай бұрын
Thanks Kay 😀
@BarneyLawrence
@BarneyLawrence 8 ай бұрын
Great interview! I'm looking forwards to seeing future episodes.
@simpsonassociates5948
@simpsonassociates5948 8 ай бұрын
Thanks Barney! We'll have to get you on as a guest!