Good work Calum and Tom. That was a really good Databricks overview!
@Caldimus 25 days ago
Thanks Barney! Fun day recording in the office :)
@simpsonassociates5948 22 days ago
Thanks so much, Barney. Completely agree.
@mfayed19 1 month ago
Thank you
@Caldimus 3 months ago
Great chat about Splink! How do you go about chaining the deterministic match to the Splink match in a pipeline?
@BarneyLawrence 3 months ago
Hi @Caldimus, I'm glad you enjoyed the video. The Splink code usually runs in a notebook, which can be scheduled as a step inside a pipeline. Typically it sits between earlier steps that extract the data we need to match from source and later steps that process the results and create a combined person record from the matched sources. If we need a deterministic element to the matching, this is handled by a set of overrides stored in a table, which take priority over any probabilistic results and can also exclude bad matches. Those overrides can be pushed directly to the table or managed through a governance app we have developed, which sits on top of the probabilistic results and lets someone approve or reject them.
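As a rough illustration of the override idea described above, here is a minimal Python sketch. It is not the actual pipeline code: the names `Override`, `pair_key` and `apply_overrides`, the record IDs, and the 0.9 threshold are all hypothetical, and the scored pairs stand in for whatever the probabilistic step outputs.

```python
from typing import Iterable, NamedTuple, Set, Tuple


class Override(NamedTuple):
    """One row of a hypothetical overrides table."""
    left_id: str
    right_id: str
    is_match: bool  # True forces a match, False rejects one


def pair_key(a: str, b: str) -> Tuple[str, str]:
    """Order-independent key so (a, b) and (b, a) refer to the same pair."""
    return tuple(sorted((a, b)))


def apply_overrides(
    scored_pairs: Iterable[Tuple[str, str, float]],
    overrides: Iterable[Override],
    threshold: float = 0.9,  # assumed cut-off, not taken from the source
) -> Set[Tuple[str, str]]:
    """Return accepted pairs; overrides take priority over probabilistic scores."""
    # Start from the probabilistic decision for each scored pair.
    decisions = {
        pair_key(left, right): prob >= threshold
        for left, right, prob in scored_pairs
    }
    # Overrides replace the probabilistic decision, or add pairs never scored.
    for o in overrides:
        decisions[pair_key(o.left_id, o.right_id)] = o.is_match
    return {pair for pair, is_match in decisions.items() if is_match}


# Made-up example data.
scored = [("a1", "b1", 0.97), ("a2", "b2", 0.95), ("a3", "b3", 0.40)]
overrides = [
    Override("a2", "b2", False),  # reject a bad high-scoring match
    Override("a3", "b3", True),   # approve a low-scoring match
    Override("a4", "b4", True),   # deterministic match with no score at all
]
accepted = apply_overrides(scored, overrides)
```

In this sketch an approval or rejection made in a governance tool would simply become another row in `overrides`, so the next run picks it up without changing the probabilistic step.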
@katherinetarbitt3139 3 months ago
Super interesting, thanks for the insight Barney!
@PKAsh7 3 months ago
Great job, really enjoyed this one.
@YusufKhan-x4x 4 months ago
How do you add the floors?
@KayDobson 8 months ago
Great interview!
@simpsonassociates5948 8 months ago
Thanks Kay 😀
@BarneyLawrence 8 months ago
Great interview! I'm looking forward to seeing future episodes.
@simpsonassociates5948 8 months ago
Thanks Barney! We'll have to get you on as a guest!