Web Crawler - System Design Interview Question

10,866 views

TechPrep

A day ago

Comments: 13
@TechPrepYT a month ago
🔍 Full Write Up + Bonus Section (What Top Tech Interviewers Really Want to See) → www.techprep.app/system-design 🎯
@games-are-for-losers 10 months ago
The YouTube algorithm has picked up your channel. Really good content
@LouisDuran 8 months ago
I like that these are short and sweet. It shouldn't take an hour to explain TinyURL or web crawler. Thanks!
@TechPrepYT 6 months ago
Exactly 👍
@ChimiChanga1337 10 months ago
Excellent! Could you also talk about which network protocols the services would use to talk to each other?
@WINDSORONFIRE 6 months ago
How does the design of a web crawler not include geo-located servers, etc.?
@sayantanscs 3 months ago
Is this really a good use case for Bloom filters? They have false positives, which means they might report a URL as visited when it is not (assuming we keep a list of visited URLs). So roughly 0.1% to 1% of URLs would never be visited! Since this is a continuous process, a workaround might be to make the values in the Bloom filter change with every run, so that a URL missed in one run is not automatically missed in the next.
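A minimal sketch of the workaround raised in this comment, assuming a hand-rolled Bloom filter whose hash positions are mixed with a per-run salt. The class `SaltedBloomFilter`, the helper `crawl_run`, and all parameter values are hypothetical and chosen for illustration; nothing here is taken from the video's design.

```python
import hashlib


class SaltedBloomFilter:
    """Tiny Bloom filter whose bit positions depend on a per-run salt."""

    def __init__(self, num_bits: int, num_hashes: int, salt: str) -> None:
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.salt = salt  # assumption: a fresh salt is chosen for every crawl run
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, url: str):
        # Derive num_hashes bit positions from SHA-256 of salt, index, and URL.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{self.salt}:{i}:{url}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, url: str) -> None:
        for pos in self._positions(url):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, url: str) -> bool:
        # True means "possibly seen"; False means "definitely not seen".
        return all(
            self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(url)
        )


def crawl_run(urls, salt, num_bits=64, num_hashes=2):
    """Return the URLs skipped in one run because the filter claimed 'visited'.

    Assumes `urls` contains no repeats, so every skip is a false positive.
    The tiny default num_bits is deliberate, to make collisions likely.
    """
    seen = SaltedBloomFilter(num_bits, num_hashes, salt)
    skipped = []
    for url in urls:
        if url in seen:
            skipped.append(url)
        else:
            seen.add(url)
    return skipped
```

Calling `crawl_run(urls, "run-1")` and `crawl_run(urls, "run-2")` on the same list will generally skip different URLs, because the salt changes which bits each URL maps to. The per-run false positive rate stays the same; the salt only makes it unlikely that the same URL is missed run after run.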
@rajaryanvishwakarma8915 10 months ago
Great video man
@LearningNewThings0407 8 months ago
Is it Font queue prioritizer or Front queue prioritizer?
@dibll 10 months ago
During the duplicate detection step, how is the Content Cache being used? Could someone please explain?
@jjlee4883 10 months ago
Awesome video. Would it make sense for the URL Seen Detector and URL Filter to come after the HTML parser step?
@TechPrepYT 10 months ago
Thanks for the comment! You would want the duplicate detection to occur directly after the HTML parser, as we don't want to process the same data and extract the same URLs from the same page. That's why the URL Seen Detector and URL Filter happen later on in the system. Hope this makes sense!
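The ordering described in this reply can be sketched roughly as follows. This is an illustration under stated assumptions, not the video's implementation: `process_page` is a hypothetical function, the in-memory sets `seen_content` and `seen_urls` stand in for the Content Cache and the URL Seen Detector's storage, the "parser" is reduced to a whitespace trim, and links are pulled out with a simple regex.

```python
import hashlib
import re
from urllib.parse import urljoin, urlparse

# Simplistic link extraction for illustration; a real crawler would use an HTML parser.
HREF_RE = re.compile(r'href="([^"]+)"')


def process_page(page_url, html, seen_content, seen_urls):
    """Return the new URLs to enqueue from one downloaded page."""
    # 1. HTML parser (stand-in): reject empty documents.
    text = html.strip()
    if not text:
        return []

    # 2. Duplicate detection on page content, directly after parsing.
    #    Assumption: the content cache stores a hash of each page already processed.
    fingerprint = hashlib.sha256(text.encode("utf-8")).hexdigest()
    if fingerprint in seen_content:
        return []  # same content already handled, so skip link extraction
    seen_content.add(fingerprint)

    # 3. URL extraction: resolve relative links against the page URL.
    candidates = [urljoin(page_url, link) for link in HREF_RE.findall(text)]

    # 4. URL filter: keep only http(s) links.
    allowed = [u for u in candidates if urlparse(u).scheme in ("http", "https")]

    # 5. URL seen detector: drop URLs that were already queued or visited.
    new_urls = []
    for url in allowed:
        if url not in seen_urls:
            seen_urls.add(url)
            new_urls.append(url)
    return new_urls
```

Because the content check runs before extraction, a page served under two different URLs has its links extracted only once; the URL-level checks then only need to handle links coming from genuinely new content.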