I made a SEARCH ENGINE from scratch!

14,713 views

Daniel Zhang

1 day ago

Comments: 89
@JeffreyZang 16 days ago
big chungus
@cardboardbox_tech 12 days ago
among us
@sasodoma 14 days ago
The first thing I thought when you said href will always be a valid link was "not really". And then you showed the descent into madness, love it.
@mathman0569 12 days ago
I was about to say this lol
@cinderwolf32 12 days ago
Client side hash string routing: "I'd like to introduce myself"
@reed6514 11 days ago
I suggest scraping your local news websites and making a search engine just for local news. It could actually be useful for your community.
@HootMoot 16 days ago
Good stuff and nicely scripted and edited. Once you get a mic and some traction, you'll be on your way to 100k subs. Everything but audio is extremely high quality and well done!
@smb1397 11 days ago
"my hard drive was getting too full" shows 112 gb used. walled garden problems
@ujjwaldimri2392 14 days ago
amazing video, it is well made and very informative. hard to believe you only have a 100 subs
@vitaliykishchenko4365 12 days ago
This is awesome work! Have no idea how you're doing all this at this age, you have a bright future. Research and editing all done very well. I love the TF-IDF explanation, super concise.
@deepbrar1 10 days ago
This is awesome. The video is really informative and you explained everything so perfectly. Great content, got a lot to learn from it.
@fakejimhalpert 14 days ago
GREAT video this is so cool!! keep going
@kevinnielsendev 12 days ago
Nice project! This is really impressive :D
@casperdong 16 days ago
the new GOOGLE I cannot wait
@varram3488 12 days ago
I've been in the web scraping space for years. The robots.txt page is a suggestion. It is completely legal to ignore the robots.txt file. Big tech companies want you to think it's illegal even though it's not. As long as you don't commercialize human-made content you will be fine :D
@hexxt 11 days ago
naughty boy
@fujinshu 11 days ago
Right now it's legal, but I wouldn't be surprised if Congress and Trump rule that ignoring the robots.txt file is illegal, especially with some light bribery and a compromised SCOTUS.
@cinderwolf32 11 days ago
Have you ever been blacklisted for requesting routes that are denied by the file? Especially if your user agent is noticeable
@reed6514 11 days ago
Web scraping can be illegal depending on what you do with the content. The U.S. Copyright Office has a Fair Use Index online that summarizes findings. Search should be fine in most cases, but it's good to be informed, especially if your use case is remotely iffy. Even better to talk to a lawyer.
@varram3488 10 days ago
@reed6514 yep exactly, and the use case in the video falls under fair use. Also, it's really funny how the entire AI industry is based around a grey area (the law is pretty outdated and vague), which we should see resolved really soon through lawsuits going on right now lmfao.
@Gaarlicc 12 days ago
Such a cool video, keep it up
@TheRetroEngine 11 days ago
Dude I'm new to your site and when the FBI turned up, I spilled my tea.
@zinck_dome7072 12 days ago
Cool project man
@cake0539 11 days ago
tfidf sounds like a DOOM cheat code
@LegendBegins 12 days ago
This was great! Nice work!
@Mikko-Maggie-More 11 days ago
fun fact: google purposefully removes results so that you'll use gemini instead
@ClarkeMacbeth 10 days ago
Just in time for 2121!
@casperdong 16 days ago
daniel zhang is the next joma tech!
@mwguy 13 days ago
Good stuff. You can improve the crawler algorithm by splitting it into small worker nodes that coordinate through Kafka/RabbitMQ to parallelize page downloading. Also just throw away Prisma and use raw SQL to squeeze all the performance out of the database.
@C4CH3S 11 days ago
Or yk, don't use toy languages like JS in the backend for performance heavy tasks...
@reed6514 11 days ago
It is worth considering load on the sites you're crawling. Don't want to DDoS on accident.
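One way to act on the point about load while still running parallel workers is a per-host throttle. The sketch below is illustrative only: `HostThrottle`, `reserve`, and the 1000 ms gap are hypothetical names and values, not anything taken from the video's crawler.

```typescript
// Hypothetical per-host throttle: remembers when each host may next be hit.
class HostThrottle {
  private nextAllowed = new Map<string, number>();
  private readonly minGapMs: number;

  constructor(minGapMs: number) {
    this.minGapMs = minGapMs;
  }

  // Returns how many milliseconds the caller should wait before requesting
  // `url`, and reserves that slot so other workers queue up behind it.
  reserve(url: string, nowMs: number): number {
    const host = new URL(url).host;
    const earliest = this.nextAllowed.get(host) ?? nowMs;
    const startAt = Math.max(earliest, nowMs);
    this.nextAllowed.set(host, startAt + this.minGapMs);
    return startAt - nowMs;
  }
}

// Example: two requests to the same host are spaced out, other hosts are not.
const throttle = new HostThrottle(1000);
const firstWait = throttle.reserve("https://a.example/x", 0);
const secondWait = throttle.reserve("https://a.example/y", 0);
const otherHostWait = throttle.reserve("https://b.example/", 0);
```

A worker would sleep for the returned delay before fetching, so total throughput can still scale across many hosts while any single site sees at most one request per gap.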
@joshchen7993 1 day ago
Such an informative video! How did you speed up the querying process? I'm also trying to create a search engine and calculating the TF-IDF of every page is taking a long time
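A common answer to this kind of question is to compute TF-IDF weights once at index time and store them in an inverted index, so a query only touches the pages that contain its terms. The sketch below is a generic illustration under that assumption, not the method used in the video; `buildIndex`, `search`, `tokenize`, and the sample documents are made up for the example.

```typescript
// term -> (document id -> precomputed TF-IDF weight)
type InvertedIndex = Map<string, Map<number, number>>;

function tokenize(text: string): string[] {
  return text.toLowerCase().match(/[a-z0-9]+/g) ?? [];
}

// Build the index once, when pages are crawled, not on every query.
function buildIndex(docs: string[]): InvertedIndex {
  const termCounts = new Map<string, Map<number, number>>();
  const docLengths: number[] = [];
  docs.forEach((text, docId) => {
    const tokens = tokenize(text);
    docLengths.push(tokens.length);
    for (const term of tokens) {
      let postings = termCounts.get(term);
      if (!postings) {
        postings = new Map<number, number>();
        termCounts.set(term, postings);
      }
      postings.set(docId, (postings.get(docId) ?? 0) + 1);
    }
  });

  const index: InvertedIndex = new Map();
  for (const [term, postings] of termCounts) {
    // idf = ln(N / df); a term that appears in every document gets weight 0.
    const idf = Math.log(docs.length / postings.size);
    const weights = new Map<number, number>();
    for (const [docId, count] of postings) {
      weights.set(docId, (count / docLengths[docId]) * idf);
    }
    index.set(term, weights);
  }
  return index;
}

// Query time: sum the stored weights for each query term, highest score first.
function search(index: InvertedIndex, query: string): number[] {
  const scores = new Map<number, number>();
  for (const term of tokenize(query)) {
    const postings = index.get(term);
    if (!postings) continue;
    for (const [docId, weight] of postings) {
      scores.set(docId, (scores.get(docId) ?? 0) + weight);
    }
  }
  return [...scores.entries()]
    .sort((a, b) => b[1] - a[1])
    .map(([docId]) => docId);
}

const sampleIndex = buildIndex([
  "the cat sat",
  "the dog sat on the dog",
  "the bird",
]);
const hits = search(sampleIndex, "sat cat");
```

In a real database the same idea becomes a table keyed by term with one row per (term, page, weight), so a query is a lookup plus a sum instead of a pass over every page.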
@MeowVR 14 days ago
Very cool video
@casperdong 15 days ago
I am ur only female viewer
@jir_UwU 12 days ago
nuh uh me too
@casperdong 12 days ago
@jir_UwU us girls gotta stick together
@deleted_handle 12 days ago
girls aren't real. stop trolling
@LEMON_2U 12 days ago
Nuh uh me too
@Imtitled 12 days ago
R30 There are no girls on the Internet
@BiuerBoris 14 days ago
Wonderful video and project! Would be interesting to have it spawn the crawler from somewhere other than Wikipedia.
@petermarshall1634 12 days ago
Amazing video how does your channel only have 200 subs
@felixranesberger3846 12 days ago
Awesome video!
@horntoad1616 7 days ago
Horntoad here
@staniekkkkkkkkkkkkkkkkkkkkkkkk
@staniekkkkkkkkkkkkkkkkkkkkkkkk 5 күн бұрын
good shif vru
@Matusevichfilms 12 days ago
Kagi is pretty good
@gljames24 11 days ago
What is that random sound in the background of your voice? Do you have your hand on your mic? Put a gain cutoff or something.
@garf510 16 days ago
I love giggle 😀
@mathman0569 12 days ago
Should've used the URL API to check for valid URLs
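As a rough illustration of this suggestion, the standard `URL` constructor can resolve an `href` against the page it came from and reject anything it cannot parse. The helper name `resolveLink` and the choice to keep only `http:` and `https:` links are assumptions made for this sketch, not details from the video.

```typescript
// Hypothetical helper: resolve an href against the page it was found on.
// Returns a normalized absolute URL, or null if the link should be skipped.
function resolveLink(href: string, pageUrl: string): string | null {
  let url: URL;
  try {
    // The URL constructor throws a TypeError on input it cannot parse.
    url = new URL(href, pageUrl);
  } catch {
    return null;
  }
  // Skip mailto:, javascript:, tel:, and other non-web schemes.
  if (url.protocol !== "http:" && url.protocol !== "https:") {
    return null;
  }
  // Drop the fragment so "#section" links do not create duplicate pages.
  url.hash = "";
  return url.href;
}

const page = "https://en.wikipedia.org/wiki/Dog";
const absolute = resolveLink("/wiki/Cat", page);
const skipped = resolveLink("mailto:someone@example.com", page);
```

Dropping the fragment is a trade-off: it avoids re-crawling the same page under many anchors, but it would also collapse sites that use client-side hash routing, as mentioned earlier in the thread.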
@Christian-ry3ol 10 days ago
I think I just found my next project to work on. This inspired me. So fucking cool.
@CerebrumReality 14 days ago
Nice Video :0
@casperdong 12 days ago
I love you.
@exp5261 11 days ago
@casperdong thanks
@ShaikhRehanShakil 14 days ago
everything is just wikipedia :((
@ironislife9857 12 days ago
Why not use a faster language that would allow you to search for things faster?
@cinderwolf32 12 days ago
The language is probably not relevant to the time it takes. It'd likely be unnoticeable whether you use Python or C++ for this (assuming you don't write terrible code). A network call / database write is multiple orders of magnitude slower than whatever the code is doing
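If database writes dominate, one generic mitigation is to buffer rows in memory and write them in batches rather than one query per page. This is a minimal sketch under that assumption; `BatchWriter` and its `writeBatch` callback are hypothetical, and the callback stands in for whatever bulk insert the database layer actually provides.

```typescript
// Hypothetical buffer: collects rows and hands them to `writeBatch` in
// groups, so the database sees one bulk insert per batch.
class BatchWriter<T> {
  private buffer: T[] = [];
  private readonly batchSize: number;
  private readonly writeBatch: (rows: T[]) => Promise<void>;

  constructor(batchSize: number, writeBatch: (rows: T[]) => Promise<void>) {
    this.batchSize = batchSize;
    this.writeBatch = writeBatch;
  }

  async add(row: T): Promise<void> {
    this.buffer.push(row);
    if (this.buffer.length >= this.batchSize) {
      await this.flush();
    }
  }

  // Call once at the end of a crawl so a partial final batch is not lost.
  async flush(): Promise<void> {
    if (this.buffer.length === 0) return;
    const rows = this.buffer;
    this.buffer = [];
    await this.writeBatch(rows);
  }
}

// Example with a fake writer that just records each batch it receives.
const written: number[][] = [];
const writer = new BatchWriter<number>(2, async (rows) => {
  written.push(rows);
});
void writer.add(1);
void writer.add(2);
void writer.add(3);
void writer.flush();
```

The batch size is a tuning knob: larger batches mean fewer round trips, but more work is lost if the crawler crashes before a flush.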
@reed6514 11 days ago
@cinderwolf32 caching to text files and then writing bulk queries could speed up the db stuff. The language speed hardly matters, but a couple milliseconds per page adds up when you're doing many thousands of pages.
@solmateusbraga 11 days ago
I am ur only femboy viewer
@_jb_ 11 days ago
I enjoyed the animations and the effort. But your coding abilities need to improve. Keep on
@rixanito 14 days ago
The video is amazing, but I would still suggest you put in more visual effects like zoom-ins and transitions (especially zoom-ins)
@rixanito 14 days ago
@bogxd would be a great source of inspiration
@therealpersonion 16 days ago
is this manim??
@danielcsthings 16 days ago
It's Motion Canvas: motioncanvas.io
@Zhane4994 12 days ago
YOU LIAR, THERE IS MORE ALTERNATIVES 😱😱😱😱😱😱😱😱😱😱😱😱😱😱😱😱
@sandwich-plays 12 days ago
skibidi
@justwingingit8110 16 days ago
20 views in 14 hours? you fell off
@danielcsthings 15 days ago
it's joever
@Jetway-Yefan 12 days ago
GIGGLE