Coding OCR with machine learning from scratch in Python - no libraries or imports! (From Scratch #2)

  Рет қаралды 88,660

clumsy computer

clumsy computer

Күн бұрын

In this video, we implement OCR/image recognition using simple machine learning in Python with no imports! This was streamed live on / clumsycomputer on 2020/07/06.
Our program uses the k-nearest neighbors algorithm to classify images from the mnist and fashion-mnist datasets, meaning it takes digits (1, 2, 3, …) and garments (shoe, t-shirt, …) and guesses what they are (classifies them). It can get up to 93% accuracy on mnist. I wanted this to be a simple introduction to machine learning for those not very familiar with it.
The full code including more details is available at git.sr.ht/~vla...
Because this is a recording of a live stream, you'll hear me sometimes talking to the chat.
If you'd like to see more, please follow the clumsy computer Twitch channel at / clumsycomputer .
If you have any questions, write a comment or contact me on / clumsycomputer .
What would you like to see me code from scratch next? Let me know in the comments! :)
Wishing you a swell day,
Vlad

Пікірлер: 59
@flashbao1922
@flashbao1922 3 жыл бұрын
He explains everything from scratch. This is the scratch I actually needed!
@haniajarlaczynska4518
@haniajarlaczynska4518 4 жыл бұрын
I've just started learning machine learning and I really like this video - you explain everything very clearly. It's nice to see how to do this from scratch instead of using some huge libraries.
@clumsycomputer
@clumsycomputer 4 жыл бұрын
Thank you for the feedback! It makes me really happy to hear that.
@ANSELAbitsxb
@ANSELAbitsxb 2 жыл бұрын
@@clumsycomputer Dude you make really good videos. If you get time again in the future please make more.
@Muphet
@Muphet 3 жыл бұрын
most underrated software programmer and evangelist on youtube. i find this explanation/tutorial better than any other youtube video.
@youssef-attai
@youssef-attai 3 жыл бұрын
I swear to god you are a legend, no one could have explained it better. Thank you so much.
@clumsycomputer
@clumsycomputer 3 жыл бұрын
That means a lot, thank you! ❤
@zodiaminc1474
@zodiaminc1474 3 жыл бұрын
Please don't get dis-heartened or demotivated, this video is a total gem (sorry too late for encouragement after 10 months) but this really helped, thanks bruv, keep up the good work !!!!!
@ArabianKnight63
@ArabianKnight63 2 ай бұрын
followed, that the from scratch thing everyone was looking for.
@frindle18
@frindle18 3 ай бұрын
Brilliant tutorial, everything is so clearly explained, and it actually is from scratch.
@nelhdez3126
@nelhdez3126 3 жыл бұрын
This was so refreshing, you made it way better than some full-of-pride professors at some universities. Thanks so much mate. Could you do at some point the most basic “non-lazy-training” algorithm from scratch? Whichever you deem is fine
@myquoraismine2953
@myquoraismine2953 3 жыл бұрын
I was looking around for this stuff, it's amazing, You nailed it clumsy computer, can you please implement hyper parameters in it ?
@adhamsalama4336
@adhamsalama4336 2 жыл бұрын
This is amazing! Thanks for this awesome video! Subscribed!
@ruoyuguo3945
@ruoyuguo3945 2 жыл бұрын
Thanks for sharing, PaddleOCR seems friendly for training the OCR models. Could you please shoot a video about how to use PaddleOCR?
@johnmele720
@johnmele720 Жыл бұрын
Hey, the website you've got those files from now requires a password and username to get in. Is there another place we can get the files from?
@clumsycomputer
@clumsycomputer Жыл бұрын
Hi! I can download the files fine, could you tell me which website you're referring to?
@naufalafels
@naufalafels Жыл бұрын
@@clumsycomputer this one, I also tried to get into the website but it needs password and username.
@JorgeGonzalez-jo3zv
@JorgeGonzalez-jo3zv 4 жыл бұрын
Instructional video about machine learning, it would be good if you could make a video on how to configure vi with python with the basic instructions. Best regards.
@clumsycomputer
@clumsycomputer 4 жыл бұрын
It's planned! :)
@hemantchauhan6437
@hemantchauhan6437 6 ай бұрын
NEED HELP! I am making a website where user can upload a pdf but I want that pdf to upload only if that pdf has images of only HANDWRITTEN text. Thank you for reading.
@benoitd94
@benoitd94 Жыл бұрын
Hi Clumsy VERY GOOD JOB ! Just one question, is it possible to decode Water Counter with that method ? do i need a special MNIST DATASET to download ? thx
@prakharpathak2358
@prakharpathak2358 2 жыл бұрын
How to preprocess a image of PNG file format downloaded from the web to recognise the digit and what changes to do in read images function in the above code to test that image
@abdullahshoukat7848
@abdullahshoukat7848 3 жыл бұрын
603. you deserve more, I hope you'll get soon
@jaycobclavel7098
@jaycobclavel7098 9 ай бұрын
How do you make your own custom datasets.? Provide your own dataset froms cratch. Just want to detect numbers. sometimes keras ocr detects 3 as 5.
@munteanionut3993
@munteanionut3993 2 жыл бұрын
Does this tutorial only build a digit ocr? Or does it also work for characters as well?
@PrakashKing8342
@PrakashKing8342 3 жыл бұрын
Do you have solutions for alpha numeric recognition I have tried all the method finally I’m getting fail tesseract easy ocr etc for example ABC00OO1 and SI1234H
@munteanionut3993
@munteanionut3993 2 жыл бұрын
Did you manage to find a solution for alpha numeric characters too?
@jonas1351
@jonas1351 Жыл бұрын
why did you stop dude, your explanation is so well
@clumsycomputer
@clumsycomputer Жыл бұрын
Thank you! New videos are coming, I just have a lot of other stuff to do. ❤
@Tech_Inside.
@Tech_Inside. 5 ай бұрын
I want to ask how to test it on your img file?
@AJ93-lx6on
@AJ93-lx6on 6 ай бұрын
Can this work on seven segment font though
@SarthakRauts
@SarthakRauts 3 жыл бұрын
Amazing video
@trungthanhbp
@trungthanhbp 3 жыл бұрын
where are you now, we need your video :D
@SourabhChavan-1997
@SourabhChavan-1997 Жыл бұрын
Hey, Thanks for explaining everything, i like this video. which IDE are you using ?
@clumsycomputer
@clumsycomputer Жыл бұрын
Hi! I don't use an IDE, I only ever use vim.
@mechpunk
@mechpunk Жыл бұрын
@@clumsycomputer your content is really amazing , If you have any spear time consider making more videos.
@arvindshelke8889
@arvindshelke8889 2 жыл бұрын
It really helped👌👌
@ElectroProgrammingFreakJuanGCJ
@ElectroProgrammingFreakJuanGCJ 2 жыл бұрын
Very useful.
@xhyhbdka
@xhyhbdka 3 жыл бұрын
Good video but christ, presentation wise and viewer experience, an absolute car crash!! Max res 720P, you should have zoomed a bit more!!! That background chance from black to white just fucked my eyes up at 14:25
@konalaravichandanreddy9886
@konalaravichandanreddy9886 2 жыл бұрын
How to run this on jupyter notebook? Can someone please tell?
@snenscoleman7095
@snenscoleman7095 3 жыл бұрын
What features does "extract_features" extract?
@beserious2773
@beserious2773 Жыл бұрын
I wanted to create an ocr app for Android for myself only. But I need to implement maths also equations. Like Mathpix software did. I don't have money. To pay. I want to learn from very much basic. Like I am noob don't know coding also. But I wanted to learn. And wanted to create my own style data to get extracted from image.
@iamHamza
@iamHamza 3 жыл бұрын
Can u please tell us which editor did u used?
@simondeis7432
@simondeis7432 3 жыл бұрын
It's vim
@simondeis7432
@simondeis7432 3 жыл бұрын
Or more precise nvim
@kumaraguru3193
@kumaraguru3193 2 жыл бұрын
can i use my webcam to recognize any face
@truongzgiang
@truongzgiang 10 ай бұрын
Too many videos trianing copy from each other but no video talk about how to use it after trianed ,
@Champe19
@Champe19 10 ай бұрын
Wtf are the || swiggle characters
@fahadsajeem2897
@fahadsajeem2897 3 жыл бұрын
next time please code in light mode. In dark mode its hard to read. Your videos are really useful and helpful. loved it 🙂
@ilyaosver4987
@ilyaosver4987 3 жыл бұрын
Nice
@adwaithks
@adwaithks 3 жыл бұрын
@MadhuKumar-hi2qs
@MadhuKumar-hi2qs 3 жыл бұрын
507th subscriber
@sammy9453
@sammy9453 4 жыл бұрын
I think you should remove ocr tag. I am dissappointed.
@shivu.sonwane4429
@shivu.sonwane4429 2 жыл бұрын
777
@siyabonganxumalo4574
@siyabonganxumalo4574 4 жыл бұрын
I like the video but I was looking for something that will teach me from scratch because I don't have a experience in programming so I feel like you skipped a lot of steps because some people are already familiar with programming and I downloaded Python onto my laptop, so your coding was confusing me because I have gotten used to using Python that I downloaded.
@clumsycomputer
@clumsycomputer 4 жыл бұрын
Thank you for watching! If you just started learning Python, I don't think this is a good introduction, and I definitely don't think you should start with machine learning. In the future I plan to make more basic tutorials too, but until then, I recommend you start with some more beginner-level Python resources! :)
@siyabonganxumalo4574
@siyabonganxumalo4574 4 жыл бұрын
Thanks man, noted.
How to Preprocess Images for Text OCR in Python (OCR in Python Tutorials 02.02)
53:24
Python Tutorials for Digital Humanities
Рет қаралды 159 М.
Machine Learning for Everybody - Full Course
3:53:53
freeCodeCamp.org
Рет қаралды 6 МЛН
大家都拉出了什么#小丑 #shorts
00:35
好人小丑
Рет қаралды 95 МЛН
SHAPALAQ 6 серия / 3 часть #aminkavitaminka #aminak #aminokka #расулшоу
00:59
Аминка Витаминка
Рет қаралды 408 М.
отомстил?
00:56
История одного вокалиста
Рет қаралды 6 МЛН
Let's build GPT: from scratch, in code, spelled out.
1:56:20
Andrej Karpathy
Рет қаралды 4,7 МЛН
Linux from Scratch
2:35:42
Titus Tech Talk
Рет қаралды 174 М.
Optical Character Recognition (OCR) - Computerphile
14:16
Computerphile
Рет қаралды 190 М.
GEOMETRIC DEEP LEARNING BLUEPRINT
3:33:23
Machine Learning Street Talk
Рет қаралды 180 М.
Math for Game Devs [2022, part 1] • Numbers, Vectors & Dot Product
3:57:35
Deep Learning: A Crash Course (2018) | SIGGRAPH Courses
3:33:03
ACMSIGGRAPH
Рет қаралды 3 МЛН