Text Detection with OpenCV in Python | OCR using Tesseract (2020)

  Рет қаралды 421,495

Murtaza's Workshop - Robotics and AI

Murtaza's Workshop - Robotics and AI

Күн бұрын

Пікірлер: 360
@ishtiaqahmad4118
@ishtiaqahmad4118 3 жыл бұрын
By chance I entered last 5 years before in programming languages. You will note my background is Accounting. For now it is sufficient that I am 75 years old man but I am learning Django, Ai, Machine Learning and etc., Last five years I asked my instructor of Java (Note= At that time, I did not know A,B, C of any programming but I know a sufficient knowledge about Programming - I am a lot thankful of Respectable Great Zia Khan) that I can check cross medical bill of the any person, but he could not replay. I am very happy of this watching Video, I can solve this last 5 years dream can be solved by python with AI Insha-Allah. Now I am trying to launch my E - Commerce Website very soon with Python, I shall contact very soon with you for a big project Insha-Allah. Ishtiaq Ahmed Siddiqui KARACHI
@Abcsciencemusic
@Abcsciencemusic 3 жыл бұрын
For windows , Instead of down manually downloading and installing just do : pip install tesseract
@shuterino9454
@shuterino9454 3 жыл бұрын
thx bro
@jurajzovinec
@jurajzovinec 3 жыл бұрын
tx man
@victorgiaff8625
@victorgiaff8625 3 жыл бұрын
Tnx
@jonascampos5564
@jonascampos5564 3 жыл бұрын
How can we know if that's official?
@d.e.a.dgaming654
@d.e.a.dgaming654 Жыл бұрын
Nah bro thats not official
@nithinprincejohn7762
@nithinprincejohn7762 4 жыл бұрын
Your tutorials are awesome. I already shared your channel with my students. I visited so many channels for learning opencv and out of that your videos stands out.
@murtazasworkshop
@murtazasworkshop 4 жыл бұрын
I am glad i could be of help. and thanx for sharing
@aham3687
@aham3687 3 жыл бұрын
Awesome video. I was looking for technology that could read image captcha and extract text in the backend. Then submit text to the server without users requiring to type the captcha and your video solved my issue.
@choogiesaur
@choogiesaur 3 жыл бұрын
amazing, would you share your code as an example?!
@quasa0
@quasa0 3 жыл бұрын
@@choogiesaur did you figure it out?
@praffulbisht8520
@praffulbisht8520 3 ай бұрын
For macUser : first write at terminal - which tesseract then copy that path and paste it in tesseract cmd path pytesseract.pytesseract.tesseract_cmd = ‘path '
@kevinalexander4959
@kevinalexander4959 3 жыл бұрын
used this with a webcam to take an image and parse text from (like a piece of paper or menu on a phone), then you can take the .png snapped by the webcam and analyze with image_to_string(), takes about half a second to read text presented and the webcam's settings has to be on point with brightness and all that stuff
@mehmettatli07
@mehmettatli07 3 жыл бұрын
so we cannot use this one as a real time word detection metod? How many frames did you get when you use with your webcam? i just want to use it for real time detection ,if you answer to me. Thx a lot
@hirdeshchhipa8238
@hirdeshchhipa8238 2 жыл бұрын
Pp
@jairajsahgal5062
@jairajsahgal5062 3 жыл бұрын
23:13 " I have no idea why they did this" made me chuckle.
@johntamplin
@johntamplin 4 жыл бұрын
Good video. From 20:20 onwards you are using the variable 'x' for both the loop count and the x-dimension of the bounding box. You got lucky that it didn't impact the results of the code.
@AmanVerma-wo8hd
@AmanVerma-wo8hd 3 жыл бұрын
Hay I can't able to fine window installer , please give me a link
@zaidilyas7842
@zaidilyas7842 3 жыл бұрын
hey..! Did you find that?
@MegaR97
@MegaR97 2 жыл бұрын
Hello, sorry I can't donwload the tesseract version you are using the page says "currently there is no official windows installer for newer versions
@WLeigh-pt6qs
@WLeigh-pt6qs 3 жыл бұрын
Hi Murtaza, I indeed learnt something new! It's amazing! I really appreciate your time and effort and sharing :) !
@avikro9235
@avikro9235 3 жыл бұрын
Windows binaries download is not available in tesseract documentation
@santibogo
@santibogo 3 жыл бұрын
Hi murtaza! A question: If i only want to retrieve specific text from the image, how can i do it? i'm already have the coordinates of the text that i want to get
@richmail
@richmail 3 жыл бұрын
take screen shot useing those cords, and send it via internet, tho, you need to know whee you store the images, hope this helped (srry for the bad spelling, I type fast)
@Marek-Marek
@Marek-Marek 4 жыл бұрын
Great job! I was learn something new, thanks a lot.
@edSabio572
@edSabio572 3 жыл бұрын
Thanks Doctor Murtaza! This lesson is a class act!
@SliderBlues
@SliderBlues 3 жыл бұрын
Thanks for the tutorial! It was very helpful for me getting a handle on OCR and Opencv for a redaction project I was working on. Now looking at fictionalizing text in an image. Ideally would like to replace words, like for an address, with different ones using the same font and size for the text. Not finding any solid examples of this that work. Would love to hear your take on how that might be accomplished.
@murtazasworkshop
@murtazasworkshop 3 жыл бұрын
You could do it with opencv python but it you want an end product you might want to look at unity.
@SliderBlues
@SliderBlues 3 жыл бұрын
@@murtazasworkshop - Hi. Worked on it a bit and watched more of your videos but still stuck. Posted on stackoverflow and so far no responses that might work. If you have the time would like to know your high level ideas on resolution. Thanks! stackoverflow.com/questions/64541909/how-to-fictionalize-text-data-like-an-address-within-a-document-image
@sohailali5741
@sohailali5741 4 жыл бұрын
Hi Murtaza, Your videos are awesome and very detailed and easy to understand each and every part. Can you please make a tutorial on Semantic Segmentation with Keras and tensorflow?
@murtazasworkshop
@murtazasworkshop 4 жыл бұрын
i will look into it
@1phong.djk.392
@1phong.djk.392 4 жыл бұрын
wonderful. I comment before watching. Keep creating more wonderful tutor videos
@murtazasworkshop
@murtazasworkshop 4 жыл бұрын
Many thanks
@YazeedAlkosai
@YazeedAlkosai 4 жыл бұрын
thanks so much, Dear Murtaza...could you do a tutorial for detecting the shape feature and make the dimensions of the shape of the workpiece appear once the camera start to detect.
@srvanikar
@srvanikar Жыл бұрын
wow....mighty helpful.. Thank you Murtaza
@shan2739
@shan2739 4 жыл бұрын
Thank you very much for this video. Can you also make some video on Deep learning self driving car using raspberry pi and jeston nano
@murtazasworkshop
@murtazasworkshop 4 жыл бұрын
I have already started the raspberry pi series . once the basics are covered we will move to more advance topics like selfdriving, ML, DL
@shan2739
@shan2739 4 жыл бұрын
@@murtazasworkshop yea..I already got them waiting for you.. Thank you
@davidgarza7355
@davidgarza7355 2 жыл бұрын
This was sick!! Thanks, It was so easy to follow :)
@akhilpandey
@akhilpandey 3 жыл бұрын
Thank you, just what I needed.
@mabo2031
@mabo2031 3 жыл бұрын
hi Murtaza I have question: I want to convert to pdf(with ocr text) only few page from multipage tif image - is it possible without any other conversion ? multipage tif - 70 pages - but I want to convert to readable pdf pages from 10 to 20? best regards :)
@sooryaram439
@sooryaram439 4 жыл бұрын
Great video sir, I just have a small doubt. How can we detect text in live video stream using the webcam ?
@livetop2749
@livetop2749 4 жыл бұрын
@Dharmendra Kumar Singh How much to deploy this app with camera detect?
@lakshit6429
@lakshit6429 3 жыл бұрын
same question
@surajprakash3558
@surajprakash3558 4 жыл бұрын
This is an amazing video and effort. Thanks a lot !
@ADNANAHMED-eo5xx
@ADNANAHMED-eo5xx 3 жыл бұрын
Amazing TUtorial man, thanks a lot
@chickenz4604
@chickenz4604 4 жыл бұрын
Nice video, btw could you give me suggestion when it comes to recognize handwritten digit and then manipulate the data?? should I use OpenCV & tesseract?? or tensorflow & keras? I am still not sure about this, since im new for this application. Thank You !
@Mikasks
@Mikasks 3 жыл бұрын
So what did you use? I'm having this problem aswell.
@Lindvni
@Lindvni 3 жыл бұрын
How do you guys fix the path problem? Everything looks fine ,until I run it. It just gives me an error
@dimitheodoro
@dimitheodoro 3 жыл бұрын
The Binaries for Windows ( 1:06) has not this selection to download, so i cannot find how to download the file for Windows. Can you help me pls?
@sudhakarp1
@sudhakarp1 4 жыл бұрын
Excellent explanation, Thanks Murtaza
@iaconst4.0
@iaconst4.0 11 ай бұрын
MUCHAS GRACIAS MURTAZA!! ME AYUDASTE MUCHO!, TE DEJE LIKE!
@chunhoutan677
@chunhoutan677 4 жыл бұрын
This tutorial is short and sweet. I like it. Currently, i'm having problem in detecting digits only. Instead of detecting digits only, it detects everything. i follow this cong: r'--oem 3 --psm 6 outputbase digits' pycharm community 2020 python-3.8.2
@murtazasworkshop
@murtazasworkshop 4 жыл бұрын
make sure you are using the configurations in the main function. If the problem persists send me a screenshot to my fb page.
@aironium
@aironium Жыл бұрын
Is it possible to get the whole bounding box of a word by combining boxes from every character?
@MrNadeem1984
@MrNadeem1984 3 жыл бұрын
It doesn't work on text in larger and smaller font size. Can you help me how I can use extract those text from image.
@animeshmaji9515
@animeshmaji9515 3 жыл бұрын
Keep making video like this and we keep learning
@fakhruddintahery1561
@fakhruddintahery1561 3 жыл бұрын
Great explanation. Thank you so much. That was useful
@imadsaddik
@imadsaddik Жыл бұрын
Thanks, this helped me.
@Shraddhanath
@Shraddhanath 4 жыл бұрын
Thanks for this workshop. Could you also post a video on how to detect tables(ruled as well as not ruled) in document images or segregate tables, text, charts and figures. Tx.
@murtazasworkshop
@murtazasworkshop 4 жыл бұрын
Thanks. i will look into it
@amortalbeing
@amortalbeing 2 жыл бұрын
Thanks a lot. The `image_to_boxes()` returns `x1, height1, x2, height2`. if you renamed them like this it would be much more clear.
@shailavijay1
@shailavijay1 3 жыл бұрын
Thanks for your video. Very well explained :)
@TheRyanjay14
@TheRyanjay14 3 жыл бұрын
Where can i download 1.png??
@sidhantipatil1930
@sidhantipatil1930 4 жыл бұрын
Great content! Keep going!😇
@gowthamns8228
@gowthamns8228 4 жыл бұрын
Wow very good, but the problem is "If the text is very clear and crisp its is showing output correctly", But I want to know if the image has multiple data not only text for example "bills, taken calendar photo or any kind of images", How to print the string from that, I tried my self it's not printing anything, Any idea for this?
@vcjayan8206
@vcjayan8206 2 жыл бұрын
Great help..Thank you
@WorkHardThinkBetter
@WorkHardThinkBetter 2 жыл бұрын
Hi Murtaza, Your videos are excellent. I need to ask where is 1.png since on your website I can't find it in the projects section. Thank you so much for everything!
@Jockeponkenboy
@Jockeponkenboy 2 жыл бұрын
Awesome video as usual! But I have a simple question, how do I comment and uncomment a code section?
@fixitunes30
@fixitunes30 4 жыл бұрын
Awesome videos....Any plan for video on Deep Learning Chat bot in near future?
@DrWho2008t101
@DrWho2008t101 3 жыл бұрын
Great Video!
@nicoarasamuel1173
@nicoarasamuel1173 3 жыл бұрын
Love the video... I've try to make a project on wich I try to send a text to a monitor, more like a power point presentation or a teleprompter app... Do you have any suggestions...or can you make a video about something like that?
@gonzalohernan4169
@gonzalohernan4169 3 жыл бұрын
Thanks so much for this!
@mandrivaal
@mandrivaal 3 жыл бұрын
You're a legend! thanks
@ravindarmadishetty736
@ravindarmadishetty736 3 ай бұрын
Nice!, how are you loading those extracted information into a file?
@siddharthbansal1030
@siddharthbansal1030 2 жыл бұрын
Sir in today the tesdoc is changed not able to download plz help
@manugomez7319
@manugomez7319 3 жыл бұрын
At 9:27 the w and h are the diagonal points means? Which diagonal point? Bottom or up?
@makelabsindia3092
@makelabsindia3092 4 жыл бұрын
your video and explaining method amazing ,keep it up
@murtazasworkshop
@murtazasworkshop 4 жыл бұрын
Thanks a lot
@rehmatsayyed8594
@rehmatsayyed8594 2 жыл бұрын
it is the android studio you are using for coding right??? and how to use python in android studio????
@silviacolabianchi8261
@silviacolabianchi8261 3 жыл бұрын
Hi! thanks for the video. What do you think of tesseract version 5.0.0. Which one do you suggest to download? Version 4 or 5? thank you
@shreyanshdwivedi7871
@shreyanshdwivedi7871 3 жыл бұрын
Hello Murtaza, It was good explanatory video which was easy to digest and understand, loved your work, I want to know that if you can make a video on creating the language model of sanskrit for tesseract from the certain bunch of images consisting of sanskrit text , that would be appreciated. Hope to hear from you soon.
@birdeye700
@birdeye700 3 жыл бұрын
Excellent work! You might want to create functions to show digits, show alphabets and etc. This will improve readability and reusability of code.
@TheAnmolPunetha
@TheAnmolPunetha 4 жыл бұрын
There is no change when i add the config, it is still detecting text too. Can anyone pls tell why is it so?
@radoslavtotev4401
@radoslavtotev4401 2 жыл бұрын
Hey, thank you for the lesson but do you have it for linux because I want to use Raspberry pi 3 for similar project?
@harshitsrivastava9903
@harshitsrivastava9903 4 жыл бұрын
Great work..very helpful!!
@murtazasworkshop
@murtazasworkshop 4 жыл бұрын
Glad to hear that!
@riztya7846
@riztya7846 2 жыл бұрын
that's so preety cool i maen tips with ctrl + left click, is so helpfull for me just learning python, thank you ! btw i'm already subscribe you 👍
@namtongdosa9626
@namtongdosa9626 4 жыл бұрын
Thank you so much, its very helpful for me. Could you please make a tutorial about train model to detect text in identity image ?
@murtazasworkshop
@murtazasworkshop 4 жыл бұрын
I have one tutorial on classifying digits. you might want to have a look at that.
@arthurswanson3285
@arthurswanson3285 4 жыл бұрын
Very useful. Thanks.
@ans_human
@ans_human 3 жыл бұрын
can we use it for other languages? Like indic languages for example hindi.
@muthuji8053
@muthuji8053 4 жыл бұрын
videos really helpful for me ,but if we apply this dull image or text are present in bottom that won't detected
@vittal255
@vittal255 3 жыл бұрын
hello Sir, our videos are really informative. They motivate me to build things that I have in my brain. Thanks for those videos. I have small doubt can we implement the same project using raspberry pi instead of a laptop/pc. If yes please suggest the necessary modification I need to make. I am really to new to IoT and computer vision started learning about them
@_kayos_
@_kayos_ Жыл бұрын
did you find an answer?
@pijushkumarghosh9589
@pijushkumarghosh9589 2 жыл бұрын
Can it be recognized handwritten words or numbers?
@nisarggandhewar8242
@nisarggandhewar8242 4 жыл бұрын
Its very helpful video, Thanks for preparing it, can you please prepare video on to detect text by designing & training our own model on some dataset just like digit detection, that will be more helpful.
@murtazasworkshop
@murtazasworkshop 4 жыл бұрын
Yes, soon
@MadanLal-hl5mw
@MadanLal-hl5mw 2 жыл бұрын
@@murtazasworkshop l
@muhammadfayyaz8544
@muhammadfayyaz8544 3 жыл бұрын
hello sir i am the new one as from videos i got many concepts cleared but i have one more question how we can get any specific word from the bunch of sentences
@vimsen2221
@vimsen2221 4 жыл бұрын
I get following error. --- pytesseract.pytesseract.TesseractError: (1, 'Error opening data file C:\\Program Files\\Tessaract-OCR/eng.traineddata Please make sure the TESSDATA_PREFIX environment variable is set to your "tessdata" directory ---- Where do I set this?
@hassanalhalabi7563
@hassanalhalabi7563 3 жыл бұрын
do you thimk there is a methode to detect text n rotated images?
@filmvids5596
@filmvids5596 3 жыл бұрын
can we use this for webcam?
@javierpachon4424
@javierpachon4424 2 жыл бұрын
Thanks for your tutorial, I want to capture an email from an image but Tesseract does not recognize the @ symbol, how can I solve it?
@ritviksharma404
@ritviksharma404 3 жыл бұрын
well how to print the red text we got from image ?
@anindyakumar4853
@anindyakumar4853 3 жыл бұрын
at 23.28 you adjusted the box and yours is working fine but when i did the same the boxes are formed in transpose manner, like the height of box = length of word and viceversa Help me out please
@sanchimittal9614
@sanchimittal9614 4 жыл бұрын
Amazing tutorial! Great content!! Can you do a video or suggest something for noise removal like unrequired non-text portion and other kind of preprocessing before text detection in images? Thank You
@fortfanop
@fortfanop 4 жыл бұрын
I think denoising with autoencoders will do the job...
@samnicz
@samnicz 2 жыл бұрын
Can we use visual studio code, instead of pycharm?
@akashrastogi6431
@akashrastogi6431 3 жыл бұрын
great job and thank you so much
@franktheballer23
@franktheballer23 3 жыл бұрын
The downloads page for tesseract is not as it is in your video.
@jamadarzoya6301
@jamadarzoya6301 2 жыл бұрын
Does this library work with 7 segment-display digit numbers in images ??
@ashok_ign5623
@ashok_ign5623 4 жыл бұрын
Thank you so much your all tutorials are helpful keep It up 😍😍
@murtazasworkshop
@murtazasworkshop 4 жыл бұрын
Glad you like them!
@innotech4079
@innotech4079 4 жыл бұрын
Hi, it's amazing but is there any train part? 1. detecting the text edges, 2. img to text training
@raynnstudios2394
@raynnstudios2394 3 жыл бұрын
how do i get image into file show on left side. cant get it. thanks
@janssenuy9397
@janssenuy9397 3 жыл бұрын
i have an error please help hImg, wImg = img.shape ValueError: too many values to unpack (expected 2)
@janssenuy9397
@janssenuy9397 3 жыл бұрын
i think the way to access the width and height has changed. I used imgShape instead and select the height by typing imgShape[0]
@9319964634
@9319964634 4 жыл бұрын
Good video, can we also find micr font and store using this method?
@grimreaperstudio
@grimreaperstudio 2 жыл бұрын
Can you please help me, i have an error if i try to run the test file. "Exception has occurred: AttributeError 'Sequential' object has no attribute 'predict_classes' File "C:\Digits-Classification-master\OCR_CNN_Test.py", line 36, in classIndex = int(model.predict_classes(img))"
@rajeshmeesala9470
@rajeshmeesala9470 3 жыл бұрын
Nice explanation
@trevordoeseverything219
@trevordoeseverything219 2 жыл бұрын
Can you use this with image grab?
@ashu60071
@ashu60071 3 жыл бұрын
can we use these code to detect captchas
@nikethadinesh7123
@nikethadinesh7123 3 жыл бұрын
Thank you sir for this video. Cud u also explain with handwritten text recognition?
@youcefahmanache3298
@youcefahmanache3298 3 жыл бұрын
Hi Murtaza, you see your loop to DISPLAY SOME SAMPLES IMAGES OF ALL THE CLASSES, if the folder names were categorical/strings instead of numerical, i.e. (sign1, sign2, sign3...etc instead of 0,1,2,3..etc) how would you iterate through these folders? would you still use iterrows? Thank you
@sagihaviv5675
@sagihaviv5675 3 жыл бұрын
sir i want to extract only the specific word or phrase, how do i do that? do i need some nlp?
@lamelo39
@lamelo39 Жыл бұрын
how to export the result to excel or csv?
@expertchannel5448
@expertchannel5448 3 жыл бұрын
What about rotated letters to some angles?
@007predatoreITA
@007predatoreITA 2 жыл бұрын
If tesseract has to refer to the exe in your computer, how do you export an application in another computer that doesnt have it?
@ganeshtelagamsetti5879
@ganeshtelagamsetti5879 3 жыл бұрын
Is it possible with CNN instead of OCR?
@cifro
@cifro 4 жыл бұрын
when running the program like you at 4:24 I get the following error: Traceback (most recent call last): File "C:\Python\Python385\lib\site-packages\pytesseract\pytesseract.py", line 21, in from PIL import Image File "C:\Python\Python385\lib\site-packages\PIL\Image.py", line 94, in from . import _imaging as core ImportError: cannot import name '_imaging' from 'PIL' (C:\Python\Python385\lib\site-packages\PIL\__init__.py)
@wavyrenac
@wavyrenac 4 жыл бұрын
you have imported the path incorrectly inssted of .py use .exe
How to Preprocess Images for Text OCR in Python (OCR in Python Tutorials 02.02)
53:24
Python Tutorials for Digital Humanities
Рет қаралды 159 М.
Touching Act of Kindness Brings Hope to the Homeless #shorts
00:18
Fabiosa Best Lifehacks
Рет қаралды 19 МЛН
Как мы играем в игры 😂
00:20
МЯТНАЯ ФАНТА
Рет қаралды 1,6 МЛН
Feature Detection and Matching + Image Classifier Project | OPENCV PYTHON
45:34
Murtaza's Workshop - Robotics and AI
Рет қаралды 160 М.
OCR OpenCV in FORMS and RECEIPTS | Text Detection p.1/2
27:24
Murtaza's Workshop - Robotics and AI
Рет қаралды 68 М.
Google Gemini AI Vision - OCR Text Extraction with Python
6:31
Tech Expert Tutorials
Рет қаралды 640
No, Einstein Didn’t Solve the Biggest Problem in Physics
8:04
Sabine Hossenfelder
Рет қаралды 273 М.
Facial Landmarks and Face Filter using OpenCV Python
42:57
Murtaza's Workshop - Robotics and AI
Рет қаралды 31 М.
LEARN OPENCV in 3 HOURS with Python | Including 3xProjects | Computer Vision
3:09:08
Murtaza's Workshop - Robotics and AI
Рет қаралды 3,3 МЛН
Text detection with Python and Opencv | OCR using EasyOCR | Computer vision tutorial
15:39
How to make advanced image recognition bots using python
15:01
Kian Brose
Рет қаралды 1,4 МЛН