Text Detection with OpenCV in Python | OCR using Tesseract (2020)

Рет қаралды 424,782

Murtaza's Workshop - Robotics and AI

Күн бұрын

Пікірлер: 360

@ishtiaqahmad4118 4 жыл бұрын

By chance I entered last 5 years before in programming languages. You will note my background is Accounting. For now it is sufficient that I am 75 years old man but I am learning Django, Ai, Machine Learning and etc., Last five years I asked my instructor of Java (Note= At that time, I did not know A,B, C of any programming but I know a sufficient knowledge about Programming - I am a lot thankful of Respectable Great Zia Khan) that I can check cross medical bill of the any person, but he could not replay. I am very happy of this watching Video, I can solve this last 5 years dream can be solved by python with AI Insha-Allah. Now I am trying to launch my E - Commerce Website very soon with Python, I shall contact very soon with you for a big project Insha-Allah. Ishtiaq Ahmed Siddiqui KARACHI

@Abcsciencemusic 3 жыл бұрын

For windows , Instead of down manually downloading and installing just do : pip install tesseract

@shuterino9454 3 жыл бұрын

thx bro

@jurajzovinec 3 жыл бұрын

tx man

@victorgiaff8625 3 жыл бұрын

Tnx

@jonascampos5564 3 жыл бұрын

How can we know if that's official?

@d.e.a.dgaming654 2 жыл бұрын

Nah bro thats not official

@nithinprincejohn7762 4 жыл бұрын

Your tutorials are awesome. I already shared your channel with my students. I visited so many channels for learning opencv and out of that your videos stands out.

@murtazasworkshop 4 жыл бұрын

I am glad i could be of help. and thanx for sharing

@aham3687 4 жыл бұрын

Awesome video. I was looking for technology that could read image captcha and extract text in the backend. Then submit text to the server without users requiring to type the captcha and your video solved my issue.

@choogiesaur 3 жыл бұрын

amazing, would you share your code as an example?!

@quasa0 3 жыл бұрын

@@choogiesaur did you figure it out?

@jairajsahgal5062 3 жыл бұрын

23:13 " I have no idea why they did this" made me chuckle.

@kevinalexander4959 4 жыл бұрын

used this with a webcam to take an image and parse text from (like a piece of paper or menu on a phone), then you can take the .png snapped by the webcam and analyze with image_to_string(), takes about half a second to read text presented and the webcam's settings has to be on point with brightness and all that stuff

@mehmettatli07 4 жыл бұрын

so we cannot use this one as a real time word detection metod? How many frames did you get when you use with your webcam? i just want to use it for real time detection ,if you answer to me. Thx a lot

@hirdeshchhipa8238 2 жыл бұрын

@johntamplin 4 жыл бұрын

Good video. From 20:20 onwards you are using the variable 'x' for both the loop count and the x-dimension of the bounding box. You got lucky that it didn't impact the results of the code.

@praffulbisht8520 5 ай бұрын

For macUser : first write at terminal - which tesseract then copy that path and paste it in tesseract cmd path pytesseract.pytesseract.tesseract_cmd = ‘path '

@WLeigh-pt6qs 3 жыл бұрын

Hi Murtaza, I indeed learnt something new! It's amazing! I really appreciate your time and effort and sharing :) !

@srvanikar Жыл бұрын

wow....mighty helpful.. Thank you Murtaza

@1phong.djk.392 4 жыл бұрын

wonderful. I comment before watching. Keep creating more wonderful tutor videos

@murtazasworkshop 4 жыл бұрын

Many thanks

@edSabio572 3 жыл бұрын

Thanks Doctor Murtaza! This lesson is a class act!

@davidgarza7355 2 жыл бұрын

This was sick!! Thanks, It was so easy to follow :)

@animeshmaji9515 4 жыл бұрын

Keep making video like this and we keep learning

@chunhoutan677 4 жыл бұрын

This tutorial is short and sweet. I like it. Currently, i'm having problem in detecting digits only. Instead of detecting digits only, it detects everything. i follow this cong: r'--oem 3 --psm 6 outputbase digits' pycharm community 2020 python-3.8.2

@murtazasworkshop 4 жыл бұрын

make sure you are using the configurations in the main function. If the problem persists send me a screenshot to my fb page.

@sudhakarp1 4 жыл бұрын

Excellent explanation, Thanks Murtaza

@Marek-Marek 4 жыл бұрын

Great job! I was learn something new, thanks a lot.

@iaconst4.0 Жыл бұрын

MUCHAS GRACIAS MURTAZA!! ME AYUDASTE MUCHO!, TE DEJE LIKE!

@amortalbeing 3 жыл бұрын

Thanks a lot. The `image_to_boxes()` returns `x1, height1, x2, height2`. if you renamed them like this it would be much more clear.

@SliderBlues 4 жыл бұрын

Thanks for the tutorial! It was very helpful for me getting a handle on OCR and Opencv for a redaction project I was working on. Now looking at fictionalizing text in an image. Ideally would like to replace words, like for an address, with different ones using the same font and size for the text. Not finding any solid examples of this that work. Would love to hear your take on how that might be accomplished.

@murtazasworkshop 4 жыл бұрын

You could do it with opencv python but it you want an end product you might want to look at unity.

@SliderBlues 4 жыл бұрын

@@murtazasworkshop - Hi. Worked on it a bit and watched more of your videos but still stuck. Posted on stackoverflow and so far no responses that might work. If you have the time would like to know your high level ideas on resolution. Thanks! stackoverflow.com/questions/64541909/how-to-fictionalize-text-data-like-an-address-within-a-document-image

@sohailali5741 4 жыл бұрын

Hi Murtaza, Your videos are awesome and very detailed and easy to understand each and every part. Can you please make a tutorial on Semantic Segmentation with Keras and tensorflow?

@murtazasworkshop 4 жыл бұрын

i will look into it

@surajprakash3558 4 жыл бұрын

This is an amazing video and effort. Thanks a lot !

@riztya7846 2 жыл бұрын

that's so preety cool i maen tips with ctrl + left click, is so helpfull for me just learning python, thank you ! btw i'm already subscribe you 👍

@YazeedAlkosai 4 жыл бұрын

thanks so much, Dear Murtaza...could you do a tutorial for detecting the shape feature and make the dimensions of the shape of the workpiece appear once the camera start to detect.

@shan2739 4 жыл бұрын

Thank you very much for this video. Can you also make some video on Deep learning self driving car using raspberry pi and jeston nano

@murtazasworkshop 4 жыл бұрын

I have already started the raspberry pi series . once the basics are covered we will move to more advance topics like selfdriving, ML, DL

@shan2739 4 жыл бұрын

@@murtazasworkshop yea..I already got them waiting for you.. Thank you

@shailavijay1 3 жыл бұрын

Thanks for your video. Very well explained :)

@santibogo 4 жыл бұрын

Hi murtaza! A question: If i only want to retrieve specific text from the image, how can i do it? i'm already have the coordinates of the text that i want to get

@richmail 4 жыл бұрын

take screen shot useing those cords, and send it via internet, tho, you need to know whee you store the images, hope this helped (srry for the bad spelling, I type fast)

@MegaR97 2 жыл бұрын

Hello, sorry I can't donwload the tesseract version you are using the page says "currently there is no official windows installer for newer versions

@ADNANAHMED-eo5xx 4 жыл бұрын

Amazing TUtorial man, thanks a lot

@fakhruddintahery1561 3 жыл бұрын

Great explanation. Thank you so much. That was useful

@imadsaddik 2 жыл бұрын

Thanks, this helped me.

@fixitunes30 4 жыл бұрын

Awesome videos....Any plan for video on Deep Learning Chat bot in near future?

@akhilpandey 4 жыл бұрын

Thank you, just what I needed.

@Lindvni 4 жыл бұрын

How do you guys fix the path problem? Everything looks fine ,until I run it. It just gives me an error

@sidhantipatil1930 4 жыл бұрын

Great content! Keep going!😇

@Jockeponkenboy 2 жыл бұрын

Awesome video as usual! But I have a simple question, how do I comment and uncomment a code section?

@gawaderajesh 4 жыл бұрын

Great...Murtaza's Workshop - Robotics and AI... Posted on Facebook....... :-)

@aironium Жыл бұрын

Is it possible to get the whole bounding box of a word by combining boxes from every character?

@petergoodall6258 3 жыл бұрын

My understanding is that ‘text detection’ tells you if you have text or not, rather than giving the value of the text. It’s a simpler process.

@nicoarasamuel1173 3 жыл бұрын

Love the video... I've try to make a project on wich I try to send a text to a monitor, more like a power point presentation or a teleprompter app... Do you have any suggestions...or can you make a video about something like that?

@DrWho2008t101 4 жыл бұрын

Great Video!

@rajeshmeesala9470 3 жыл бұрын

Nice explanation

@r-beanmondy6203 3 жыл бұрын

Great video dude. But, how to make it in live video?

@silviacolabianchi8261 3 жыл бұрын

Hi! thanks for the video. What do you think of tesseract version 5.0.0. Which one do you suggest to download? Version 4 or 5? thank you

@birdeye700 3 жыл бұрын

Excellent work! You might want to create functions to show digits, show alphabets and etc. This will improve readability and reusability of code.

@sooryaram439 4 жыл бұрын

Great video sir, I just have a small doubt. How can we detect text in live video stream using the webcam ?

@livetop2749 4 жыл бұрын

@Dharmendra Kumar Singh How much to deploy this app with camera detect?

@lakshit6429 3 жыл бұрын

same question

@vcjayan8206 2 жыл бұрын

Great help..Thank you

@Shraddhanath 4 жыл бұрын

Thanks for this workshop. Could you also post a video on how to detect tables(ruled as well as not ruled) in document images or segregate tables, text, charts and figures. Tx.

@murtazasworkshop 4 жыл бұрын

Thanks. i will look into it

@muhammadfayyaz8544 3 жыл бұрын

hello sir i am the new one as from videos i got many concepts cleared but i have one more question how we can get any specific word from the bunch of sentences

@MasterZeroFX 3 жыл бұрын

Nice vid pahjeet

@mandrivaal 3 жыл бұрын

You're a legend! thanks

@makelabsindia3092 4 жыл бұрын

your video and explaining method amazing ,keep it up

@murtazasworkshop 4 жыл бұрын

Thanks a lot

@namtongdosa9626 4 жыл бұрын

Thank you so much, its very helpful for me. Could you please make a tutorial about train model to detect text in identity image ?

@murtazasworkshop 4 жыл бұрын

I have one tutorial on classifying digits. you might want to have a look at that.

@chickenz4604 4 жыл бұрын

Nice video, btw could you give me suggestion when it comes to recognize handwritten digit and then manipulate the data?? should I use OpenCV & tesseract?? or tensorflow & keras? I am still not sure about this, since im new for this application. Thank You !

@Mikasks 3 жыл бұрын

So what did you use? I'm having this problem aswell.

@arthurswanson3285 4 жыл бұрын

Very useful. Thanks.

@akashrastogi6431 3 жыл бұрын

great job and thank you so much

@radoslavtotev4401 2 жыл бұрын

Hey, thank you for the lesson but do you have it for linux because I want to use Raspberry pi 3 for similar project?

@gonzalohernan4169 3 жыл бұрын

Thanks so much for this!

@dimitheodoro 4 жыл бұрын

The Binaries for Windows ( 1:06) has not this selection to download, so i cannot find how to download the file for Windows. Can you help me pls?

@thenujansandramohan8930 3 жыл бұрын

For arch based linux users tesseract is in aur . installation is easy

@nisarggandhewar8242 4 жыл бұрын

Its very helpful video, Thanks for preparing it, can you please prepare video on to detect text by designing & training our own model on some dataset just like digit detection, that will be more helpful.

@murtazasworkshop 4 жыл бұрын

Yes, soon

@MadanLal-hl5mw 2 жыл бұрын

@@murtazasworkshop l

@sanchimittal9614 4 жыл бұрын

Amazing tutorial! Great content!! Can you do a video or suggest something for noise removal like unrequired non-text portion and other kind of preprocessing before text detection in images? Thank You

@fortfanop 4 жыл бұрын

I think denoising with autoencoders will do the job...

@gowthamns8228 4 жыл бұрын

Wow very good, but the problem is "If the text is very clear and crisp its is showing output correctly", But I want to know if the image has multiple data not only text for example "bills, taken calendar photo or any kind of images", How to print the string from that, I tried my self it's not printing anything, Any idea for this?

@avikro9235 3 жыл бұрын

Windows binaries download is not available in tesseract documentation

@shuaishuai2009 3 жыл бұрын

good video~ Thanks for sharing

@Kwuasimoto 4 жыл бұрын

For people following this tutorial as of 11/9/2020 or later on windows 10 version 2004 'How do you fix “runtimeError: package fails to pass a sanity check” for numpy and pandas?' - stackoverflow.com/questions/64654805/how-do-you-fix-runtimeerror-package-fails-to-pass-a-sanity-check-for-numpy-an The OPENCV-PYTHON library installs version 1.19.4 numpy which is incompatible with newer versions of windows. In order to get the imports to work, if you're already stuck 4 minutes into the video... run: pip install numpy==1.19.3 in the terminal to down grade your numpy version to get it working. Have fun :)

@yuval1405 4 жыл бұрын

Invalid requirement "=="

@Kwuasimoto 4 жыл бұрын

Ahh yea sorry there shouldn't be a space so numpy==1.19.3

@yuval1405 4 жыл бұрын

@@Kwuasimoto did it but now when I import cv2 it tells me "Collecting cv2 Could not find a version that satisfies the requirement cv2 (from versions: ) No matching distribution found for cv2"

@Kwuasimoto 4 жыл бұрын

@@yuval1405 are you sure that you installed the new version of numpy in the same environment with your cv2?? I recommend adding a --global tag to your cv2 install so its available everywhere if you use it regularly Pip install --global opencv-python (i think that's the cv one) And make sure your in your project root dir and run Pip install numpy==1.19.3

@yuval1405 4 жыл бұрын

@@Kwuasimoto I fixed it by going to settings>project interpeterthen I clicked on the + in the right and then typing opencv-python and clicking 'install package'

@ans_human 3 жыл бұрын

can we use it for other languages? Like indic languages for example hindi.

@muhammadusama2360 4 жыл бұрын

Amazing Sir.. Sir you said to make fully functional self driving Car..????? When you start this......I want to work on this Project so Get help from your tutorials

@murtazasworkshop 4 жыл бұрын

The raspberry pi series has started will be applying some self driving methods on that.

@muhammadusama2360 4 жыл бұрын

@@murtazasworkshop You can use LiDar in Self driving car??

@smrutidabhole961 3 жыл бұрын

thank you so much sir...

@javierpachon4424 3 жыл бұрын

Thanks for your tutorial, I want to capture an email from an image but Tesseract does not recognize the @ symbol, how can I solve it?

@samnicz 2 жыл бұрын

Can we use visual studio code, instead of pycharm?

@muthuji8053 4 жыл бұрын

videos really helpful for me ,but if we apply this dull image or text are present in bottom that won't detected

@AmanVerma-wo8hd 3 жыл бұрын

Hay I can't able to fine window installer , please give me a link

@zaidilyas7842 3 жыл бұрын

hey..! Did you find that?

@MrNadeem1984 4 жыл бұрын

It doesn't work on text in larger and smaller font size. Can you help me how I can use extract those text from image.

@PritishMishra 4 жыл бұрын

IF anyone is getting like TESTDATA_PREFIX error on windows just go to your tesseract-OCR file and navigate to testdata and then add this directory to your path C:\Program Files (x86)\Tesseract-OCR\tessdata (check if this exists) and add it system variables and set Variable name as TESSDATA_PREFIX !!

@harshitsrivastava9903 4 жыл бұрын

Great work..very helpful!!

@murtazasworkshop 4 жыл бұрын

Glad to hear that!

@nikethadinesh7123 4 жыл бұрын

Thank you sir for this video. Cud u also explain with handwritten text recognition?

@MehediHasan-vx5fz 3 жыл бұрын

Hello sir, please make a videos how to train custom data in tesseract ocr. thank you...

@anajajkkkk8490 3 жыл бұрын

Thanks sir!!

@manugomez7319 3 жыл бұрын

At 9:27 the w and h are the diagonal points means? Which diagonal point? Bottom or up?

@aaryangupta4815 4 жыл бұрын

Love ❤ this tutorial, can we do this same on live video also???

@ashok_ign5623 4 жыл бұрын

Thank you so much your all tutorials are helpful keep It up 😍😍

@murtazasworkshop 4 жыл бұрын

Glad you like them!

@shreyanshdwivedi7871 4 жыл бұрын

Hello Murtaza, It was good explanatory video which was easy to digest and understand, loved your work, I want to know that if you can make a video on creating the language model of sanskrit for tesseract from the certain bunch of images consisting of sanskrit text , that would be appreciated. Hope to hear from you soon.

@mabo2031 4 жыл бұрын

hi Murtaza I have question: I want to convert to pdf(with ocr text) only few page from multipage tif image - is it possible without any other conversion ? multipage tif - 70 pages - but I want to convert to readable pdf pages from 10 to 20? best regards :)

@9319964634 4 жыл бұрын

Good video, can we also find micr font and store using this method?

@alooooshm 4 жыл бұрын

Keep it up👍

@ravindarmadishetty736 5 ай бұрын

Nice!, how are you loading those extracted information into a file?

@grimreaperstudio 2 жыл бұрын

Can you please help me, i have an error if i try to run the test file. "Exception has occurred: AttributeError 'Sequential' object has no attribute 'predict_classes' File "C:\Digits-Classification-master\OCR_CNN_Test.py", line 36, in classIndex = int(model.predict_classes(img))"

@goat6836 2 жыл бұрын

Thanks Man

@raghavgarg1258 4 жыл бұрын

Awesome, Please upload more videos...

@murtazasworkshop 4 жыл бұрын

Thank you, I will

@WorkHardThinkBetter 2 жыл бұрын

Hi Murtaza, Your videos are excellent. I need to ask where is 1.png since on your website I can't find it in the projects section. Thank you so much for everything!

@vittal255 3 жыл бұрын

hello Sir, our videos are really informative. They motivate me to build things that I have in my brain. Thanks for those videos. I have small doubt can we implement the same project using raspberry pi instead of a laptop/pc. If yes please suggest the necessary modification I need to make. I am really to new to IoT and computer vision started learning about them

@_kayos_ Жыл бұрын

did you find an answer?

@jeswanthjagu7054 2 жыл бұрын

Can you please make a tutorial on converting scanned table image to MS excel data?

@pratyushtripathy1463 4 жыл бұрын

sir it is accurate approch.. becoz when background color varies.. its not able to detect the characters

@murtazasworkshop 4 жыл бұрын

Every algorithm has it limitaions. You need to find a way to get the best results.

@colbrac720 3 жыл бұрын

Thanks a lot !

@pingwin36 3 жыл бұрын

I'm not entirely sure where I went wrong but following your steps to detect only digits it instead reads everything EXCEPT digits. Anyone know how to solve this?

@anindyakumar4853 4 жыл бұрын

at 23.28 you adjusted the box and yours is working fine but when i did the same the boxes are formed in transpose manner, like the height of box = length of word and viceversa Help me out please

@jamadarzoya6301 2 жыл бұрын

Does this library work with 7 segment-display digit numbers in images ??

@remconet 4 жыл бұрын

Can you learn it to recognize made-up characters like runes or non-existing symbols?

@murtazasworkshop 4 жыл бұрын

you will have to train your own model

@remconet 4 жыл бұрын

@@murtazasworkshop So that is a possibility? I know Arduino (some experience) but I'm new to AI. I would love some pointers on where to start if I want to train my own model.