How to extract data from PDF's with RegEx in UiPath - Full Tutorial

Views: 50,081

Anders Jensen

1 day ago

Comments: 179
@andersjensenorg 4 years ago
Learn Uipath from the beginning with +2 hour Uipath Beginners Course: kzbin.info/www/bejne/aYuuq4d-mdWmitU
@VK-nx9hq 3 years ago
Hey, thank you for this video. Can you please help us with a video showing regex where the data spans multiple rows? For example: Invoice Address - abc, def, Ghi, jkl. And a regex for the following as well: Receiving Partner/Mgr - Vikas. It would really help. Thanks again.
@mckennasavoca5777 1 year ago
@andersjensenorg I am under a deadline and I'm getting a similar error to others. The "Assign" activity throws the error "Object reference not set to an instance of an object." I have tried some of the solutions posted in the comments, and googled, but have not been able to figure it out. I have gone through the video multiple times and read the comments below to try to solve it, and I just don't understand what is wrong. I have also checked my pattern, and it looks good in RegExr. Thank you so, so much for your help!! I would really appreciate it.
@patto2k358 3 years ago
I wouldn't get anywhere without these guides. Thanks
@andersjensenorg 3 years ago
Hey Patto2k, these comments are gold for me - Thank you :) Kind regards, Anders
@mihaeladraghici7108 2 years ago
Thank you Anders for sharing the knowledge with us! You are great ! I work in HR and you gave me the idea of how to extract the salaries, ranks and positions from employees' offer letters. I look forward to build my own robot. Thank you so much for the lesson and good luck !
@andersjensenorg 2 years ago
Thanks a lot, Mihaela 😊🙌 Building robots and solving my own problems have taught me the most during the years. Let me know how you progress! Kind regards, Anders
@pnard4130 4 years ago
that was exactly what I was searching for. Thank you for the great work!
@adrianosouza580 4 years ago
Anders thank you very much for this video ... it was very good the way you did, it was exactly what I was looking for, please continue with your channel I hope it grows much more Kind Regards
@andersjensenorg 4 years ago
Thank you very much, Adriano 😊
@martaduino1600 4 years ago
Thanks, thanks a lot per this amazingly valuable tutorial! Not only about the content, but also about your clear explanation.
@andersjensenorg 4 years ago
Hey Marta. Thank you very much 😊 Your comments are really valuable to me. And don't hold yourself back, if you find something, that you think could be better (either the tutorial itself or e.g. the video quality or sound quality). Kind regards, Anders
@АнастасияКузьмина-р6о 3 years ago
Thank you, Anders, for this awesome video!
@andersjensenorg 3 years ago
Thanks a lot, Анастасия. I think every RPA developer should master Regex. Kind regards, Anders
@danielsmobile8154 4 years ago
Good morning Anders, I have been watching your videos for a few days now and I am thrilled, thank you very much. I now have the following error message, maybe you can help me. Compiler error(s) encountered processing expression "(dtData.rows(0).Item("Value").ToString,dtData.rows(1).Item("Value").ToString,dtData.rows(2).Item("Value").ToString)". Thanks a lot Daniel
@andersjensenorg 4 years ago
Hey Daniel. Thank you very much 🙂 Can you paste in the entire exception and in what activity it occurs? Kind regards, Anders
@danielsmobile8154 4 years ago
Thanks Anders, but I don't know what you want to know from me, so I'll write what might help you. It's about the video "How to extract data from PDF's with RegEx in UiPath - Full Tutorial". At minute 12:15 you write the following in ArrayRow in the Add Data Row activity: (dtData.rows(0).Item("Value").ToString,dtData.rows(1).Item("Value").ToString,dtData.rows(2).Item("Value").ToString), but then I get this error message when I hover over the blue symbol: "Value of type String cannot be converted to 1-dimensional array of Object." I hope you understand what I mean :-) Do you speak German? That would be easier for me :-) Daniel
@danielsmobile8154 4 years ago
@andersjensenorg Here I also found something else, maybe it has something to do with this. Thanks for your help. "The restore inputs for 'PDF RAW' have not changed. No further actions are required to complete the restore. Assets file has not changed. Skipping assets file writing. Path: C:\Users\Daniel\Documents\UiPath\PDF RAW\.local\AllDependencies.json No-Op restore. The cache will not be updated. Path: C:\Users\Daniel\Documents\UiPath\PDF RAW\.local\nuget.cache"
@andersjensenorg 4 years ago
Replace the outer ( and ) with { and }
@andersjensenorg 4 years ago
Haha, Daniels. You would not want to hear me speak German 😄
@sayalichincholikar3272 4 years ago
Hi Anders, I saw mostly all your videos. They are so great and clear all concepts. Thanks a lot. I just want you to know that if possible can you please demonstrate all the request methods of Http request activity and if it is already there can you please just share it.
@andersjensenorg 4 years ago
Hey Sayali. Wow, thanks for the kind words 🙂 I have done a tutorial of the http get, kzbin.info/www/bejne/fpq2kJ-HZtyVepY and I would love to do more, when I find the time. Kind regards, Anders
@ashokdondapati7666 3 years ago
Thanks for your support, Anders, but I have one question: I have a bunch of documents in one PDF, and I need to extract some important fields from it. Can I continue with this video, or do I need to make any changes?
@andersjensenorg 3 years ago
Hey Ashok. Check this: kzbin.info/www/bejne/jYGudn2CjJySgNU 🙂 Can you use it? Kind regards, Anders
@ashokdondapati7666 3 years ago
@andersjensenorg Thanks, Anders, for your support 👍👍
@ra77le 4 years ago
Great job! Very helpful video, i am waiting for next elaborations
@andersjensenorg 4 years ago
Thank you, Lukasz 😊
@jnblnr2560 3 years ago
Hello Anders, perfect video all works very well, I am only struggling a little bit with regex, could you please help me? I need to match the line below the finding.
@andersjensenorg 3 years ago
Hey JNBLNR. Thanks a lot. Can you post the problem on my Discord: kzbin.info/door/Pdtz4gd_iYebJFYq9N8pWAcommunity?lb=UgxD1Qe9lWKL1I7HyAJ4AaABCQ We're a lot of developers to help you there :) Kind regards, Anders Ps. I'm on a +3 month paternity leave at the moment, so my response time is a bit delayed :) Kind regards, Anders
@VICKY100510 2 years ago
Hi Anders, thank you very much for all of your videos. Have you ever worked RPA on an Application that has Silverlight installed? Silverlight basically prevented RPA to detect the UI elements, didn't it? I was thinking that PDF has same properties as an application that has Silverlight. The reason I mention about PDF was that Silverlight is now deprecated, I couldn't install Silverlight on my app to test it out. Does RPA work on PDF files the same way it works on Silverlight? Please advise?
@andersjensenorg 1 year ago
Thanks for writing 😊 I'm getting more than 50 messages daily. While I read all of them, I can’t reply to everyone 😔 But I’ve created an RPA/Automation community where we’re 5800+ RPA Developers helping each other with solutions and our careers. Here’s the video on how to join (the invitation link is in the video description): kzbin.info/www/bejne/roipq2CJbpuLfdE Kind regards, Anders
@lamkayiu5378 2 years ago
Hi Andres, Thx your video it very useful, If I have many pdf and they all have some different field (for example, PDF 1 have 3 field call Item Name, Color and Size. PDF 2 have 5 field call Item Name, Location, Remarks, Color and Price), they only have 1 field is the same, is there any chance to add some condition to finish my task ?
@andersjensenorg 2 years ago
Hey Ka Yiu Lam Thanks for writing. I'm getting more than 40 questions/comments every day and having difficulty answering everyone (I try my best). I've created my UiPath Groups, where we all can post and solve problems or hang out around our favorite tool, UiPath. So you're very welcome. Get the addresses here: kzbin.info/www/bejne/eWfbnnqcerFjbas Have a great day. Kind regards, Anders
@RobertWalthery1975 2 years ago
Hi Anders Thank you again for the very useful video. It is just what I was looking for. The difficulty I see is to manage the real PDF and PDF containing only image. Is there way in UiPath to identify it ? So I can use the read PDF or the OCR.
@andersjensenorg 2 years ago
Hey Robert Walthery Thanks for writing. I'm getting more than 40 questions/comments every day and having difficulty answering everyone (I try my best). I've created my UiPath Groups, where we all can post and solve problems or hang out around our favorite tool, UiPath. So you're very welcome. Get the addresses here: kzbin.info/www/bejne/eWfbnnqcerFjbas Have a great day. Kind regards, Anders
@jeremylee6103 3 years ago
Hi Anders, I love your tutorial videos, they are simple and easy to understand. Can i check with you if you know how to Read a downloaded PDF and rename it with text extracted from within the PDF?
@andersjensenorg 3 years ago
Hey Jeremy. Many thanks! Sure, but the easiest thing is to split our "problem" into pieces. Create the solution, that downloads it. Then reads it. Extract the text (Regex?). Then rename it. Which part are you stuck in? Feel free to post the problem for feedback on my Discord (many skilled developers): kzbin.info/door/Pdtz4gd_iYebJFYq9N8pWAcommunity?lb=UgxD1Qe9lWKL1I7HyAJ4AaABCQ Kind regards, Anders
@rajayushgupta3246 3 years ago
thanks for helping me to learn UIpath, I have learned a lot from your videos. This is my humble request, can you please Enlarge the font when you enter the code in uipath. Sometimes its hard to read because of small font
@maikjunior2811 3 years ago
Hey friend try using the windows magnifying glass to zoom when you have a lot of difficulty seeing.
@andersjensenorg 3 years ago
Hey Raj Ayush Gupta Thanks for writing. I'm getting more than 40 questions/comments every day and having difficulty answering everyone (I try my best). I've created my UiPath Groups, where we all can post and solve problems or hang out around our favorite tool, UiPath. So you're very welcome. Get the addresses here: kzbin.info/www/bejne/eWfbnnqcerFjbas Have a great day. Kind regards, Anders
@MaxAltitude112 3 years ago
Thank you for the wonderful video Anders. I have a question that comes to mind: do you have to do the RegEx procedures again for every type of invoice or is there a way to classify the invoices beforehand, so you can assign RegEx values to them based on a pre-screen? Best, Max
@andersjensenorg 3 years ago
Hey Max. Thanks a lot. RegEx works well, when we have structured data. We can classify them, as you suggest, but imagine you have 100 different types. Meaning this approach will only work well with a limited amount of structures. But take a look at this cool package: kzbin.info/www/bejne/maGUmZV5nd-Ih5Y It works extremely well. Let me know, if that can help solve your case? Kind regards, Anders
@MaxAltitude112 3 years ago
@andersjensenorg Thank you Anders! Works like a charm. Do you have any idea how to merge the results of the two data extraction scopes into one Present Validation Station? No worries if you don't, have a nice day/evening!
@andersjensenorg 3 years ago
@MaxAltitude112 Awesome! No, so these are just thoughts off the top of my head: we can have a For Each over the files in a folder (check how to do it: kzbin.info/www/bejne/qme3fWR-fNmVodk) and simply place your extraction scope inside it. If I understood you correctly? :D Kind regards, Anders
@saalemrafiq2661 4 years ago
Thank you for this awesome video, just one question though Is there a way to add file name too, from which the text is being extracted from? if yes please let me know how kindly let me know ,,,,it'll be helpful
@andersjensenorg 3 years ago
Hey Saalem Rafiq Thanks for writing. I'm getting more than 40 questions/comments every day and having difficulty answering everyone (I try my best). I've created my UiPath Groups, where we all can post and solve problems or hang out around our favorite tool, UiPath. So you're very welcome. Get the addresses here: kzbin.info/www/bejne/eWfbnnqcerFjbas Have a great day. Kind regards, Anders
@icstrim76 6 months ago
Thanks for sharing this. I have invoices with multiple product codes, and i was wondering how can I loop through each product code, in the same time, the other data remain the same, like invoice number, etc, and in the end to check prices of each product with quantities, with value of total invoice.. Too much ?
@andersjensenorg 6 months ago
Thanks for writing 😊 I'm getting more than 50 messages daily. While I read all of them, I can’t reply to everyone 😔 But I’ve created an RPA/Automation community where we’re 12,000+ RPA Developers helping each other with solutions and our careers. Here’s the video on how to join (the invitation link is in the video description): kzbin.info/www/bejne/roipq2CJbpuLfdE Kind regards, Anders
@TevfikBagcivan 3 years ago
Hi Anders, Thank you this video. I tried it too. It worked great and very fast. How can i get the invoice items (multiple line) to excel?
@andersjensenorg 3 years ago
Hey Tevfik. Check this: kzbin.info/www/bejne/maGUmZV5nd-Ih5Y Or you can use Regex as in the video, where we use the lines as anchors (e.g. line ID). Let me know, if we can help - you can join the Discord here and paste your problem in the UiPath channel: kzbin.info/door/Pdtz4gd_iYebJFYq9N8pWAcommunity?lb=UgxD1Qe9lWKL1I7HyAJ4AaABCQ Kind regards, Anders
@damandeepsingh2859 4 years ago
Hi Anders, I tried this project but getting error on Assign: Object reference not set to an instance of an object. Is there any solution?
@Arjunprasadjr 4 years ago
Even I am getting the same issue. Did you find any solution to this?
@comercialbreno9927 4 years ago
@Arjunprasadjr I have the same problem. If you know something, please let me know.
@andersjensenorg 4 years ago
Hey Damandeep, @Arjunprasadjr and @Comercial Breno. When you get "Object reference not set to an instance of an object", it is because the expression is working on a null value. Here it most likely means that you are trying to extract the data with an incorrect pattern (nothing is extracted = null), or that there simply are no matches in the document (also null). What you want to do is first check the pattern at RegExr, and then handle what should happen if there is no match, for example an If with the condition ienMatches.Count = 0 Then ..... Let me know if that helped. Kind regards, Anders
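The two steps in this reply (test the pattern, then handle the no-match case before reading the first match) can be sketched outside UiPath. This is only a minimal Python illustration, not the workflow from the video: the sample pattern and the function name `first_match_or_default` are assumptions, and Python's `re` module stands in for the .NET regex engine that UiPath uses.

```python
import re

# Hypothetical pattern for illustration: digits that follow "Invoice Number: ".
INVOICE_PATTERN = re.compile(r"(?<=Invoice Number: )\d+")


def first_match_or_default(text, pattern=INVOICE_PATTERN, default=""):
    """Return the first match in `text`, or `default` when nothing matches."""
    matches = pattern.findall(text)
    # Same idea as checking ienMatches.Count = 0 before reading ienMatches(0).
    if len(matches) == 0:
        return default
    return matches[0]
```

With a guard like this, a document without the expected field yields an empty value instead of an exception, so the row can be skipped or flagged.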
@ismailasohail8924 2 years ago
Nice video i want to know if we want to extract the name for example path is the name in the invoice how do we do that and save it as a name: path
@andersjensenorg 2 years ago
Hey Ismail A Sohail Thanks for writing. I'm getting more than 50 questions/comments every day and having difficulty answering everyone (I try my best). I've created my UiPath Groups, where we all can post and solve problems or hang out around our favorite tool, UiPath. So you're very welcome. Get the addresses here: kzbin.info/www/bejne/eWfbnnqcerFjbas Have a great day. Kind regards, Anders
@Nagendra0912 2 years ago
Hi Jenson could you please provide the sample input which u have used for document understanding
@andersjensenorg 2 years ago
Hey siva D Thanks for writing. I'm getting more than 50 questions/comments every day and having difficulty answering everyone (I try my best). I've created my UiPath Groups, where we all can post and solve problems or hang out around our favorite tool, UiPath. So you're very welcome. Get the addresses here: kzbin.info/www/bejne/eWfbnnqcerFjbas Have a great day. Kind regards, Anders
@kreigfields3157 4 years ago
Hi Anders, great video but I am getting a output message of "Assign: Object reference not set to an instance of an object" when I try to "Assign row.Item("Value") = ienMatch(0).Value" Do I need to be setting a default value somewhere? Or any other guesses on what I may be doing wrong?
@andersjensenorg 4 years ago
Hey Kreig. Can you set a breakpoint on the assign. Then run a debug and check what value(s) are in your ien. My guess it's empty aka null and that gives you the exception. Or you can just do a ienMatch.Count.ToString and print it out. Let me know. Kind regards, Anders
@mpumisetshedii 3 years ago
Hi did you manage to find a solution
@nryalife2459 1 year ago
Did you solve it?
@cl1116 1 year ago
try changing to CurrentRow.Item("Value")
@nikhilmahajan7298 4 years ago
great thanks for this Anders, I am facing one issue, I need extract table from PDF to excel and only able to extract one row at a time using this in assign activity "ienMatch(0).Value", If I change the value from (0) to (1) then, second row is extracted, how can i get all rows from PDF to excel?
@andersjensenorg 4 years ago
Hey Nikhil. Thanks for the words. Drag in a 'For Each' and then you can write all the matches with item.Tostring I've made it in this video (look from 06:31) kzbin.info/www/bejne/jIe1lZ13l82Jgrc Let me know, if that helped you. Kind regards, Anders
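The idea of looping over every match instead of reading only index 0 can be shown with a small sketch. It is written in Python rather than as a UiPath workflow, and the line format, the pattern, and the name `extract_rows` are assumptions made up for the example.

```python
import re

# Hypothetical table line: an item code, a description, and an amount.
LINE_PATTERN = re.compile(
    r"^(?P<code>[A-Z]\d{3})\s+(?P<desc>.+?)\s+(?P<amount>\d+\.\d{2})$",
    re.MULTILINE,
)


def extract_rows(text):
    """Collect one tuple per matching line instead of only the first match."""
    rows = []
    for match in LINE_PATTERN.finditer(text):
        rows.append((match.group("code"), match.group("desc"), match.group("amount")))
    return rows
```

Each loop iteration corresponds to one pass of the For Each activity, where the current match is written out as its own row.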
@nikhilmahajan7298 4 years ago
@andersjensenorg Thanks for your prompt reply. After trying the same, I got the same result, i.e. only one row out of 4 rows with 6 columns each extracted to Excel. I guess I'm missing something 🙄
@andersjensenorg 4 years ago
@nikhilmahajan7298 It's most likely your RegEx pattern. If your material is not confidential, send it to me and I'll gladly make a video 🙂 Kind regards, Anders
@nikhilmahajan7298 4 years ago
@andersjensenorg Thanks, it worked. It was the placement of an activity, my bad 😬
@beckcm5698 3 years ago
Hi Anders, I am new to RPA & found your instructional videos on youtube. This particular video is very useful & I am able to follow thr. That is until I have this same problem, which is only 1 row is extracted. You have mentioned to drag in a 'For each' & I placed it right under 'Matches' but I am still getting only 1 row output. Can you advise if this is correct placement or point me to the right direction ? TQVM in advance.
@andrewandrew9928 4 years ago
Hi Anders, I have a question for you about the data table output. Is it possible to assign variables to columns in the data table? For example, I would like to have invoice number as a variable. Great video by the way. I learned a lot! Thank you.
@andersjensenorg 4 years ago
Hey Andrew. Thank you 😊 No, you cannot have a variable in a Data Table, but you can have a value. You can take that value out and assign it to a variable with an Assign: strInvoiceNumber = dtYourDataTable.Rows(2).Item("strYourColumnHeader").ToString, where 2 is the index of the row you want to target. You can also put it in a For Each Row and write: strInvoiceNumber = row.Item("strYourColumnHeader").ToString, which will assign each invoice number in turn. Did that help? 😊 Kind regards, Anders
@andrewandrew9928 4 years ago
@andersjensenorg Thank you Anders! You're a kind and generous man.
@arunarumugam1721 4 years ago
Hey Andersen ,this is great learning ! Could you please help me on "How can we extract data from multiple images to excel using OCR", lets say if its a expense receipt, and each receipt will have different formats, so how to exact Name, Date, Description & Amount from that image. Any help, please !
@andersjensenorg 4 years ago
Hey Arun. Thank you 🙂 Are all the expense receipts formatted the same? Kind regards, Anders
@arunarumugam1721 4 years ago
@andersjensenorg Thanks for the reply, Anders! No, they will be different, but they will have the columns I mentioned above. I tried Microsoft OCR to scrape all the text from an image and save it in a text or Excel file, so I can use VBA to get the desired output, but the OCR is not extracting all the text from that image. I'm not sure if UiPath only handles certain types of images.
@andersjensenorg 4 years ago
Hey Arun. Check this, kzbin.info/www/bejne/maGUmZV5nd-Ih5Y 🙂 Kind regards, Anders
@littlebbylw 4 years ago
Hi Anders! Came across your video and wanted to ask if we would like to extract scanned identity card information like name, identity number, date of birth etc...but the problem for now is that the placement of the scanned identity card might not be located in the center all times and it is not as clear as coloured scanned document. How do I extract the information for both scanned b&w and colour when the document position may not be fixed? @Anders Jensen Thank you in advance!
@andersjensenorg 4 years ago
Hey Lil. Then we need larger "weapons". I would suggest the Document Understanding package, which is quite versatile. I did it here with invoices: kzbin.info/www/bejne/maGUmZV5nd-Ih5Y But you can easily apply it to passports, ID cards etc. Kind regards, Anders
@littlebbylw 4 years ago
@andersjensenorg Hi, I've watched the video on Document Understanding and I'm trying to replicate it, but in the configure extractors step, we copied the link for the invoice template, and our passport doesn't have elements like billing name etc. to pick in the dropdown. How am I supposed to change it? (The elements I am extracting are things like passport number, country of birth etc.)
@iziadnasim 3 years ago
Anders, thank you so much for this effective video. I got stuck fetching values from a column in a tabular PDF. The PDF starts with a bank name and has multiple columns with cheque numbers, account numbers, account names etc. What I actually want is all the values in the account number column. I wrote a program using a regular expression and am able to fetch the account numbers that are between 10 and 30 digits long. The problem is the account numbers with fewer than 10 digits, which I am unable to fetch. Note that all cheque numbers are 9 digits, so I can't use \d{9,30}, because then I fetch cheque numbers along with account numbers. Can this be solved with a regular expression? I would be grateful to hear from you.
@andersjensenorg 3 years ago
Hey Ziad. If it's tabular data, then check this video: kzbin.info/www/bejne/f3_Rk3qGe5qArac That will sort it automatically :) Kind regards, Anders
@andersjensenorg 3 years ago
And thanks a lot 🤩
@boyanavramov4019 3 years ago
Anders, When I follow this step by step, the "assign" activity throws the error "Object reference not set to an instance of an object". How do I fix this? Thanks for all of your videos, they are very helpful!!! :)
@andersjensenorg 3 years ago
Hey Boyan Avramov Thanks for writing. I'm getting more than 40 questions/comments every day and having difficulty answering everyone (I try my best). I've created my UiPath Groups, where we all can post and solve problems or hang out around our favorite tool, UiPath. So you're very welcome. Get the addresses here: kzbin.info/www/bejne/eWfbnnqcerFjbas Have a great day. Kind regards, Anders
@NguyenAnh-sp2hb 3 years ago
Thanks a lot but i have 2 question can you help me 1. in folder pdf i have 1pdf no data, when run have error "Assign: Object reference not set to an instance of an object." how can skip this pages 2. with this tutorial how to run a pdf with many pages Best Regards
@andersjensenorg 3 years ago
Hey Nguyen. Thanks for the question and sorry for the late reply. At the moment, I’m on paternity leave, meaning I don’t produce videos and solve problems until the 23rd of August. Feel free to post the problem at my Discord, where many bright automation minds are gathered: kzbin.info/door/Pdtz4gd_iYebJFYq9N8pWAcommunity?lb=UgxD1Qe9lWKL1I7HyAJ4AaABCQ Kind regards, Anders
@azimbalbatti71 4 years ago
Hi Andres I believe all three invoices are having the same format and structure, How about extracting data from dynamic data from different vendors?
@andersjensenorg 4 years ago
Hey Azim. Then we need to apply some sort of rules to it, either manual or some deeper learning. The first is easy (but requires manual work, yuk) and the last is smarter. I would love to dig into the last, when I got some time. Kind regards, Anders.
@ucnguyenqui7630 4 years ago
Hi Anders, first of all, I want to say thank you for sharing your video. I have a problem: when I ran 5 test invoices, it worked. After that, I ran 5 more invoices, but the Excel file was overwritten, so there are still 5 rows instead of the 10 rows I want. I built the robot exactly as in your video. I would appreciate it very much if you could help me.
@andersjensenorg 4 years ago
Hey Duic. Yes, if you run them in separate runs it will overwrite. What you can do is read the Excel sheet as a datatable (Read Range), then add the data to it (Add Data Row), and then write it out (Write Range). Did that help? 🙂 Kind regards, Anders
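The read, add, write sequence described here can be sketched as follows. This is a Python illustration with a CSV file standing in for the Excel sheet, not the UiPath activities themselves; the function name `append_rows` and the column names in the usage note are assumptions.

```python
import csv
from pathlib import Path


def append_rows(csv_path, new_rows, header):
    """Read the existing rows, add the new ones, and write everything back."""
    path = Path(csv_path)
    rows = []
    # Step 1 ("Read Range"): load what earlier runs already wrote, if anything.
    if path.exists():
        with path.open(newline="", encoding="utf-8") as handle:
            rows = list(csv.DictReader(handle))
    # Step 2 ("Add Data Row"): append the rows from the current run.
    rows.extend(new_rows)
    # Step 3 ("Write Range"): write the combined table back to the file.
    with path.open("w", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=header)
        writer.writeheader()
        writer.writerows(rows)
    return len(rows)
```

Calling `append_rows("invoices.csv", [{"invoice": "1", "total": "10"}], ["invoice", "total"])` in two separate runs leaves both runs' rows in the file, because the earlier content is read before anything is written.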
@ucnguyenqui7630 4 years ago
Anders Jensen thank you very much. I will follow your guide and let you know if any concerns ^^
@saurabhchaudhari3761 3 years ago
Hello Anders I have a image of excel like data table in pdf and I have to extract the same table data in excel from the pdf? Could you help.?
@andersjensenorg 3 years ago
Hey Saurabh. Let's split the problem. We want to extract a table in a pdf right? Check this: kzbin.info/www/bejne/jYGudn2CjJySgNU Let me know, if that helps. Kind regards, Anders
@anashasab5417 3 years ago
What Programming Language do you use in Uipath ? JavaScript Or VB Because I use UiPath In VB and this not work correctly with JavaScript RegEx
@andersjensenorg 3 years ago
VB.NET or C# 🤩
@magenta_243 7 months ago
Hi Anders i'd like to know where i'll get pdf files that you're working with ?
@andersjensenorg 7 months ago
Thanks for writing 😊 I'm getting more than 50 messages daily. While I read all of them, I can’t reply to everyone 😔 But I’ve created an RPA/Automation community where we’re 11,000+ RPA Developers helping each other with solutions and our careers. Here’s the video on how to join (the invitation link is in the video description): kzbin.info/www/bejne/roipq2CJbpuLfdE Kind regards, Anders
@zensiew3521 4 years ago
Hi Anders, what if the text that i wanted to extract has open brackets and close brackets? using your example: (Invoice Number) instead of Invoice Number? what configuration should i add in the regex builder? Thank you!
@andersjensenorg 4 years ago
Hey Zen. You just need to escape the parentheses, so you regex will be (?
@zensiew3521 4 years ago
@andersjensenorg Yes, thank you!
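Escaping the parentheses, as suggested in this thread, can be illustrated with a small sketch. It is written in Python for convenience, and the pattern is a made-up example rather than the one from the reply above: the label text, the optional colon, and the name `value_after_label` are all assumptions.

```python
import re

# Backslashes make the parentheses literal characters instead of a group.
# The trailing (\d+) is a real capture group holding the value after the label.
LABEL_PATTERN = re.compile(r"\(Invoice Number\)\s*:?\s*(\d+)")


def value_after_label(text):
    """Return the digits following the literal label "(Invoice Number)"."""
    match = LABEL_PATTERN.search(text)
    return match.group(1) if match else None
```

Without the backslashes, `(Invoice Number)` would be read as a group that matches the text `Invoice Number` with no parentheses around it.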
@renejitvs 9 months ago
Will this method work for pdf extracted using OCR
@andersjensenorg 9 months ago
Thanks for writing 😊 I'm getting more than 50 messages daily. While I read all of them, I can’t reply to everyone 😔 But I’ve created an RPA/Automation community where we’re 11,000+ RPA Developers helping each other with solutions and our careers. Here’s the video on how to join (the invitation link is in the video description): kzbin.info/www/bejne/roipq2CJbpuLfdE Kind regards, Anders
@AUTOINSTAN 3 years ago
thanks
@andersjensenorg 3 years ago
You're welcome! 🤗
@patto2k358 3 years ago
It should be noted that the form of regex is .NET regex which is different than PCRE and other implementations that regex tester sites use
@andersjensenorg 3 years ago
Hey patto2k. For sure, but you won't find any major differences. I recommend using Regex Storm to edit your patterns. Kind regards, Anders
@calvincheung3353 4 years ago
it seems that it's very helpful but there's an error message with "Object reference not set to an instance of an object" from Assign activity. Would you please advise how to fix this.
@andersjensenorg 4 years ago
Hey Calvin. That is because you are referencing something that is empty. Most likely there is nothing in your variable. Is it the one you are extracting to with Regex? Then try the pattern and the text in a regex tester like regexr.com. What you most likely need to do is refine the pattern a bit 😊 Let me know if that helped. If not, paste the text and your regex pattern here and I can solve it for you. Kind regards, Anders
@Steve-yq5fh 4 years ago
This is a great tutorial video. Subscribed. I have a scenario, can I get in touch?
@andersjensenorg 4 years ago
Hey Steve. Thank you 🙂 At the moment I try to reply to the video comments and the posts at the discussion tab. You're always welcome to connect with me at linkedin. Kind regards, Anders
@kennethhudig4861 4 years ago
Hi Anders, nice work! I really appreciate your tutorials. I took it to the next level and get the data from Read PDF with OCR and the Google Cloud Vision OCR engine. The only thing that is not working for me is your regular expressions. When I use them, I don't get any data. Instead of yours I used this one: "[\s ].*ID:\s*([^ ]*)". This gets the data as well, but with the text "ID:" in front of it. That looks terrible in the Excel file. Do you have any advice? Thanks!
@andersjensenorg 4 years ago
Hey Kenneth. Thank you very much 😊 I would look at what comes out of your OCR and then tune your RegEx in an online expression editor (make a Write Line and copy the output over). Let me know if that works. Kind regards, Anders.
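One possible way to drop a leading label such as "ID:" is to read the capture group rather than the whole match. The sketch below shows the difference in Python; the pattern is a simplified assumption and not the exact expression from the question, and the name `whole_and_group` is made up. In .NET, the corresponding values would be `Match.Value` for the whole match and `Match.Groups(1).Value` for the first group.

```python
import re

# Simplified, hypothetical pattern: the label, optional whitespace, then the
# value captured in group 1.
ID_PATTERN = re.compile(r"ID:\s*(\S+)")


def whole_and_group(text):
    """Return (whole match, first capture group), or None if nothing matches."""
    match = ID_PATTERN.search(text)
    if match is None:
        return None
    return match.group(0), match.group(1)
```

The whole match still contains the label, while group 1 holds only the value, which is usually what belongs in the spreadsheet cell.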
@kennethhudig4861 4 years ago
@@andersjensenorg Hi :) In the RegEx online expression editor your regular expression is working for the OCR output text. In UiPath it will not work. Maybe UiPath don´t allow Positive Lookbehind (?
@andersjensenorg 4 years ago
@kennethhudig4861 Hey Kenneth! Yes, it certainly does. Can you link me to the PDF? Then I'll gladly make a video on PDF OCR and regex 😊 Kind regards, Anders
@kennethhudig4861 4 years ago
@andersjensenorg Please find an example PDF here: drive.google.com/open?id=1bGl_qpk-uTUWa5JbWtUHPUfyRcboWdlF I scanned it with the OneDrive document scanner. What else I am trying to do is check the file extension of the files in a folder and run a separate process for each file type, because the files I get are either Excel or PDF. I'm an absolute beginner in UiPath, but I need to learn it for my Bachelor thesis. Thanks for your time!
@andersjensenorg 4 years ago
@@kennethhudig4861 Hey Kenneth. I think it's more easy (for me at least 😀), if you just post the extracted text and what you wanna look for with your regex. Regarding your last question, check this: AIzaSyCy_kltkAGlNd64JSiyJDhebm594nDL0gw Kind regards Anders
@SumitSingh-cg2qc
@SumitSingh-cg2qc 3 жыл бұрын
Hi, sir. I have a question related to PDFs. I am trying to use the Click activity to select a UI element in a PDF, but I am not able to select it. I have tried all the accessibility options in the Edit menu one after another, but I still cannot select the required UI element. Can you help me with this problem?
@andersjensenorg
@andersjensenorg 3 жыл бұрын
Hey Sumit. Sure can you share the PDF on my Discord: kzbin.info/door/Pdtz4gd_iYebJFYq9N8pWAcommunity?lb=UgxD1Qe9lWKL1I7HyAJ4AaABCQ, then I or some of the other skilled people, can help you :) Kind regards, Anders
@Arjunprasadjr
@Arjunprasadjr 4 жыл бұрын
Hi Anders, I tried this project but getting error on Writeline: Object reference not set to an instance of an object. Saw in some forum they asked me to try str_output = IF(String.IsNullOrEmpty(str_input.ToString)," ",str_Input.ToString).ToString. But I get the same error for Assign activity. Please help with a solution.
@andersjensenorg
@andersjensenorg 4 жыл бұрын
Hey arjun, did you see my answer to your other/similar question? And sorry about the late reply 😣 Kind regards, Anders
@Gannilifts
@Gannilifts 7 ай бұрын
Hello, I'm unable to extract the text right at the beginning. It works for normal PDFs with text, but not for PDFs containing tables. I also tried Read PDF with OCR, and that still didn't work. Please help me.
@andersjensenorg
@andersjensenorg 7 ай бұрын
Thanks for writing 😊 I'm getting more than 50 messages daily. While I read all of them, I can’t reply to everyone 😔 But I’ve created an RPA/Automation community where we’re 11,000+ RPA Developers helping each other with solutions and our careers. Here’s the video on how to join (the invitation link is in the video description): kzbin.info/www/bejne/roipq2CJbpuLfdE Kind regards, Anders
@kratirajoriya3117
@kratirajoriya3117 4 жыл бұрын
Hello Anders, I need to extract particular data from an email body into Excel.
@andersjensenorg
@andersjensenorg 4 жыл бұрын
Hey Krati. Can you follow the guide in the video and use retrieved emails instead of the pdf? Kind regards, Anders
@moabdelkhalek
@moabdelkhalek 3 жыл бұрын
Would you please advise how to skip the Assign in UiPath when the item we are looking for is not found, since it returns an error?
@andersjensenorg
@andersjensenorg 3 жыл бұрын
Hey Mohamed. Yeah, for sure. Check this: kzbin.info/www/bejne/oWXVZ2d_bMudsKs We use a Try Catch (which is similar to all other programming) :) Kind regards, Anders
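Besides a Try Catch, another common way to handle a missing value is to check whether the regex matched before reading it. This is a different technique from the one in the reply above, sketched here in Python; the helper name `extract_or_default` and the sample strings are made up for the example.

```python
import re


def extract_or_default(text: str, pattern: str, default: str = "") -> str:
    """Return the first capture group, or `default` when nothing matches."""
    match = re.search(pattern, text)
    if match is None:
        # No match: skip the assignment instead of raising an error.
        return default
    return match.group(1)


pattern = r"Invoice no:\s*(\d+)"
print(repr(extract_or_default("Invoice no: 345628", pattern)))
print(repr(extract_or_default("This page has no number", pattern)))
```

In UiPath the corresponding guard would be an If on the match count before the Assign, which avoids the "Object reference not set to an instance of an object" error that reading a non-existent match produces.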
@kaushik__7728
@kaushik__7728 3 жыл бұрын
How do I find the values if they are below the label instead of beside it? For example: invoice no date 345628 3/09/97
@andersjensenorg
@andersjensenorg 3 жыл бұрын
Hey kaushik __ Thanks for writing. I'm getting more than 40 questions/comments every day and having difficulty answering everyone (I try my best). I've created my UiPath Groups, where we all can post and solve problems or hang out around our favorite tool, UiPath. So you're very welcome. Get the addresses here: kzbin.info/www/bejne/eWfbnnqcerFjbas Have a great day. Kind regards, Anders
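For values that sit on the line below their labels, the pattern can span the line break: match the header row, then a newline, then capture the values. A minimal Python sketch, assuming the extracted text keeps the header and the values on two consecutive lines (the sample text is reconstructed from the question and may not match a real PDF):

```python
import re

# Assumed layout: labels on one line, values on the next.
text = "invoice no date\n345628 3/09/97\n"

pattern = re.compile(
    r"invoice no\s+date\s*\n\s*"        # header row, then the line break
    r"(\d+)\s+"                         # group 1: invoice number
    r"(\d{1,2}/\d{1,2}/\d{2,4})",       # group 2: date
    re.IGNORECASE,
)

match = pattern.search(text)
invoice_no = match.group(1) if match else ""
invoice_date = match.group(2) if match else ""

print(invoice_no, invoice_date)
```

If the PDF reader puts extra columns or blank lines between the header and the values, the `\s*\n\s*` part would need to be loosened accordingly.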
@Artech.Ranjit
@Artech.Ranjit 2 жыл бұрын
How to use regex to get the full paragraph content of a field?
@andersjensenorg
@andersjensenorg 2 жыл бұрын
Hey Ranjit Thanks for writing. I'm getting more than 40 questions/comments every day and having difficulty answering everyone (I try my best). I've created my UiPath Groups, where we all can post and solve problems or hang out around our favorite tool, UiPath. So you're very welcome. Get the addresses here: kzbin.info/www/bejne/eWfbnnqcerFjbas Have a great day. Kind regards, Anders
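One way to capture a whole paragraph after a field label is a lazy match that is allowed to cross line breaks and stops at the first blank line. The Python sketch below assumes the paragraph ends with a blank line (or the end of the text); the "Description:" label and the sample text are invented for the example.

```python
import re

# Hypothetical extracted text with a multi-line field.
text = (
    "Description: First line of text\n"
    "continues here\n"
    "and here.\n"
    "\n"
    "Total: 100\n"
)

# re.DOTALL lets "." match newlines; ".*?" stops at the first blank line.
pattern = re.compile(r"Description:\s*(.*?)(?:\n\s*\n|\Z)", re.DOTALL)

match = pattern.search(text)
paragraph = match.group(1) if match else ""

# Collapse the line breaks so the value fits in one spreadsheet cell.
one_line = " ".join(paragraph.split())

print(one_line)
```

In the .NET engine the counterpart of `re.DOTALL` is `RegexOptions.Singleline`.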
@sushilpongde2782
@sushilpongde2782 4 жыл бұрын
Could you please make a video on recognizing signature in pdf using Uipath?
@andersjensenorg
@andersjensenorg 4 жыл бұрын
Hey Sushil. Do you mean a handwritten signature? Kind regards, Anders
@sushilpongde2782
@sushilpongde2782 4 жыл бұрын
@@andersjensenorg Yes
@fanhub4030
@fanhub4030 Жыл бұрын
Also, please explain how to extract an unlabeled name from an invoice using regex.
@andersjensenorg
@andersjensenorg Жыл бұрын
Hey Fan Hub, you'll need to find an anchor. Something that's close to the un-labeled name you want to extract 🙂 Kind regards, Anders
@kalpanakalpu2352
@kalpanakalpu2352 4 жыл бұрын
Hi Anders, I am getting an error saying "The Positive look behind may not be supported in all browsers". May I know why that is?
@andersjensenorg
@andersjensenorg 4 жыл бұрын
Hey Kalpana. I am not sure, but since we are not using the regex in a browser, but applying it on strings, you will face no problems 🙂 Kind regards, Anders
@kalpanakalpu2352
@kalpanakalpu2352 4 жыл бұрын
Hey, that is working, thank you. But I want to extract PDF data through OCR using regular expressions. How do I do that?
@andersjensenorg
@andersjensenorg 4 жыл бұрын
Hey again. Almost exactly like here, except you just use OCR 🙂 Take microsoft or tesseract ocr. Kind regards, Anders
@funedit732
@funedit732 Жыл бұрын
How can we extract data from multi-page invoices?
@andersjensenorg
@andersjensenorg Жыл бұрын
Hey Madhu Singh Thanks for writing. I'm getting more than 50 questions/comments every day and having difficulty answering everyone (I try my best). I've created my UiPath Groups, where we all can post and solve problems or hang out around our favorite tool, UiPath. So you're very welcome. Get the addresses here: kzbin.info/www/bejne/eWfbnnqcerFjbas Have a great day. Kind regards, Anders
@kalebnim
@kalebnim 2 жыл бұрын
Hi is it possible to release the source code you used in the video?
@andersjensenorg
@andersjensenorg 2 жыл бұрын
Hey Kaleb Nim Thanks for writing. I'm getting more than 50 questions/comments every day and having difficulty answering everyone (I try my best). I've created my UiPath Groups, where we all can post and solve problems or hang out around our favorite tool, UiPath. So you're very welcome. Get the addresses here: kzbin.info/www/bejne/eWfbnnqcerFjbas Have a great day. Kind regards, Anders
@sreekanth9736
@sreekanth9736 3 жыл бұрын
Hi Anders, my data looks like this: Transaction ID "12314341231231231". How do I get the text below "Transaction ID" with regex?
@andersjensenorg
@andersjensenorg 3 жыл бұрын
Hey Sree Kanth Thanks for writing. I'm getting more than 40 questions/comments every day and having difficulty answering everyone (I try my best). I've created my UiPath Groups, where we all can post and solve problems or hang out around our favorite tool, UiPath. So you're very welcome. Get the addresses here: kzbin.info/www/bejne/eWfbnnqcerFjbas Have a great day. Kind regards, Anders
@akshaykumarpal5394
@akshaykumarpal5394 2 жыл бұрын
Can you please provide the link to pdfs..
@andersjensenorg
@andersjensenorg 2 жыл бұрын
Hey Akshay Kumar Pal Thanks for writing. I'm getting more than 40 questions/comments every day and having difficulty answering everyone (I try my best). I've created my UiPath Groups, where we all can post and solve problems or hang out around our favorite tool, UiPath. So you're very welcome. Get the addresses here: kzbin.info/www/bejne/eWfbnnqcerFjbas Have a great day. Kind regards, Anders
@Jme_wu
@Jme_wu 3 жыл бұрын
How do I get data like a time?
@andersjensenorg
@andersjensenorg 3 жыл бұрын
Hey Wu. The same way you get all the other data. You need to find an "anchor". That is something close to the thing to extract. That could be the text "date" etc. Kind regards, Anders
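The "anchor" idea can be written as a positive lookbehind: the label must appear directly before the value, but it is not part of the match. A short Python sketch, with a made-up line of invoice text and made-up labels ("Time: " and "date: "):

```python
import re

# Hypothetical line of extracted text.
text = "Invoice date: 03.09.2020 Time: 14:35 Total: 120,00"

# (?<=Time: ) anchors on the label without including it in the match.
time_match = re.search(r"(?<=Time: )\d{1,2}:\d{2}", text)
time_value = time_match.group(0) if time_match else ""

# Same technique with a different anchor for the date.
date_match = re.search(r"(?<=date: )\d{2}\.\d{2}\.\d{4}", text)
date_value = date_match.group(0) if date_match else ""

print(time_value, date_value)
```

Python's `re` only accepts fixed-width lookbehinds like the ones above, while the .NET engine used by UiPath is more permissive, so a pattern that works here should carry over but not necessarily the other way around.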
@Ed-lu2gc
@Ed-lu2gc 4 жыл бұрын
Hi, I followed all your steps but the last value of the row is not written into excel
@andersjensenorg
@andersjensenorg 4 жыл бұрын
Hey Ed. Then it's the Regex pattern most likely. Can you extract the data in the regex sample generator (regexr)? Kind regards, Anders
@Ed-lu2gc
@Ed-lu2gc 4 жыл бұрын
@@andersjensenorg I had it fixed. Thanks for the reply though. Hope you have more subs.
@danielsmobile8154
@danielsmobile8154 4 жыл бұрын
Hello Anders, I am now one step further: reading a PDF has worked. The second step, reading three PDFs from one file, does not work, because when you run the file the following appears after a few seconds:
20.10.0-beta.754+Branch.release-v20.10.0.Sha.843ee306bbef4922f3a7f923118f7e04d3bd27a0
Source: Assign
Message: Object reference not set to an instance of an object.
Exception type: System.NullReferenceException
RemoteException wrapping System.NullReferenceException: Object reference not set to an instance of an object.
at lambda_method(Closure , ActivityContext )
at Microsoft.VisualBasic.Activities.VisualBasicValue`1.Execute(CodeActivityContext context)
at System.Activities.CodeActivity`1.InternalExecuteInResolutionContext(CodeActivityContext context)
at System.Activities.Runtime.ActivityExecutor.ExecuteInResolutionContext[T](ActivityInstance parentInstance, Activity`1 expressionActivity)
at System.Activities.InArgument`1.TryPopulateValue(LocationEnvironment targetEnvironment, ActivityInstance activityInstance, ActivityExecutor executor)
at System.Activities.RuntimeArgument.TryPopulateValue(LocationEnvironment targetEnvironment, ActivityInstance targetActivityInstance, ActivityExecutor executor, Object argumentValueOverride, Location resultLocation, Boolean skipFastPath)
at System.Activities.ActivityInstance.InternalTryPopulateArgumentValueOrScheduleExpression(RuntimeArgument argument, Int32 nextArgumentIndex, ActivityExecutor executor, IDictionary`2 argumentValueOverrides, Location resultLocation, Boolean isDynamicUpdate)
at System.Activities.ActivityInstance.ResolveArguments(ActivityExecutor executor, IDictionary`2 argumentValueOverrides, Location resultLocation, Int32 startIndex)
at System.Activities.Runtime.ActivityExecutor.ExecuteActivityWorkItem.ExecuteBody(ActivityExecutor executor, BookmarkManager bookmarkManager, Location resultLocation)"
I hope you can help me. Thanks a lot in advance. Best regards, Daniel
@danielsmobile8154
@danielsmobile8154 4 жыл бұрын
Hello Anders, I have now found out that if I just read my first invoice, everything works. But if I put another PDF in the folder, the error message appears. Does that mean I have to define further regex patterns, or what can I do so that it can also read another invoice? Thanks for your videos. Greetings, Daniel
@andersjensenorg
@andersjensenorg 3 жыл бұрын
Hey Daniels Mobile Thanks for writing. I'm getting more than 40 questions/comments every day and having difficulty answering everyone (I try my best). I've created my UiPath Groups, where we all can post and solve problems or hang out around our favorite tool, UiPath. So you're very welcome. Get the addresses here: kzbin.info/www/bejne/eWfbnnqcerFjbas Have a great day. Kind regards, Anders
@pranotimeshram9554
@pranotimeshram9554 4 жыл бұрын
Hi Anders, thanks for sharing the solution But I am facing issue in Assign activity for row.item("Values") it throws error as: Assign: Object reference not set to an instance of an object. Can you please help with that.
@bootifulghost8624
@bootifulghost8624 4 жыл бұрын
I receive the exact same problem, tried quite a few things, but couldn't get it to work, any help?
@bootifulghost8624
@bootifulghost8624 4 жыл бұрын
@Anders Jensen Can you help on that :)?
@andersjensenorg
@andersjensenorg 4 жыл бұрын
Hey Pranoti and Forky. Do you have the Assign inside a For Each Row? Otherwise the "row" reference won't work. And sorry for the late reply 😬 Kind regards, Anders
@bootifulghost8624
@bootifulghost8624 4 жыл бұрын
@@andersjensenorg Thanks for the fast answer. Yes, I do have it in the "For Each Row", just the same as you do, with the Assign being: row.item("0")=ienMatch(0).value The only thing that is different for me is probably that I added about 7 rows in the datatable, but that shouldn't matter? The error is also shown for the Assign.
@andersjensenorg
@andersjensenorg 4 жыл бұрын
Hey forky. Is 0 the name of your column? If you wanna target the first column, leave out the quotation marks around the 0. Let me know, if that works? Kind regards, Anders
@bharathikumar1192
@bharathikumar1192 4 жыл бұрын
Hi can you please help me in extracting 140FCVP-1083, 140FCVP-1014,140FE-1045A 140FE-1045B,ZH-43101 ,ZH-49001 About 200 pages in a pdf it will be really helpful
@andersjensenorg
@andersjensenorg 4 жыл бұрын
Sure. What do you wanna extract? Each of the values separately? Kind regards, Anders
@bharathikumar1192
@bharathikumar1192 4 жыл бұрын
Yes, I want to extract each value and export it to an Excel file. I tried this: ([a-zA-Z])[a-zA-Z0-9]*$[a-zA-Z0-9!@#\$%\^\&*\)\(+=._-]+$+
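For tag-like values such as 140FCVP-1083 or ZH-43101, a pattern that describes the shape of one tag and is applied repeatedly usually works better than one long pattern with `$` anchors. The Python sketch below assumes every tag is "optional digits, capital letters, a hyphen, digits, optional trailing capital letter", which is inferred only from the six examples in the question and may not cover every tag in a 200-page PDF.

```python
import re

# The six sample values from the question, with their original spacing.
text = "140FCVP-1083, 140FCVP-1014,140FE-1045A 140FE-1045B,ZH-43101 ,ZH-49001"

# optional digits + capital letters + "-" + digits + optional capital letter
tag_pattern = re.compile(r"\b\d*[A-Z]+-\d+[A-Z]?\b")

tags = tag_pattern.findall(text)

# Drop repeats while keeping first-seen order (tags may recur across pages).
unique_tags = list(dict.fromkeys(tags))

for tag in unique_tags:
    print(tag)
```

Each entry in `unique_tags` can then be written to its own row of the spreadsheet.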
@abdullahbjelak2087
@abdullahbjelak2087 2 жыл бұрын
R.I.P. your keyboard
@andersjensenorg
@andersjensenorg 2 жыл бұрын
Hahaha. Yeah, I'm not so gentle 😀
@sujoypal3837
@sujoypal3837 Жыл бұрын
Excellent tutorial but please speak in normal voice. It's hard for non native English speakers to understand. Thanks.
@andersjensenorg
@andersjensenorg Жыл бұрын
Thanks a ton, what exactly do you mean by "normal voice"? Kind regards, Anders
@jaggukiran08
@jaggukiran08 4 жыл бұрын
Fabulous
@andersjensenorg
@andersjensenorg 3 жыл бұрын
You're very welcome, Jagannath Patil - your support helps me a lot 😍
@mimiflores1327
@mimiflores1327 3 жыл бұрын
I can't even hear his video. Bad sound
@andersjensenorg
@andersjensenorg 3 жыл бұрын
Hey Mimi. Sorry, but can you turn up the sound a notch? Kind regards, Anders
@mimiflores1327
@mimiflores1327 3 жыл бұрын
It wasn't the sound. In general, I just couldn't understand a word you were saying. On my side I turned the speaker up louder and still could not hear you, so nope, not that. Sorry