That is absolutely brilliant!!! I have spent the last two days trying to figure out how to do it in Python, and within 8 minutes you showed me a much easier, more straightforward way.
@MyOnlineTrainingHub3 жыл бұрын
:-) so pleased it was helpful, Robert!
@abhinandanaams26132 жыл бұрын
@@MyOnlineTrainingHub Can I download an e-paper as a PDF without coding?
@JayPatel-hc8dq Жыл бұрын
lol... literally me too. I got quite far, until Python was reading Arabic webpages in hex, and then I threw my laptop out the window!
@obinnaduru38153 жыл бұрын
Thank you so much for this video. Very practical for my Data Analyst journey. I followed the steps and didn't run into any errors.
@MyOnlineTrainingHub3 жыл бұрын
So pleased you found it helpful!
@fentian2 ай бұрын
Wow, what an astonishing concept and how wonderfully well you've explained it. I've just applied it in Excel PQ to call an API over and over with a number of variables including a date that changes for each iteration, returning JSON data that is then transformed and presented in a pivot table. Thank you Mynda, xxx
@MyOnlineTrainingHub2 ай бұрын
Thank you! So pleased this video was helpful.
@abdulhaseeb80274 жыл бұрын
It's like you have read my mind because I was looking to scrape data from web like this currently. Thanks for the tutorial it's really helpful.
@MyOnlineTrainingHub4 жыл бұрын
My pleasure, Abdul!
@prameelar17533 жыл бұрын
I watched this video on Teachers' Day, and I believe you are one of the best teachers who could help me with web scraping... 🤗
@MyOnlineTrainingHub3 жыл бұрын
Wow, thank you!
@Secret구이구이4 жыл бұрын
Thank you! It is hard to study in Korea because there is not much data about powerquery. Thanks to this, I integrated several post api into a single query.
@MyOnlineTrainingHub4 жыл бұрын
Pleased I could help!
@davegoodo36034 жыл бұрын
A bit beyond me at this point Mynda, Power Query is on my "to learn" list. Well presented.
@MyOnlineTrainingHub4 жыл бұрын
Thanks, Dave! Power Query is amazing...I'm confident you'll think so too :-)
@malaniebanney1634 Жыл бұрын
I slightly adjusted this to scrape data from a folder full of PDF files. Excellent thanks!
@MyOnlineTrainingHub Жыл бұрын
Glad it helped!
@jamessawyer85654 жыл бұрын
I wasn't even aware that M/Power Query can be used to such extent. Thank you for the great insight!
@MyOnlineTrainingHub4 жыл бұрын
Glad you enjoyed it, James!
@awesh19864 жыл бұрын
Thanks Mynda, there is no way that I would not like this video. It's awesome.
@MyOnlineTrainingHub4 жыл бұрын
Thanks so much, Awesh! And thanks for sharing it on LinkedIn :-)
@sushicatsan3 жыл бұрын
I knew this was possible, but ran into some errors while trying to do it on my own. Thank you very much for the great tutorial. Now to let Power Bi Spin!
@MyOnlineTrainingHub3 жыл бұрын
Glad it helped!
@omenaokoro4693 Жыл бұрын
spot on. I was only able to do the first page. This gives me the ability to do an entire site.
@geoffreyzziwambazza78622 жыл бұрын
To think I was doing this manually 🤦🏽♂️. Thank you, this is a huge time saver!
@MyOnlineTrainingHub2 жыл бұрын
Great to hear, Geoffrey!
@davidstevens40642 жыл бұрын
Wow...Easily used this tutorial to query printer settings from every Zebra printer on my LAN. Very helpful!
@MyOnlineTrainingHub2 жыл бұрын
Awesome to hear, David!
@markhooper2794 жыл бұрын
That's remarkable; this is like the limit of most peoples Python learning, and most co-workers would consider them "dangerous" with those Python abilities. (in the most professional and excellent way of course!)
@MyOnlineTrainingHub4 жыл бұрын
:-) Glad you liked it, Mark!
@awesh19864 жыл бұрын
This is an amazing way of working with web pages. I have seen people write lengthy macros and Python code for this.
@MyOnlineTrainingHub4 жыл бұрын
Yes, Power Query is super easy to use. I wish more people knew of its powers ;-)
@naotoaguilarmorita70793 жыл бұрын
Thanks a lot for this tutorial! I could get multiple API calls in a single query, best solution ever!
@MyOnlineTrainingHub3 жыл бұрын
Glad it’ll be useful!
@biswajeetswaro78314 жыл бұрын
Great video, ma'am!!! Before, I was doing this in Python, saving to CSV and then importing into PBI. Now I can do it in PBI directly 👏👏👏
@MyOnlineTrainingHub4 жыл бұрын
Wow, that's fantastic to hear :-)
@StephanOnisick2 жыл бұрын
Awesome use of M for us tiptoeing into the M Script!
@MyOnlineTrainingHub2 жыл бұрын
Glad you liked it!
@prashantmanshrestha3 жыл бұрын
Clear Voice, Beautifully Explained Super-woman.
@MyOnlineTrainingHub3 жыл бұрын
Thank you so much 🙂
@LeonoraRolston27 күн бұрын
This is fantastic! Had no idea it could be done. Explained so well - thanks!
@MyOnlineTrainingHub27 күн бұрын
Glad it was helpful!
@khalidessaadi8915Ай бұрын
Wonderful job ! So clear and perfectly explained, thank you so much !
@MyOnlineTrainingHubАй бұрын
Glad it was helpful!
@Kingleer692 жыл бұрын
Mynda, at 1:18: instead of the 13 HTML table options listed in your 'Navigator' dialog box, when I run the same Power BI query on my end I get only 5 tables (Table 0 through to Table 5, plus an additional Document table). And in these tables there is hardly any data to work with. Please advise.
@MyOnlineTrainingHub2 жыл бұрын
I get that now too. I guess the website has changed its layout. Table 3 is the one you want.
@Kingleer692 жыл бұрын
@@MyOnlineTrainingHub Thank you, Mynda.
@MichaelBrown-lw9kz Жыл бұрын
This is simply awesome, now I have to practice this technique.
@MyOnlineTrainingHub Жыл бұрын
Enjoy!
@victorgabrielcamargo63849 ай бұрын
Wooww thank you so much, took me months to find this function. I will try it in a more complicated webpage. thank you
@MyOnlineTrainingHub9 ай бұрын
Glad you can make use of it! 😊
@MichaelHendersonMHC4 жыл бұрын
Brilliantly framed and well communicated. Thank you again Mynda.
@MyOnlineTrainingHub4 жыл бұрын
Thanks so much, Michael!
@deepakd-w5h2 ай бұрын
Thank you very much, madame. You made my work much easier.
@MyOnlineTrainingHub2 ай бұрын
I'm so glad!
@merbouni4 жыл бұрын
I have never tried this, but I frequently convert data from the csv file to the html Datatable, Thanks Mynda.
@MyOnlineTrainingHub4 жыл бұрын
Hope you can make use of it, Reda!
@vincasvosylius60454 жыл бұрын
You are the legend! Helped me to solve this greyed out "change data source "button
@MyOnlineTrainingHub4 жыл бұрын
Great to hear, Vincas!
@abcapplesg Жыл бұрын
thank you for your video! but Can i please ask is it possible to add multiple sources? like under 4:02 instead of just one link how can I add multiple sources/ links? For instance I would want to do what you have done (pulling out all book titles from page 1-20) but from multiple different sources. E.g. All listing names under books (page 1-20) + all listing names under Biscuits (page 1-20), hence i would want to add multiple sources
@MyOnlineTrainingHub Жыл бұрын
You would do this in separate queries and then append those queries together.
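A minimal sketch of that append step in M, with two small inline tables standing in for the separate scraping queries. The names Books and Biscuits and their contents are made up for illustration; in practice each would be its own query built as in the video.

```powerquery
let
    // Hypothetical stand-ins for two separate scraping queries
    Books = #table({"Title", "Category"}, {{"Book A", "Books"}, {"Book B", "Books"}}),
    Biscuits = #table({"Title", "Category"}, {{"Biscuit A", "Biscuits"}}),
    // Table.Combine appends the tables, matching columns by name
    Combined = Table.Combine({Books, Biscuits})
in
    Combined
```

The Append Queries button in the Power Query editor generates a Table.Combine step like this one.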
@athilfaizaan8558 Жыл бұрын
Absolutely useful video! Thank you for this. I also have a question: I have to scrape 560 pages and each page has 25 items that I need. I'm a little confused about the modulo part. At 5:28 in the video you say the starting numbers of the pages you need are 1, 11, 22 etc. But in my case the pages that I require are 1, 2, 3 etc. with 25 records on each page. So do I use modulo the same way as you, with 1, 26, 51 etc., or skip the modulo part and continue?
@MyOnlineTrainingHub Жыл бұрын
Great to hear! Yes, for the modulo enter the number of records in a page. Try it and see if you get the results you expect and adjust as necessary.
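For anyone who prefers to generate the start numbers directly rather than filter an index with modulo, here is a rough sketch. It assumes the URL expects the first record number of each page (1, 26, 51, ...); the variable names are illustrative only. If the URL takes plain page numbers instead, the list `{1..560}` would be enough.

```powerquery
let
    RecordsPerPage = 25,   // items shown on each page
    PageCount = 560,       // number of pages to fetch
    // List.Numbers(start, count, increment) gives 1, 26, 51, ...
    PageStarts = List.Numbers(1, PageCount, RecordsPerPage),
    // Convert to text so each value can be concatenated into a URL
    PageStartsText = List.Transform(PageStarts, Text.From)
in
    PageStartsText
```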
@athilfaizaan8558 Жыл бұрын
@@MyOnlineTrainingHub Thank you so much for answering my question, and yes, it worked, except I have one small problem. Let's say I'm trying to get rental data from a website and I require three columns: Address, Price and Area (sqft). I chose them correctly in the select from example option. But in the preview of the tables I see that it has picked up a different category of data (additional info like gym/swimming pool instead of the address that I wanted). I thought I had selected it wrongly and did it again, but I got the same results. For now I'm working around the problem by getting only the address separately and merging the two tables together. But do you know why this is happening, or is there a workaround for it?
@MyOnlineTrainingHub Жыл бұрын
Not sure as I'm not familiar with the site. You're welcome to post your question and sample Excel file on our forum where someone can help you further: www.myonlinetraininghub.com/excel-forum
@ramakumarguntamadugu12992 жыл бұрын
Great Video... Thanks for the efforts and sharing it. this will be very useful for many tasks...
@MyOnlineTrainingHub2 жыл бұрын
Great to hear!
@michalvydrzel10 ай бұрын
YOU ARE THE BEST!! Saved me so much work!
@MyOnlineTrainingHub10 ай бұрын
So pleased I could help 😊
@CEYLAN644 жыл бұрын
Thank you very much. I'm from Turkey. Have a nice day.
@MyOnlineTrainingHub4 жыл бұрын
Thank you! You too!
@stephencross4978 Жыл бұрын
Wow, this is clever and exactly what I needed. My mind is blown !!
@MyOnlineTrainingHub Жыл бұрын
Awesome. Glad I could help 😊
@julianstarkey93014 жыл бұрын
Very helpful, a lot less complicated excel formulas in my life now, shame that challenge has gone but I had to think a lot about my queries.
@MyOnlineTrainingHub4 жыл бұрын
Don't be sad that the challenges have gone...there are plenty of new challenges awaiting; M code, DAX, dynamic array functions :-)
@fabio.s.barbosa3 жыл бұрын
Wonderful tutorial! That was exactly what I was looking for. I was duplicating data sources for each week to scrape some web data. Thanks a lot!
@MyOnlineTrainingHub3 жыл бұрын
So pleased it helped, Fabio!
@shakiraasfoor75994 жыл бұрын
Well Done Mynda All Your Videos Are Useful
@MyOnlineTrainingHub4 жыл бұрын
Cheers, Shakira!
@bryandadiz56772 жыл бұрын
The website is no longer updated.
@valentecg85183 ай бұрын
I really appreciate your tutorial! Money saver! Most data extraction tools are costly.
@MyOnlineTrainingHub3 ай бұрын
Glad it was helpful!
@peimanhosseini37 Жыл бұрын
Thanks a lot, that was really, really useful. You solved a very big problem for me. 🙏🙏🙏🙏🙏🙏
@MyOnlineTrainingHub Жыл бұрын
Awesome to hear! 😊
@machadolopes2 жыл бұрын
Amazing how it is easy to scrape web pages. Thanks for this excellent tutorial.
@MyOnlineTrainingHub2 жыл бұрын
Glad you like it, Marcel!
@iankr8 ай бұрын
Brilliant! Many thanks, Mynda.
@MyOnlineTrainingHub8 ай бұрын
Cheers, Ian!
@nazaarshadir3 жыл бұрын
Another great lesson. I have a website with unstructured data for many items. I need specific values for each item from the site. Please, how may I do it automatically and quickly. cftc .gov/dea/futures/deacmesf . htm I only need LONG and SHORT value for each code. Thanks.
@MyOnlineTrainingHub3 жыл бұрын
Great to hear, Nazaar! The URL provided isn't right. Please post your question and sample Excel file on our forum where we can help you further: www.myonlinetraininghub.com/excel-forum
@nazaarshadir3 жыл бұрын
@@MyOnlineTrainingHub thanks for the quick reply. I just joined the forum. Your forum is clean and organized. Looking forward to learning more. Thanks.
@powerb_i2 жыл бұрын
Great video thanks this makes web scraping a lot easier. Thank you.
@MyOnlineTrainingHub2 жыл бұрын
Great to hear!
@eslamfahmy8710 ай бұрын
Thank you. One more thing: what if my pages contain PDF files and I need to add another column that holds each PDF as a clickable link?
@NadeemShafiqueButt Жыл бұрын
As always, an excellent tutorial
@MyOnlineTrainingHub Жыл бұрын
Glad you liked it!
@jbjs58202 жыл бұрын
Excellent work. just a question, when i try to refresh it in the system it doesn´t allow. indicates "This dataset includes a dynamic data source. Since dynamic data sources aren't refreshed in the Power BI service, this dataset won't be refreshed", any workaround?
@StephenMattison663 жыл бұрын
Great info, easy to understand. TYVM! I'd love to learn how to do all of this in Google Sheets. Power Query sounds cool!
@MyOnlineTrainingHub3 жыл бұрын
Glad you liked it, Stephen! Sheets doesn't have Power Query.
@naveedkhowaja4089 Жыл бұрын
Excellent tutorial, super easy to follow. That’s brilliant 👍
@MyOnlineTrainingHub Жыл бұрын
Glad it was helpful! 🙏
@mariaalcala51593 жыл бұрын
Wow amazing what you can do! Thanks a lot mynda I’m always learning from you!
@MyOnlineTrainingHub3 жыл бұрын
So pleased to hear that, Maria!
@03mariadelmar2 жыл бұрын
Hi! Your tutorial is very clear. However, what if the web page you are trying to access needs your credentials first? Do you know how I can go around that? Thank you!
@ritvikbolugudde86882 жыл бұрын
Thanks a lot!! I was wondering: if the web page is updated, would the loaded data in Power BI update too (so basically, is it real time or not)?
@MyOnlineTrainingHub2 жыл бұрын
Only direct query datasets can refresh real time, however, you can schedule refreshes at set intervals.
@austinbright-j3o4 ай бұрын
Can you get around captchas for more advanced stuff?
@MyOnlineTrainingHub4 ай бұрын
Not captchas, AFAIK.
@AnonymousHunYaar2 жыл бұрын
Marvelous! You make it so much easier. Thanks a lot.
@MyOnlineTrainingHub2 жыл бұрын
Thank you! Glad to hear that!
@WeKnowIt1003 жыл бұрын
Hi! I have encountered a login page before the page that I need to scrape. Is there any way I can bypass that page or key in the credentials?
@iliyatsekov60442 жыл бұрын
Many thanks for the video! What if I have two variable names? My URL includes both a year and a quarter. I created the two variable names but how do I invoke the function to take all quarters from every year?
@MyOnlineTrainingHub2 жыл бұрын
Make a table containing the string made up of the quarter and year components and whatever other characters form that section of the URL, and feed that into a single variable.
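One way to build that table, sketched under the assumption that the URL section looks like "2021/Q1". The "/Q" separator and the sample years are placeholders, so use whatever characters the real URL contains.

```powerquery
let
    Years = {2021, 2022},
    Quarters = {1, 2, 3, 4},
    // Build one text value per year/quarter pair, e.g. "2021/Q1"
    Periods = List.Combine(
        List.Transform(
            Years,
            (y) => List.Transform(Quarters, (q) => Text.From(y) & "/Q" & Text.From(q))
        )
    ),
    // A one-column table whose values can be fed to a single-parameter function
    PeriodTable = Table.FromList(Periods, Splitter.SplitByNothing(), {"Period"})
in
    PeriodTable
```

Invoking the page function on the Period column then covers every quarter of every year with one variable.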
@darrylmorgan4 жыл бұрын
Hi Mynda!Great Tutorial,Just Learnt Something New So I Can Have More Fun With POWER BI..Thank You :)
@MyOnlineTrainingHub4 жыл бұрын
Great to hear you found it useful, Darryl!
@ssomtom2 жыл бұрын
Beautiful. It's solved my actual problem. Thx. :)
@MyOnlineTrainingHub2 жыл бұрын
Great to hear!
@carltonquine92774 жыл бұрын
Wow you're amazing! Can't believe this information is free! Thank you so much!
@MyOnlineTrainingHub4 жыл бұрын
You're most welcome, Carlton!
@m_shakes3 жыл бұрын
Amazing video and awesome ideas that I incorporated instantly! Quick question, how would you go about making each "page" into a separate query (each page a query on its own)?
@MyOnlineTrainingHub3 жыл бұрын
Glad you liked it, Mohammed! To make each page a separate query, you'd have to create them one by one by pasting in the URL for each page, or copying the query and modifying the URL to point to a different page.
@m_shakes3 жыл бұрын
@@MyOnlineTrainingHub Thanks for your prompt reply!
@kulern983 жыл бұрын
Is it possible to use requests or something in Power Query to get the string needed for the next page in order to get all Pages? The pagination is not linear or in a pattern.
@kulern983 жыл бұрын
The URL in itself is the same as for your URL, with “start=“.
@MyOnlineTrainingHub3 жыл бұрын
You'd need to know what the next page is in order to provide it to Power Query. It's not something Power Query can request.
@jayli3291 Жыл бұрын
Great resource! I am curious about replacing 1 with "&PageStart&". Can you explain why we use the double quotes coupled with the double ampersand? Which language/grammar are we following here, M or HTML or something else? I just wanted to learn more coding rules so I can crack the query more freely. I would appreciate any help you could provide.
@jayli3291 Жыл бұрын
I guess I figured it out. We are just concatenating the opening " with PageStart and then with the closing "; the & works as the concatenation operator. And because PageStart is a text variable, we need to put it inside the double quotation marks.
@MyOnlineTrainingHub Жыл бұрын
You got it 👍
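A small sketch of that concatenation, using a placeholder URL. As far as I understand M, the quotes are not wrapped around PageStart itself: the first " closes the text literal before the variable and the second " opens the next literal, with & joining the three pieces.

```powerquery
let
    PageStart = "21",   // text value supplied to the function
    // Each quoted piece is a text literal; & joins the pieces together
    Url = "https://example.com/catalogue?start=" & PageStart & "&sort=title",
    // A number must be converted first, because & does not join text and number
    PageNumber = 21,
    UrlFromNumber = "https://example.com/catalogue?start=" & Text.From(PageNumber) & "&sort=title"
in
    {Url, UrlFromNumber}
```

Both items in the returned list hold the same address, which shows that the result depends only on the pieces being text.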
@marosbrezovsky7512 жыл бұрын
Assuming the bookstore adds more pages over time, how can I set it up so that the query checks all available pages? I can't do that at the moment, because when it checks URLs that don't exist yet, the scraping procedure stops. Is it possible to fix this somehow?
@MyOnlineTrainingHub2 жыл бұрын
You can build in some error handling so that when it gets to a page that doesn't exist it doesn't break the code: kzbin.info/www/bejne/f6bSXpyKpZ2UfMU
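A rough sketch of that kind of error handling, not necessarily the method in the linked video. GetPage is a hypothetical stand-in that fails for pages above 3; a real version would call the page function from the tutorial.

```powerquery
let
    // Hypothetical stand-in for the page function: pages 1 to 3 exist, the rest fail
    GetPage = (page as number) as table =>
        if page <= 3 then #table({"Page"}, {{page}}) else error "Page not found",
    Pages = {1..5},
    // try ... otherwise swaps a failed page for null instead of stopping the query
    Results = List.Transform(Pages, (p) => try GetPage(p) otherwise null),
    // Drop the failed pages and append the rest
    Valid = List.RemoveNulls(Results),
    Combined = Table.Combine(Valid)
in
    Combined
```

With this pattern you can request more pages than currently exist and the missing ones are skipped.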
@仁です Жыл бұрын
It's useful. Thank you. I am looking for similar data scraping software. Would you mind showing me how to work with Power BI when scraping different websites, please?
@charlesmcdermott2824 жыл бұрын
Awesome! I managed to import a table for 1 page from a URL. It is a list of books unfortunately the number of books per web page varies. Is there a way to handle the issue of generating each page number in this case? As a backup is there a method of exporting all pages to a csv file and Load & Transform the csv back into PBI or PQ?
@MyOnlineTrainingHub4 жыл бұрын
Glad it was useful, Charles. In terms of figuring out the number of items on a page, I'm not sure there's any way to do that in advance of accessing the pages. Whether there's a way to export the pages to a csv file would be down to that website and whether it offers that as an option. It's not something Power Query can do.
@eo49222 жыл бұрын
Incredible overview, thank you so much! Is it possible to do this if you have a site with multiple pages that uses the same URL? I'm trying to scrape data from a public site with multiple pages, but all of them use the same URL - there are no unique identifiers (e.g. page numbers). Any assistance would be greatly appreciated.
@MyOnlineTrainingHub2 жыл бұрын
Glad it was helpful. Unfortunately, if the site's URL doesn't change, then you can't scrape the data with Power Query.
@eo49222 жыл бұрын
@@MyOnlineTrainingHub Understood. Could you recommend any other options that may be helpful? Thank you in advance.
@MyOnlineTrainingHub2 жыл бұрын
Only to say that if you know JavaScript (I don't) you can write some code to change the 'page' displayed so you can get the data.
@k00233824 жыл бұрын
Hi, following your example, if you want to store the book title hyperlink, how do you get that? At the moment it stores the title as text only. Regards
@MyOnlineTrainingHub4 жыл бұрын
You can do this in Power BI: docs.microsoft.com/en-us/power-bi/create-reports/power-bi-hyperlinks-in-tables
@youse33 ай бұрын
Thank you so much for this video. what if we have "read more" instead of page numbers ?
@MyOnlineTrainingHub3 ай бұрын
Power Query typically can't see 'read more' information unless it's already in the page HTML. If it's generated using JavaScript, then you can't scrape it.
@webscrapingseniors2 ай бұрын
Power Query can struggle with data generated by JavaScript after the initial page load. In such cases, consider using a web scraping tool like Selenium, which can handle JavaScript and interact with 'read more' buttons to load additional content. This way, you can extract all the necessary information from the page. Let me know if you need more guidance!
@Chriiichriii3 жыл бұрын
Exactly what I was looking for, thanks ! great video
@MyOnlineTrainingHub3 жыл бұрын
Glad you found it helpful 😊
@calleranchero32123 жыл бұрын
The webpage I am trying to query frequently changes. When refreshing the table, is it possible to maintain the historical data while also pulling in the new information?
@MyOnlineTrainingHub3 жыл бұрын
You would have to use VBA to automate taking a copy of the data before refreshing the query.
@manuelsanchezmiranda14583 жыл бұрын
Is it possible to scrape the data within the listing page (visible in a separte URL once you click on each item), and not only the data listed in the search results?
@MyOnlineTrainingHub3 жыл бұрын
You’d have to enter the URL of the listing pages to do this.
@mohamedadjal85023 жыл бұрын
Hi, Professor, you have provide in a lot of effort for these videos, thank you, I have a question in excel, if we have for example in cell "a1" the number 10.00 m, how to have this number with the same format in cell "b1 "using a text function or some other function, thank you very much.😃👍
@MyOnlineTrainingHub3 жыл бұрын
Thanks, Mohamed! I'm presuming the value in A1 is a number, in which case you can use this formula: =TEXT(A1,"##.00,,\M")
@mohamedadjal85023 жыл бұрын
Good evening Professor, thank you for the answers that you sent me online. May God protect you. Suppose we have in cell A1 = "excel", in cell A2 = "is", in cell A3 = "fun", in cell B1 = 12.00m, in cell B2 = 10.00gr, in cell B3 = 15.00kg, and we use the VLOOKUP function: =VLOOKUP("is",$A$1:$B$3,2,FALSE). The result is 10, which means that this function didn't give me the full format of the number in cell B2 (B2 = 10.00gr). My goal is to get 10.00gr and not 10. Thank you so much.😃👍
@MyOnlineTrainingHub3 жыл бұрын
Please post your question and sample Excel file on our forum where we can help you further: www.myonlinetraininghub.com/excel-forum
@mohamedadjal85023 жыл бұрын
@@MyOnlineTrainingHub Good morning Professor, I have emailed you an excel file containing a question and comment on the question, thanks a lot for the help.
@wayneedmondson10654 жыл бұрын
Hi Mynda.. another great example and technique. Thanks for sharing it :)) Thumbs up!! PS - Any idea when the Add Table Using Examples feature will come to Power Query in Excel in Microsoft 365?
@MyOnlineTrainingHub4 жыл бұрын
Thanks, Wayne! No idea when Excel will get Add Table Using Examples :-( it has been available in Power BI for quite a while now, but that doesn't seem to mean anything.
@DLHSuper4 жыл бұрын
Hi Mynda, your videos are teaching me so much... Is there a way to scrape a website that only works in Google Chrome or Firefox? Unfortunately the website I need to scrape doesn't work in IE.
@MyOnlineTrainingHub4 жыл бұрын
Pleased to hear that! If the website doesn't work in IE then you should still be able to scrape it, but you might not be able to see the preview and use the 'from example' tool. Power Query is looking for HTML tags in the web page source code, so as long as your data is stored in these, Power Query can find it. If it's tables generated using JavaScript, then you can't easily get the data using Power Query unless you know how to write JavaScript!
@DLHSuper4 жыл бұрын
MyOnlineTrainingHub thank you for explaining this, I'll have another go today 😊
@lindalai14063 жыл бұрын
Thank you very much for bringing this brilliant video. I do have a question, if I am not used to using Power BI and still want to use excel to extract web data like you do in this video, how do I do that?
@MyOnlineTrainingHub3 жыл бұрын
Hi Linda, Power Query in Excel doesn't have 'from web by example'. Your best option is to use 'From Web', but that will require the data in the web page to be stored in a HTML table. You'll know if it is, because you'll be able to see the table in the preview in Power Query.
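For reference, the M behind Excel's 'From Web' usually looks something like the sketch below. The URL is a placeholder, and which row of the result holds the table you want varies from page to page, so the index 0 is only an assumption.

```powerquery
let
    // Placeholder URL; replace with the page you want to read
    Source = Web.Page(Web.Contents("https://example.com/some-page")),
    // Web.Page returns one row per HTML table it finds; take the Data of the first
    FirstTable = Source{0}[Data]
in
    FirstTable
```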
@lindalai14063 жыл бұрын
@@MyOnlineTrainingHub Thank you very much for your prompt response.
@StephenMattison663 жыл бұрын
I need to scrape data from a map page that shows thousands of map-pins that each lead to the contact data that I need. Do you have a video already showing that? Any suggestions? TYVM!!
@MyOnlineTrainingHub3 жыл бұрын
No examples of that. Unless the map data is stored in a table in the web page HTML then you won't be able to scrape it with Excel. You could try Power BI to get data by example: www.myonlinetraininghub.com/power-query-get-data-from-web-by-example
@chrism90374 жыл бұрын
Super cool video, thanks Mynda
@MyOnlineTrainingHub4 жыл бұрын
Cheers, Chris!
@petermcallister9082 жыл бұрын
Great tutorial! Helped me a lot. But do you have any idea, why "Add Table Using Examples" won't work and throws this message: "This Stencil app is disabled for this browser"?
@MyOnlineTrainingHub2 жыл бұрын
Never heard of that before, Peter. It sounds like you're trying to use Power Query online because there's reference to a browser.
@jamesflieder81644 жыл бұрын
Great video and so clear with the explanation! My researching will be much easier now!
@MyOnlineTrainingHub4 жыл бұрын
So pleased it was helpful 😊
@wrandyrice54473 жыл бұрын
Mind blown. This is awesome. Thank you.
@MyOnlineTrainingHub3 жыл бұрын
Glad you liked it 😊
@PowerProd4 жыл бұрын
Hi ! Do you know if you can add a proxy list so your IP isn't blocked and rotates to avoid captcha ?
@MyOnlineTrainingHub4 жыл бұрын
Not that I know of.
@PowerProd4 жыл бұрын
Ok thanks ! Maybe you can retrieve data via python and include a proxy list in the process. Thoughts ?
@TheJoshtheboss4 жыл бұрын
I was having a few difficulties and got a syntax error when adding the variable into the function and got stuck. The website I was working with does not have the string "=1" but instead uses a suffix 2017, 2018 and so on at the end of the web address. You mention a list of textstring variables can be imported. What exactly do you mean? Sorry I am a little confused and new to this. Been watching your vids, they are awesome. I am surprised how much better PowerBI is at grabbing web data via Power Query than Excel Power Query. Excel cannot even recognise the data on the page while PowerBI no problem. Also, can the finished query be then saved and imported to Excel to work with the data? Sorry, too many questions, my mind is too stimulated from your video :)
@MyOnlineTrainingHub4 жыл бұрын
Hi Josh, what I meant by text string variables was if your URL had a variable for say, country, you could feed it a list of country names which are text strings. Yes, Power Query in Excel is a bit behind on functionality. New features come to Power BI first, then Excel later.
@TheJoshtheboss4 жыл бұрын
@@MyOnlineTrainingHub I keep getting syntax error for some reason. I thought good data exercise was to scrape past lotto numbers. The suffix is consistent e.g. 2005, 2006. But I can't get to work :(. I am completely new to PowerBI syntax and code. Btw, do you have a donation link? Your online support and presence is really good. www.euro-millions.com/results-history-2004
@MyOnlineTrainingHub4 жыл бұрын
Hi Josh, That page doesn't store the data in HTML tables, which is what Excel's Power Query requires in order to scrape the data, so while the URL has parameters, the page content isn't suitable. However, you can try using Power BI and the 'add table using examples' option to locate the data on the page. Thanks for the donation link request. We don't have one, but you can show your appreciation by sharing my channel on your social media.
@TheJoshtheboss4 жыл бұрын
@@MyOnlineTrainingHub No problem at all. Will gladly share on LinkedIn. I think I needed to have extra double quote to fix the syntax error i.e. source = Web.BrowserContents("www.euro-millions.com/results-history-"&PageStart&"")
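A sketch of how a list of text values can drive the URL. BuildUrl is a hypothetical helper, and the https:// prefix and the sample years are assumptions. Note that a trailing `& ""` only appends an empty string, so the fix was more likely down to the quotes being balanced.

```powerquery
let
    // Hypothetical helper: builds the URL for one year
    BuildUrl = (PageStart as text) as text =>
        "https://www.euro-millions.com/results-history-" & PageStart,
    Years = {"2004", "2005", "2006"},
    YearTable = Table.FromList(Years, Splitter.SplitByNothing(), {"Year"}),
    // Invoke the function once per row, as Invoke Custom Function does
    WithUrl = Table.AddColumn(YearTable, "Url", each BuildUrl([Year]), type text)
in
    WithUrl
```

In a real query the URL would then be passed to Web.BrowserContents, as in the line above.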
@ayeshaibraheem42173 жыл бұрын
which office version are you using? i am using office 365 and it doesn't give the option of add by example. Also the data i want is not available in tables as well as in web view.
@MyOnlineTrainingHub3 жыл бұрын
In this video I'm using Power BI Desktop. The version of Power BI is not tied to your Office version. You may need to download the latest version of Power BI Desktop.
@gzfraud Жыл бұрын
I can't get PQ or BI to extract embedded URL in a webpage table. eg email is embedded in person's name? Any ideas?
@MyOnlineTrainingHub Жыл бұрын
Hard to say. Please post your question and sample Excel file on our forum where someone can help you further: www.myonlinetraininghub.com/excel-forum
@adamsteele442 жыл бұрын
Wow. Amazing video, thank you!
@MyOnlineTrainingHub2 жыл бұрын
Thanks so much, Adam!
@parvez3014 жыл бұрын
First comment. thanks for the video
@MyOnlineTrainingHub4 жыл бұрын
Winner :-) hope you found it useful, Anwar.
@rodrigomoro80473 жыл бұрын
Dear Mynda! Thank you so much for this video. Could you please share with us how we can do the following: I have a web-based database that is constantly fed. Today it has 300 items and 15 items per page, so 20 pages. But next week this database may have 600 items and therefore 40 pages. How can I automate the function to identify the total number of pages each time it accesses the web data source? Thank you!
@MyOnlineTrainingHub3 жыл бұрын
Good question, Rodrigo. I see you've also posted this question on our blog post. We'll answer it there as that will be more helpful to others.
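One possible approach, which may differ from the answer on the blog, is to keep requesting pages until one comes back empty. The sketch below uses List.Generate with a hypothetical GetPage stand-in that returns data for pages 1 to 3 and an empty table after that.

```powerquery
let
    // Hypothetical stand-in: pages 1 to 3 have one row each, later pages are empty
    GetPage = (page as number) as table =>
        if page <= 3
        then #table({"Item"}, {{"Item " & Text.From(page)}})
        else #table({"Item"}, {}),
    Pages = List.Generate(
        () => [Page = 1, Data = GetPage(1)],                   // start at page 1
        each Table.RowCount([Data]) > 0,                       // stop at the first empty page
        each [Page = [Page] + 1, Data = GetPage([Page] + 1)],  // move to the next page
        each [Data]                                            // keep only the page table
    ),
    Combined = Table.Combine(Pages)
in
    Combined
```

This only works if the site returns an empty result for a page past the end; if it returns an error instead, a `try ... otherwise` wrapper would be needed.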
@rodrigomoro80473 жыл бұрын
@@MyOnlineTrainingHub thank you so much!
@iwcik3 жыл бұрын
@@rodrigomoro8047 Hi Rodrigo, could you please share the link with the reply to your question?
@louielouie95023 жыл бұрын
I'm currently new to this stuff. I see that you might be able to customize queries for specific data scraping tasks. I'm interested in learning ethical data scraping techniques. How would it be possible to create custom scraping software? What computing language would you recommend learning in that case?
@MyOnlineTrainingHub3 жыл бұрын
I can't answer that question, Louie. I know nothing about creating custom software.
@louielouie95023 жыл бұрын
@@MyOnlineTrainingHub Thanks Anyway
@ДенисДементьев-т3о2 жыл бұрын
Great video! Extremely useful It works in my case, but only for first 19 sheets out of 89. Starting from 20th sheet i get a blank page without any data, however i can see pages from range 20 to 89 via browser. I would appreciate if you show how many pages could be exported in your exact example
@MyOnlineTrainingHub2 жыл бұрын
Sounds like the web site is throttling the feed so you can't get the data. Not much you can do about this, other than try splitting the task into multiple queries and run them one at a time.
@gzfraud Жыл бұрын
@@MyOnlineTrainingHub Good News 1: solving the throttle problem. When PQ and BI won't work, I use Instant Data Scraper. It's a free Chrome extension and works 95% of the time. It lets you set a time delay before going to the next page. I usually start at 12 seconds, then decrease the delay by 1 second every 100 pages or so, down to about 4 or 5 seconds. The most I've ever done is scrape more than 40,000 pages on one website. It scrapes only when the webpage is active, so if you navigate to a different tab it pauses. To restart scraping, simply make that page active, i.e. displayed, and click Start Scraping. To prevent pausing, simply drag the webpage out into a standalone window before starting IDS. Good News 2: it does something that PQ and BI don't do. It extracts embedded URLs. Say email addresses are embedded in people's names. PQ and BI will import the names (as plain text), but I've never figured out how to get them to extract the embedded email address. IDS does extract the embedded URL. Bad News: IDS doesn't connect to the website, so you can't "refresh" the query like you can with PQ and BI.
@stevewilson15442 жыл бұрын
Good afternoon, i followed your instructions however instead of producing the results from the subsequent URL pages it just mirrored the results from the first page. Any ideas? Thanks, Steve
@MyOnlineTrainingHub2 жыл бұрын
Hi Steve, sounds like the page number isn't being changed with each iteration. Hard to say more without seeing the file.
@stevewilson15442 жыл бұрын
@@MyOnlineTrainingHub thanks for you help. All rectified now.
@robertcameron42883 жыл бұрын
Hey I have a question If I data scrape from a site that is a paid subscription when the subscription ended does the data still save in your excel or will it say #ref
@MyOnlineTrainingHub3 жыл бұрын
Hi Robert, the data will stay in the file. Just be sure not to refresh the query because if it tries to get the data again it will presumably return an error and then the original data will be overwritten.
@robertcameron42883 жыл бұрын
@@MyOnlineTrainingHub awesome thank you!!
@nurezzati98882 жыл бұрын
Hi Mynda. Thank you for sharing it. Very useful. However, Is there any way to get the actual URL since the position keeps changing whenever I refresh data in Power BI.
@MyOnlineTrainingHub2 жыл бұрын
You can use the actual URL if it's more suitable for your scenario.
@Iamthatmike 2 years ago
This is a great tutorial! I was able to connect to one table but my second attempt isn't working. The table's data isn't in a specific order so when I go from page 1, 2, 3 etc. it loads them randomly each time and because I'm not on the actual website it's treating each individual page as a completely new query. The order is random so an individual record might be randomly pulled for the page 1 query then again on page 3 leading to duplicates and records that never populate on the page. Any idea how to work around this issue? Thanks again!
@MyOnlineTrainingHub 2 years ago
Please post your question and sample Excel file on our forum where we can help you further: www.myonlinetraininghub.com/excel-forum
@Iamthatmike 2 years ago
@@MyOnlineTrainingHub thank you so much! I posted the topic in the Power Query forum.
@bali501 2 years ago
Thank you soooo much! You changed my life this weekend. I've been struggling with Excel's limitations for years, and have lost countless hours of my life, sometimes without even accomplishing my goal. I only discovered the existence of Power Query last night with your video, and you blew my mind. A brilliantly well presented and comprehensive video on it too! It got me partway through my current problem, but now I'm stuck again, if you can help? I've created Query1 to get multiple tables from each webpage, with 10 records each, including a record ID. But each record has a link to a details page with more info for that record. The record ID is used within the URL string to get those details. Can I create a single query that collects the list of records and uses the ID to also collect the details for each record, all in one go? Also, with 30,000 records in total, it takes hours to refresh. However, as the historic records don't change and have a historic date of filing, is there any way for future updates to get only the latest records (those with a filing date after the last date in the previous dataset), remove any duplicates, and append them to the list? Finally, it would be great if a timestamp could be added in an additional column to show the date when the query was run, so that I can easily see which data was added and when. Is any of this possible with Power Query?
@MyOnlineTrainingHub 2 years ago
So pleased that my video was helpful! Please post your questions and sample Excel file on our forum where we can help you further: www.myonlinetraininghub.com/excel-forum
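For readers following along, the logic asked about above (keep only records filed after the last date already loaded, skip duplicates, and stamp each new row with the run date) can be sketched in plain Python. This is an illustration of the logic only and says nothing about how Power Query would implement it. The field names `id`, `filed`, and `loaded_on` are assumptions.

```python
from datetime import date


def append_new_records(existing, fetched, run_date):
    """Append fetched records filed after the newest existing record.

    Records are plain dicts with hypothetical keys "id" and "filed".
    New rows get a "loaded_on" stamp; duplicates (by id) are skipped.
    """
    last_filed = max((r["filed"] for r in existing), default=date.min)
    known_ids = {r["id"] for r in existing}
    merged = list(existing)
    for record in fetched:
        if record["filed"] > last_filed and record["id"] not in known_ids:
            merged.append({**record, "loaded_on": run_date})
            known_ids.add(record["id"])
    return merged


existing = [{"id": 1, "filed": date(2022, 1, 5), "loaded_on": date(2022, 1, 6)}]
fetched = [
    {"id": 1, "filed": date(2022, 1, 5)},  # already loaded, skipped
    {"id": 2, "filed": date(2022, 2, 1)},  # new, appended
    {"id": 2, "filed": date(2022, 2, 1)},  # duplicate within the fetch, skipped
]
result = append_new_records(existing, fetched, date(2022, 2, 2))
print(len(result))
```

One design caveat: the strict "filed after the last date" comparison would miss a record filed on the same day as the newest existing one but published after the previous run, so a real refresh may need to re-check that final day.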
@miguelcalejo1770 4 years ago
Hi Mynda, I'm trying to scrape data from a web page that displays the top 10 items, with a "View more" button to see more items (up to 100). I click the button to expand the list, but Power Query only scrapes the top 10 and shows "View More" as a row. Is there a solution for this? Thanks
@MyOnlineTrainingHub 4 years ago
Hi Miguel, it sounds like the page is using JavaScript or similar to load more data and that data isn't stored in the web page HTML, which is why Power Query can't see it. I'm not sure there's a workaround for this, sorry.
@miguelcalejo1770 4 years ago
@@MyOnlineTrainingHub Thanks Mynda
@MrOktovan 4 years ago
What a great tutorial. I've tried it and it works! However, when I publish the report to the Power BI service and set up a scheduled refresh, I get a failure notification saying the automatic refresh failed for dynamic data. Did you also experience this?
@MyOnlineTrainingHub 4 years ago
Currently scheduled refreshes for queries where the data source is part of a function aren't supported. You should be able to manually refresh though.
@abcVegeBreads 1 year ago
Is it possible to scrape the URL of each individual book? If yes, how can it be done?
@sadinenim5360 2 years ago
Can you do a video on how to scrape data from a portal after logging in with our credentials?
@prameelar1753 2 years ago
Hi, I've followed your steps and tried downloading web data from multiple pages. However, I only got data for 68 pages out of 83. I have no idea what I missed. Could you please help me with this?