How to scrape website links with Power Query

  Рет қаралды 15,071

Curbal

Curbal

Күн бұрын

Пікірлер: 60
@GabeTek
@GabeTek 2 жыл бұрын
I saw the post from Chris Webb but I didn't get it right away. You just made it so clear! Thank you!
@CurbalEN
@CurbalEN 2 жыл бұрын
Fab!!
@richweinberg6355
@richweinberg6355 4 жыл бұрын
This worked great. Thanks. At first this didn't work for me, but it was because the links I wanted were only relative links, meaning they didn't contain the website address or . I modified the text from: {"Link", "a[href^=""http""]", each [Attributes][href]} to {"Link", "a[href]", each [Attributes][href]}
@peterkrkos6489
@peterkrkos6489 4 жыл бұрын
Many thanks, this helped me
@Chriiichriii
@Chriiichriii 3 жыл бұрын
Same issue, thanks you bro !
@EricaDyson
@EricaDyson 5 жыл бұрын
Well a big thank you to both of you! Never would have discovered this without you!
@CurbalEN
@CurbalEN 5 жыл бұрын
I was lucky I saw his code :) /Ruth
@paspuggie48
@paspuggie48 4 жыл бұрын
I'm sure I commented but I can't see it. Anyway this is brilliant Ruth and had read Chris Webbs post before but your videos makes it easier. I managed to extract 140 URLs from one webpage...Cool!
@CurbalEN
@CurbalEN 4 жыл бұрын
It definitely is! 🥳🥳
@paspuggie48
@paspuggie48 4 жыл бұрын
@@CurbalEN I had to use Rich Weinberg's edit version but worked a charm. Does not work on some webpages I've also tried though. Still, it has helped wonderfully Ruth :)
@CurbalEN
@CurbalEN 4 жыл бұрын
Oh, maybe chris webbs trick doesnt work on all sites. Glad there is a fix out there :)
@paspuggie48
@paspuggie48 4 жыл бұрын
@@CurbalEN Well, ironically the webpage I tried was Power Query's own Function Reference page, even with Rich's solution I couldn't get it to extract the url's LOL
@CurbalEN
@CurbalEN 4 жыл бұрын
😂😂 Check the html code, it is there somewhere, or see if it follows a structure you can replicate manually.
@diegolozano2397
@diegolozano2397 3 ай бұрын
OMG, Thanks,, Thanks.....sooooooo usefull video
@nikakalichava8012
@nikakalichava8012 Жыл бұрын
very useful, thanks,
@ayobamiologun9184
@ayobamiologun9184 5 жыл бұрын
Cool trick... Thank you Ruth.
@CurbalEN
@CurbalEN 5 жыл бұрын
:) /Ruth
@ezequielellena
@ezequielellena 4 жыл бұрын
Hello, what KPI or analysis could be done with that table. I have the same case and I don't know what to analyze.
@chamelion100
@chamelion100 5 жыл бұрын
You are a great help mam............thanks a ton
@CurbalEN
@CurbalEN 5 жыл бұрын
My pleasure , happy it is useful :) /Ruth
@AnalyzeIt_Nael
@AnalyzeIt_Nael 5 жыл бұрын
This is an awesome video. Thank Ruth
@CurbalEN
@CurbalEN 5 жыл бұрын
Thanks! /Ruth
@miguelcalejo1770
@miguelcalejo1770 4 жыл бұрын
I'm trying to scrape data from a web page which displays the 10 top items and there's a button "View more" to see more items (till 100) which I open, but powerquery only scrap the top 10 and displays "View More" as a line. Is there a solution for this? Thanks
@bpdeepakbp
@bpdeepakbp 5 жыл бұрын
Thank you for this video. this is really good one
@CurbalEN
@CurbalEN 5 жыл бұрын
Yey! Happy it was helpful:) /Ruth
@oshiowueme254
@oshiowueme254 2 жыл бұрын
hi, if the column1 could not extract the exact text, how could I do, if I split the column1 after fisrt step?
@MoviFrenzy
@MoviFrenzy 3 жыл бұрын
Hi Ya plz help me the web page scraping which has credential page
@usmanmirza8966
@usmanmirza8966 Жыл бұрын
I am trying to do the same but the navigator doesn't load the entire page as there is a script arrow at the bottom of the page hence showing only first 40 rows. Please advise.
@iyasabdulrahman
@iyasabdulrahman 5 жыл бұрын
Breliant, thanks Ruth, 👍👍👍
@CurbalEN
@CurbalEN 5 жыл бұрын
🎉🎉🎉 /Ruth
@neerajahirwar8600
@neerajahirwar8600 2 жыл бұрын
Can i get the link of all PDF using same steps
@alreadydeadfunk
@alreadydeadfunk 5 жыл бұрын
When I saw the title of the video (before watching it), I thought about Chris' blog, too.
@CurbalEN
@CurbalEN 5 жыл бұрын
He is the best :) /Ruth
@TheVamos777
@TheVamos777 5 жыл бұрын
Same here
@Softwaretrain
@Softwaretrain 5 жыл бұрын
You are amazing...
@CurbalEN
@CurbalEN 5 жыл бұрын
:) /Ruth
@jerrykappa6899
@jerrykappa6899 5 жыл бұрын
Awesome 😎
@CurbalEN
@CurbalEN 5 жыл бұрын
He is :) /Ruth
@TheWandererRises
@TheWandererRises 2 жыл бұрын
Excelente!! Graciaaas!!
@CurbalEN
@CurbalEN 2 жыл бұрын
💃💃💃
@walidkhlil2872
@walidkhlil2872 5 жыл бұрын
thanks again and again
@CurbalEN
@CurbalEN 5 жыл бұрын
:) /Ruth
@CeliaAlvesSolveExcel
@CeliaAlvesSolveExcel 5 жыл бұрын
Beautiful!!
@CurbalEN
@CurbalEN 5 жыл бұрын
Simplicity is always best :) /Ruth
@cokemaya
@cokemaya 5 жыл бұрын
Awesome tips as usual! I was thinking about going a level deeper in the hierarchy and scrapping those URL's, for instance, creating a table from a webside "www.example.com/Address/number" and then create a function that calls the column from the previous table which contains the hyperlinks with different "Addresses" & "numbers". Obviously they have to share the same structure to allow me to put out the same data. " www.example.com/"&Column_with_URL's&" Well I do not known if that make sense, but it would be great to learn that. Congrats for you channel!
@CurbalEN
@CurbalEN 5 жыл бұрын
Yes, like this? m.kzbin.info/www/bejne/bIG9hI1raM6GpdU /Ruth
@wimvanderlinden3080
@wimvanderlinden3080 5 жыл бұрын
Where can i load these pbix files? Very nice film!!
@CurbalEN
@CurbalEN 5 жыл бұрын
If you go to curbal.com > Resources > download center and then Community downloads, you can get the filés from there. /Ruth
@wimvanderlinden3080
@wimvanderlinden3080 5 жыл бұрын
@@CurbalEN no 140?
@CurbalEN
@CurbalEN 5 жыл бұрын
Exactly!
@sheilahart5989
@sheilahart5989 5 жыл бұрын
Very cool
@CurbalEN
@CurbalEN 5 жыл бұрын
❤️ /Ruth
@CPatwal
@CPatwal 5 жыл бұрын
Thanks Ruth, will the website owner know that his/her/their website was scraped ? I mean I would want to just avoid getting into trouble :-)
@CurbalEN
@CurbalEN 5 жыл бұрын
It is my website :) /Ruth
@bartdepauw1156
@bartdepauw1156 4 жыл бұрын
No, that can't be seen, as the links are obtained from the (already downloaded) page. If you access each individual link, that can be seen in the web server log.
@CurbalEN
@CurbalEN 4 жыл бұрын
Better answer, thanks!
@kaustubhk8648
@kaustubhk8648 4 жыл бұрын
subtitles destroying video
@elchin9748
@elchin9748 3 жыл бұрын
Nothing special. The most important is how to get data from each scraped link (and you can do it with custom function)
Differences between Power BI Apps and Workspaces
9:28
Curbal
Рет қаралды 27 М.
Cat mode and a glass of water #family #humor #fun
00:22
Kotiki_Z
Рет қаралды 42 МЛН
黑天使只对C罗有感觉#short #angel #clown
00:39
Super Beauty team
Рет қаралды 36 МЛН
Beat Ronaldo, Win $1,000,000
22:45
MrBeast
Рет қаралды 158 МЛН
How to Use List Generate to Make API calls in Power Query
18:27
BI Gorilla
Рет қаралды 43 М.
Search for key words with Power Query
18:04
Access Analytic
Рет қаралды 31 М.
How To Iterate Web Results Using Power Query M
20:01
It's Not About The Cell
Рет қаралды 7 М.
Web Scraping with Python - Start HERE
20:58
John Watson Rooney
Рет қаралды 39 М.
How to import data from multiple URLs in Power BI
9:40
DataTaleAU
Рет қаралды 8 М.
Easy Looping in Power BI/Power Query
15:46
BI Elite
Рет қаралды 104 М.
How To Scrape Any Website
9:36
Karston Fox
Рет қаралды 100 М.
Cat mode and a glass of water #family #humor #fun
00:22
Kotiki_Z
Рет қаралды 42 МЛН