AI and Web Scraping: The Unstoppable Duo
21:12
How to Bypass IP Bans
5:04
Ай бұрын
How to Hide Your IP Address
6:59
3 ай бұрын
Scraper APIs Playground: Google
8:03
What Are Transparent Proxies?
4:33
Getting started with ISP Proxies
1:29
Scraper APIs Playground: Amazon
7:49
Пікірлер
@dand4485
@dand4485 21 минут бұрын
I've plenty of experience with both. Until BiDi comes in and is supported Playwright is hands down much better if you want reliable tests. Also depending on the app being tested and more often the JS front end library used Angular, React, others, while i had code for Selenium to worth with shadow roots, selenium couldn't reliably work with the shadow dom nodes, same test would fail 40-60% of the time. Playwright on the other hand, automating the same exact pages 100% success 100% of the time. Not to mention a number of things PW has done to help reliably get and work with screen elements Selenium, well you could get kind of the same functionality but will require someone rather knowledge a fair amount of work to get things to get Selenium to have the same functionality that is simply baked in and supported by PW out of the box. I've been away from Selenium for about 3 years now, i doubt it has support to intercept REST/Http requests? PW has this out of the box too...
@InTheZone-p4d
@InTheZone-p4d Күн бұрын
What if we wanted to collect the description of each book?
@patrickfogui3620
@patrickfogui3620 Күн бұрын
girl u're so cute 😍
@traderbully
@traderbully 2 күн бұрын
But does it work? Also related queries is an option?
@indimerz
@indimerz 6 күн бұрын
will this bypass captcha on websites like binance, coinbase? if we try to login with multiple emails with puppeteer? for example i need to check login on binance with 100 emails? will binace block the request?
@oxylabs
@oxylabs 5 күн бұрын
We’d like to clarify that we do not support scraping under login. Such activities often violate the terms of service of the websites in question. Therefore, we cannot assist with this specific use case.
@indimerz
@indimerz 5 күн бұрын
@oxylabs i just want to know for learning purposes, like will this bypass the login captchas too
@flexocase440
@flexocase440 15 күн бұрын
How can I check my ip port it doesn't show up on terminal
@oxylabs
@oxylabs 6 күн бұрын
Hi! To check your IP and port, you may need to ensure the proxy settings are correctly configured. If you're still having trouble, please contact our support team via live chat or email at [email protected]. Our professional team is always available for assistance!
@Reddit.Storie.s
@Reddit.Storie.s 17 күн бұрын
what about if it is a numbered pagination but it doesn’t change the url depending on the page
@oxylabs
@oxylabs 14 күн бұрын
Hi, thanks for your question! For scenarios where pagination doesn’t alter the URL, you may need to extract content dynamically by interacting with the page’s HTML elements or scripts.
@augustdelacreme
@augustdelacreme 17 күн бұрын
Can you use Socks5 and HTTP proxies together?
@oxylabs
@oxylabs 14 күн бұрын
Hi, yes, it's possible :) But the specific setup depends on your tools or software. Oxylabs provides SOCKS5 and HTTPS proxies, but chaining them requires a proxy client or script that supports this configuration, like Proxifier, proxychains, or custom programming using libraries like requests with PySocks.
@augustdelacreme
@augustdelacreme 6 күн бұрын
@@oxylabsand what about the mobile proxies, is that a socks5, will it work on apps?
@oxylabs
@oxylabs 5 күн бұрын
Our Mobile Proxies support HTTP, HTTPS, and SOCKS5 protocols. You can learn more about using them here: developers.oxylabs.io/proxies/mobile-proxies/protocols
@augustdelacreme
@augustdelacreme 5 күн бұрын
@ thank you
@robtaylor4464
@robtaylor4464 19 күн бұрын
this shit is not working guys dont buy this shitty service is an scam proxy never work
@oxylabs
@oxylabs 14 күн бұрын
Hi, we’re sorry to hear you had a frustrating experience. This is definitely not the level of service we aim to provide. Could you please share more details at [email protected]?
@anonydass2.o817
@anonydass2.o817 19 күн бұрын
Ai name. ?
@_pavelvasilik
@_pavelvasilik 22 күн бұрын
wtf is she talking about??? i just want to know what to do jesus
@oxylabs
@oxylabs 14 күн бұрын
Hi, we appreciate your feedback and understand your frustration. To help you find the information you need more quickly, we recommend checking out our quick start guide oxylabs.io/blog/residential-proxies-quick-start-guide. You could also look into documentation that we have on our page, here's a link to documentation on Residential Proxies - developers.oxylabs.io/proxies/residential-proxies. Please let us know if you have any specific questions!
@apsaraG-k7r
@apsaraG-k7r 23 күн бұрын
When we click see more reviews, it is prompting for login.How to handle this
@ns_the_one
@ns_the_one 29 күн бұрын
Hello Martin, I am having some problems when i am trying to generate for digikey product pages. I managed to generate at first but then when i edited my prompt to explain better i started to get error "cant generate for these link (although i did not change the links and it run succesfuly once)" I sent you a request in linkedin for further details
@kritikusi-666
@kritikusi-666 Ай бұрын
Does it only work with puppeteer? How about selenium or playwright?
@oxylabs
@oxylabs 22 күн бұрын
Hi, thanks for your question! CAPTCHA bypass isn't exclusive to Puppeteer. You can use similar methods with Selenium or Playwright, as they're all browser automation tools. The key is integrating CAPTCHA-solving services or leveraging human-like interactions. Let us know if you need some more specific guidance :)
@GarenaGamer-v2b
@GarenaGamer-v2b Ай бұрын
How To Set Up and Use a SOCKS5 Proxy | Proxy Setup Tutorial
@tigreonice2339
@tigreonice2339 Ай бұрын
Beautiul AI woman
Ай бұрын
Can you make a tutorial on Websoket Like as WhatsApp use Websoket So target is send message and recive messages using web session ❤
@MuhammadOvais
@MuhammadOvais Ай бұрын
Can you show how to download a video from any platform using web scraping
@sneakymove
@sneakymove Ай бұрын
you should show more visuals of the software, looking at your face for 7 minutes talking is not engaging
@oxylabs
@oxylabs 14 күн бұрын
Hi, thank you for your suggestion! We’re always looking for ways to make our content more helpful and engaging. We’ll take this feedback into account for future videos :)
@alisiddiquii
@alisiddiquii Ай бұрын
She looks like an AI bot 😂
@seakayaker20
@seakayaker20 Ай бұрын
Great tutorial. Is 'get_product_info' missing from the parse_listing function? Thanks.
@oxylabs
@oxylabs 22 күн бұрын
Hi, it is not - but you can view the full code that we showed in the video here: oxylabs.io/blog/how-to-scrape-amazon-product-data
@tarah-leekaplan622
@tarah-leekaplan622 Ай бұрын
How Can I find out bout ip bypass. And bypass the ban
@oxylabs
@oxylabs 22 күн бұрын
Hi, to check if your IP is banned, try accessing the site from a different IP using a mobile network. If it works, your IP may be blocked. Alternatively, you can check error messages or contact the site’s support team for clarification. If your ban is confirmed, you can use techniques like switching to a new IP via proxy or other tips we showed in the video. Most importantly, make sure you're not violating the site's terms of service to not get banned again :)
@suvetar
@suvetar Ай бұрын
Cite?
@blueyZee
@blueyZee Ай бұрын
No way to know view username in the dashboard. Bad designed website
@oxylabs
@oxylabs 14 күн бұрын
Hi, thank you for your comment! You can find your credentials under your products, in the tab "users", however we're aware of this design flaw and we're looking to improve it soon. Thank you for your feedback :)
@RamandeepKaur-qt4np
@RamandeepKaur-qt4np Ай бұрын
Hi, I used these instructions to get the body of the product review. I get a successful response code of 200. But when I print the scraped data, it's empty. Any insight into how to fix it? Thanks
@mlsauron
@mlsauron Ай бұрын
Kol kas nepavyko nuimt 403. Bet judėsiu
@G3msFinder
@G3msFinder Ай бұрын
Thank you very much, my question is that Most of these settings are for windows guys, How about Linux user ?
@oxylabs
@oxylabs Ай бұрын
Thanks for watching! We hope you enjoyed this video 💙 Find more content like this here: oxy.yt/4jet
@vservicesvservices7095
@vservicesvservices7095 Ай бұрын
Will be handy if the script is included for download.
@oxylabs
@oxylabs Ай бұрын
Hello, noted. Here you go: github.com/oxylabs/scrape-google-python
@charisadu9544
@charisadu9544 Ай бұрын
Millie Bobie Brown now eplaining golang to me. Enough youtube for today
@guyteigh3375
@guyteigh3375 Ай бұрын
With a project to be able to scrape *most* pages it is given (a 30% or so failure rate here is perfectly tolerable) , how much code is required to make it populate fields like Title, Description, H1, H2, Category, first 4K of page text - and so on? Happy to use Python, Java or other language - and do not mind which headless browser to use - but the priority is speed (bandwidth to internet will not be an issue even though we need to be able to make this work on multiple cores at once). I realise this is perhaps not a simple question, but I am just wondering how difficult it is to create a script that will make it scrape well over half the sites it is asked to do, with a priority on speed. An hour or so work, an afternoons work, a weeks work? Most of the tutorials I have seen from others, explain how you can tune the system to search specific sites - which is awesome if you (for example) want to scrape a huge site with a consistent page format. I am after a guide that let's me provide a file of (say) 1000 pages - and it will "do its best" to scrape each one, regardless of layout - and populate fields like TITLE, DESCRIPTION, H1 Content, H2 Content, CATEGORY and so on - very much like a little search engine might want to do. If you know of any tutorials that might be worth a look, I would appreciate a link please. Google for once has not been massively helpful! Many thanks.
@Witty_traveler
@Witty_traveler Ай бұрын
It is not working now even after passing the header. Could you please suggest how to get the html code in the text format ?
@buhaha_bharathSurya
@buhaha_bharathSurya Ай бұрын
Is she a robot
@nordeenhasan8630
@nordeenhasan8630 Ай бұрын
me too IDK
@Aidas_Li
@Aidas_Li 2 ай бұрын
Why puppeteer and not playwright for example?
@oxylabs
@oxylabs 22 күн бұрын
Hi, great question :) Puppeteer is often chosen for its simplicity and tight integration with Chromium. Playwright, however, supports multiple browsers and advanced features like browser contexts. Your choice depends on project needs - Puppeteer is simpler, Playwright is more versatile. Both can handle fingerprinting with proper configuration.
@andreasfranzmauskrell1126
@andreasfranzmauskrell1126 2 ай бұрын
socks5 ^^ SCHWALLE
@juniorrocha8833
@juniorrocha8833 2 ай бұрын
muito bom o vídeo. Eu vi daqui do brasil
@AI_DS_Adventure.
@AI_DS_Adventure. 2 ай бұрын
She looking same as robot
@Aakash-e1x4t
@Aakash-e1x4t 2 ай бұрын
Same on my mind 😅
@oxylabs
@oxylabs 2 ай бұрын
Thanks for watching! We hope you enjoyed this video 💙 Find more content like this here: oxy.yt/Ljo6
@YaserFarid
@YaserFarid 2 ай бұрын
Twice as fast? I think GoLang is at least 10x faster than Python, Python's speed SUCKS.
@DanielAnagnos
@DanielAnagnos 2 ай бұрын
what about newbies..This doesnt help at all...damn it
@oxylabs
@oxylabs 14 күн бұрын
Hi, we understand starting with proxies can be challenging. For step-by-step instructions, check out our blog and documentation pages - this quick start guide should help you out oxylabs.io/blog/residential-proxies-quick-start-guide. Additionally, maybe you have any suggestions how we could make our videos more suitable for newbies? If there are still problems with starting out with our products, our 24/7 support team is here to assist you - [email protected]
@DevalPatel-zs2sp
@DevalPatel-zs2sp 2 ай бұрын
really really LOVE you My friend, Thank you for sharing the information :)
@DevalPatel-zs2sp
@DevalPatel-zs2sp 2 ай бұрын
in case if anyone is still suffering, i would like to tell you that from inspect > network > right click on the webpage request status> header > copy everything from headers> send it to chatgpt and tell them to create headerds of most of them and donot change the context... (chatgpt might change the safari version or something but tell to have the exact same context) i hpoe this work for you, please ignore spelling mistakes :)
@fizzy7984
@fizzy7984 2 ай бұрын
Yeah, your proxies fucking suck and they don’t work piece of shi ass company. I’m on Mac and have tried all the port #’s on different proxy types. I want my money back you spineless shitless fuck ass company
@sheepscorpion
@sheepscorpion 2 ай бұрын
Impressive, could you elabrate what plugin or tools used in your clip to aim at or locate the element of the page?
@GaleSimons-m2j
@GaleSimons-m2j 2 ай бұрын
Yasmin Causeway
@asmoreira
@asmoreira 2 ай бұрын
where are the full script in text mode ?
@oxylabs
@oxylabs 2 ай бұрын
Hello, find it on our GitHub: github.com/oxylabs/how-to-scrape-amazon-product-data
@muhammaddava5164
@muhammaddava5164 2 ай бұрын
still no bus😢
@ElanazŞener-d4z
@ElanazŞener-d4z 2 ай бұрын
Nicolas Mews
@jeetendrasamariya7798
@jeetendrasamariya7798 2 ай бұрын
Ai model shit😂😂