Is Your Scraper Slow? Try THIS Simple Method

  Рет қаралды 5,335

John Watson Rooney

John Watson Rooney

Күн бұрын

Пікірлер: 20
@hicham_6544-_-
@hicham_6544-_- 5 ай бұрын
I have been following you for about 3 years, thanks for all the information, we wish a video on how to spend your day 😊
@muhammedjaved4322
@muhammedjaved4322 5 ай бұрын
You really deserve 10000000000 subscriber
@Mr.AIFella
@Mr.AIFella 5 ай бұрын
Q: I'm a beginner in the web scraping world as I entered it last February and I was trying to scrap a text from a single infinite scrolling page that needs two operations: one is clicking show more hyperlink (to show more hidden text for the reader). Two is scrolling the page down to show more content. Do you have a video reference that you suggest me to watch for this purpose. Appreciate it in advance, Best
@Extrey
@Extrey 5 ай бұрын
Amazing!!! I have sync playwright parser, and it's little bit messy and big, so rewriting it to async will be week of time at least, and tthreading will be powerfull solution. Thanks for your videos, insainly usefull, as ussual!
@irakli1264
@irakli1264 5 ай бұрын
Thank you John. I am considering scraping as a career path, but unsure about it. Would be nice to hear your opinions about web scraping career.
@exqvision
@exqvision Ай бұрын
This is great. Do you have a link to the code for review?
@mintydevdaz
@mintydevdaz 5 ай бұрын
have you considered using the niquests library in place of requests?
@JohnWatsonRooney
@JohnWatsonRooney 5 ай бұрын
No I haven’t heard of it - looking at it now will definitely give it a go thanks!
@razapoetra9355
@razapoetra9355 5 ай бұрын
I always get a new library every time I am going to comment section. 😂
@itzcallmepro4963
@itzcallmepro4963 5 ай бұрын
also what's the difference between using async_playwright api with asyncio and using sync_api with threading , when i used sync_api with threading , it really did open 4 browsers , but only 1 was scraping while others did nothing until the first one is finished .
@itzcallmepro4963
@itzcallmepro4963 5 ай бұрын
i still don't undetsand the difference between it and using asyncio with httpx for example , both almost works the same way i think , it waits for a thread to sleep and then runs another one , also in async when you wait for a request another process runs ,
@MahmudNuman
@MahmudNuman 5 ай бұрын
loved it 🥰
@patrickavis5475
@patrickavis5475 5 ай бұрын
I'm being slow...can someone explain the use of the proxy service/server here?
@berkay.digital
@berkay.digital 5 ай бұрын
Each request is sent through a different IP address to make it appear as if it's coming from different users. This prevents the website from blocking you.
@bakasenpaidesu
@bakasenpaidesu 5 ай бұрын
;)
@alexdin1565
@alexdin1565 5 ай бұрын
please can you make a video on how we can calculate how much 1k requests can costs I go to the nodemaven website and they say 5 GB for $35 I'm planing to sell scraping service but if 1k costs $35 its very expensive
@berkay.digital
@berkay.digital 5 ай бұрын
When you use proxies, the amount of data transferred varies depending on whether you're rendering a website or requesting data from an API. If you're rendering a website, it generally takes around 2MB per full load. However, if you're requesting data from an API, the amount of data transferred is much less. It's hard to determine the cost of the job as it depends on the specific needs of the task.
@pypypy4228
@pypypy4228 5 ай бұрын
My third like 😊
@JohnWatsonRooney
@JohnWatsonRooney 5 ай бұрын
Thank you!
This is How I Scrape 99% of Sites
18:27
John Watson Rooney
Рет қаралды 80 М.
Learning Scraping is MUCH harder now.
10:55
John Watson Rooney
Рет қаралды 6 М.
БЕЛКА СЬЕЛА КОТЕНКА?#cat
00:13
Лайки Like
Рет қаралды 2,2 МЛН
How Strong is Tin Foil? 💪
00:26
Preston
Рет қаралды 122 МЛН
АЗАРТНИК 4 |СЕЗОН 3 Серия
30:50
Inter Production
Рет қаралды 1 МЛН
Офицер, я всё объясню
01:00
История одного вокалиста
Рет қаралды 3,3 МЛН
This script I threw together saves me hours.
13:38
John Watson Rooney
Рет қаралды 19 М.
Has Generative AI Already Peaked? - Computerphile
12:48
Computerphile
Рет қаралды 993 М.
still the best way to scrape data.
41:01
John Watson Rooney
Рет қаралды 16 М.
Website to Dataset in an instant
13:15
John Watson Rooney
Рет қаралды 7 М.
15 Python Libraries You Should Know About
14:54
ArjanCodes
Рет қаралды 391 М.
The Biggest Issues I've Faced Web Scraping (and how to fix them)
15:03
Cleaning up 1000 Scraped Products with Polars
15:30
John Watson Rooney
Рет қаралды 5 М.
The Home Server I've Been Wanting
18:14
Hardware Haven
Рет қаралды 139 М.
The Easiest Way to Avoid Being Blocked When Web Scraping
8:19
John Watson Rooney
Рет қаралды 3,1 М.
БЕЛКА СЬЕЛА КОТЕНКА?#cat
00:13
Лайки Like
Рет қаралды 2,2 МЛН