Just came across this channel and am so surprised that you don't have at least 70k subscribers. Such quality content! keep at it.
@JohnWatsonRooney3 жыл бұрын
Thanks, will get there some day I hope!
@samman59803 жыл бұрын
I was literally just thinking about doing this. Are you some how web scraping my brain?!?
@JohnWatsonRooney3 жыл бұрын
Haha, that would be quite a skill to have!
@techamusement67143 жыл бұрын
It's damn truth
@pythonantole98923 жыл бұрын
Just blown away at how simple this is.
@celerystalk3903 жыл бұрын
Another great and super useful tutorial! Thank you and keep up the good work John.
@JohnWatsonRooney3 жыл бұрын
Thanks mate!
@loverboykimi Жыл бұрын
It was smooth. Thanks.
@c.caratti28483 жыл бұрын
Great intro tutorial to FastAPI with scraping! Thanks!
@techamusement67143 жыл бұрын
Thank you sir you just provided that what we wanted.Absolutely exquisite
@ricksanchez7077 Жыл бұрын
Thanks for sharing 😀
@s6yx3 жыл бұрын
dope. I was meaning to just use bs4 and flask for my old webscraping code but decided on fastapi because of this video
@hayat_soft_skills3 жыл бұрын
I love the content that you are sharing . your teaching method is awesome! kindly make more videos on hosting scraper on digital ocean like selenium script , scrapy script Thanks!
@zacky78623 жыл бұрын
Great tutorial. I'm gonna need this.
@chitranshjain59313 жыл бұрын
Thanks for sharing this ! Really like your stuff and it makes my life easier. 👍
@yusufrumi16262 жыл бұрын
Happy new year John.
@JohnWatsonRooney2 жыл бұрын
Happy new year!
@paulmiller5913 жыл бұрын
Cool stuff John
@JohnWatsonRooney3 жыл бұрын
Thanks Paul
@adityaraj12842 жыл бұрын
i have watched this video, its a fantastic video i got all i wanted except one that how to deploy it online so that i can use api for my android app projects. Please make a video on it as this is really gonna help me and i would be very thankfull to you.
@MuhammadAbdullah-fy6sg3 жыл бұрын
man you need to start streaming. Honestly there very few programming streamers on twitch.
@bighneswar982 жыл бұрын
Nicely explained mate. Unable to use fastapi along with scrapy. Throws internal error everytime and on command line it shows twisted reactor already running. Any fix?
@JohnWatsonRooney2 жыл бұрын
I’m that case I think you’d need them separate - use scrapy to save to a database then fastapi to display the information
@bighneswar982 жыл бұрын
@@JohnWatsonRooney So no way to run scrapy scrapers directly like request html ones using fastapi?
@thekarthik3 жыл бұрын
Great tutorial, thankyou. I was just thinking about doing this haha. Will this work with multiple endpoints too?
@JohnWatsonRooney3 жыл бұрын
Thanks! yes it would, we you just need to code them in
@DM-py7pj3 жыл бұрын
Have you done a video on setting up a virtual environment in python?
@waliullah3321 Жыл бұрын
please create an api for Turnitin (plagiarism checker) using fast api or flask or Django
@artof-war Жыл бұрын
How can you scrape more than one tag ?
@alfakih7247 Жыл бұрын
How to update negative reviews on Google
@hassanrahmani47643 жыл бұрын
I need to become expert in web scrapping what should i do😁
@-__--__aaaa3 жыл бұрын
maybe not to use bs4 to scrap everything & also u should try using regex btw
@hassanrahmani47643 жыл бұрын
@@-__--__aaaa hmmmm okay i ll see
@thekarthik3 жыл бұрын
@@hassanrahmani4764 honestly, have a look at this channel. It's all web scraping and he explains in an easy to understand manner
@mariabazueva72043 жыл бұрын
@@thekarthik do you know where he works now?
@karthikb.s.k.44863 жыл бұрын
Nice tutorial. What is the command prompt that you opened in windows OS for running fas api. Please let me know.
@hassanrahmani47643 жыл бұрын
Its is wsl sub system for linux u can get it from windows store on win 10😊
@karthikb.s.k.44863 жыл бұрын
@@hassanrahmani4764 Thank you 😊
@hassanrahmani47643 жыл бұрын
@@karthikb.s.k.4486 welcome am.thinking to make a web scrapoing and programming channel should i make it like will it grow? Freelancing related stuff and programmming
@karthikb.s.k.44863 жыл бұрын
@@hassanrahmani4764 sorry I am not sure on this
@valuetraveler2026 Жыл бұрын
What do you think about cacheing these FastAPI requests? I am using render() within the scraper function so it is slow. How would you handle dynamic data which changes frequently?
@JohnWatsonRooney Жыл бұрын
Since making this video I’ve found that separating it out is much better - having a separate script for scraping and adding to the database freeing up the api to serve the data itself. If you want to have a more scrape on request service you really need to think about using something like celery to manage the tasks and return the data when the scrape is complete
@paul_devos Жыл бұрын
I've been scraping websites for 10+ years as a hobby data scientist. bs4 & requests. Rare occasion if I have to, use Selenium. You have the most interesting content I've seen in some time on KZbin or the web in general. Using tools I hadn't even heard of. requests_html, selectolax, httpx, etc. I had used Scrapy before (found it too bloated for my needs). That said, regarding FastAPI + web scraping -- I hadn't ever really considered FastAPI as I mostly just make "packages" (e.g. Reddit API wrapper like praw) for any websites I scrape regularly. I'm guessing the added advantage of this is you can sell/expose the API instead of the package... otherwise, is there any inherent advantage to using FastAPI versus just creating the aforementioned package wrapper?
@JohnWatsonRooney Жыл бұрын
It's very use case dependent. Things like this can be useful integrating into applications, and having a simple API to access your data or run scraping jobs is quite useful but you are right, most are just standalone scripts/projects that run and have a specific task.
@forbidden_lion3 жыл бұрын
Been trying to upload this thing on Heroku for the past 3 hours, no solution! Can you please make a video on deploying FastAPI Scrapper on Heroku?
@kannadastocktrader3369 Жыл бұрын
Can I webscrap Amazon website for the product details and price??
@JohnWatsonRooney Жыл бұрын
yes, i have videos on my channel to show you how
@kannadastocktrader3369 Жыл бұрын
By creating api
@kannadastocktrader3369 Жыл бұрын
I have more than 1000 pages to be scrapped, it is taking 2 hours for me to scrape , but I wanna automate this and also reduce the time
@tpag202 жыл бұрын
I wish learn Python with FastAPI with Graphql with Strawberry.
@ahmd093 жыл бұрын
I just want to know how you got JSON formatted in the browser in such a nice way. I hate unformatted JSON data
@Klausi-uq4xq3 жыл бұрын
in firefox it is built in.. for chrome you have to install an extension like JSON Viewer
@JohnWatsonRooney3 жыл бұрын
I think I use “json formatter” chrome plugin
@ahmd093 жыл бұрын
@@JohnWatsonRooney thanks
@ahmd093 жыл бұрын
@@Klausi-uq4xq thank you
@willyle232 жыл бұрын
Hello John, I have a question. I was able to create everything like you have shown but I have used render() for a JS website. When I go to host it on fastAPI, it is telling me to use the AsyncHTMLsession instead. Have you ran into this problem?
@JohnWatsonRooney2 жыл бұрын
Hey! Did you put the render part inside an async function? If so change it to a regular sync function with just “def” and it should work
@willyle232 жыл бұрын
@@JohnWatsonRooney Thanks for your reply, No I did not. What I did was create a scraper class then created a function called load_rls which takes in all of the pages available on the given URL ( by looping through r.html and calling a helper function scape_other_pages to scrape data from all of the pages and add to a list.) . Both of the function uses HTMLsession class and the r.html.render(). This was able to run through an entire javascript page and return a response in the python terminal, however when I try to post it on FastAPI, it gets an error. I did what you suggested and I am getting a 404 not found error. Thank you for the help! edit: I found out why it was giving a 404 error, the parameter cat I takes in converts / into %2f,. I changed my function to just take in 1 parameter and when running it, I get runetimeError: There is no current event loop in thread "AnyIO worker thread"
@tpag202 жыл бұрын
awesome!
@DanishKhan-ob3xh3 жыл бұрын
how to remove watermarks using python fastapi
@theinstigatorr3 жыл бұрын
I have to confess I don’t really know what an API is still, particularly in the context you’re using it. I thought an API was a way for people hosting data to provide outsiders access to data without being able to access everything carte blanche. To me this just looks like you’re creating a scraper project. What am I missing?
@faldofajri67963 ай бұрын
MATURSUWUN SANGET MISTER
@randyallen8610 Жыл бұрын
how can i contact you
@ferilukmansyah30373 жыл бұрын
great tutor thanks john
@acikast2 жыл бұрын
Thank you John for sharing your knowledge!
@JohnWatsonRooney2 жыл бұрын
Sure thing!
@NotBeHaris3 жыл бұрын
I have question not related to this video sir how to scrape web which are build with vue or react means with javascript because there behavior is different from simple html sites.
@rameshks52813 жыл бұрын
Hi sir, can you create one video about import multiple URLs present in external files like excel and import those URLs individually and loop through and scrape and append the scrape data to the same file .