How To Scrape Reddit & Automatically Label Data For NLP Projects | Reddit API Tutorial

  Рет қаралды 25,669

Patrick Loeber

Patrick Loeber

Күн бұрын

Пікірлер: 45
@bthapa94
@bthapa94 3 жыл бұрын
Great video and well explained! How do you scrape ALL the posts for a certain time period? I am looking a small subreddit and require a lot of data.
@dabunnisher29
@dabunnisher29 3 жыл бұрын
You are certainly one of my favorite Python Masters. I really needed to learn how to do this for stocks. Thank you sooooooo much! You are AWESOME!!!!!
@patloeber
@patloeber 3 жыл бұрын
Glad you like it :)
@dabunnisher29
@dabunnisher29 3 жыл бұрын
I looked all around today and I couldn't find how to search a subreddit by a key word like "PLTR", get the results and use the NLTK library. If anyone can help, I would appreciate it.
@dodgewagen
@dodgewagen 3 жыл бұрын
Thanks! Definitely, do more of these API consumption/analysis videos.
@patloeber
@patloeber 3 жыл бұрын
Ok :)
@carfromcars3679
@carfromcars3679 Жыл бұрын
wowwww easiest tutorial to follow by FAR. thank you!!!!
@NoIntroTutorials
@NoIntroTutorials 2 жыл бұрын
MAGNIFICENT! I just needed the first part, getting the post titles!, thank you man!
@moy92
@moy92 3 жыл бұрын
Thanks for doing this! I have been wanting to scrape reddit for a while as exploratory analysis
@patloeber
@patloeber 3 жыл бұрын
hope you like it :)
@paulsastre9833
@paulsastre9833 Жыл бұрын
thank you for this wonderful video. but how did you get the url used in the beginning
@Asianyoungman22
@Asianyoungman22 5 ай бұрын
thank you very much, you save my life, my dissertation for my master degree.
@catalina5382
@catalina5382 3 жыл бұрын
This is exactly what I wanted. I would like to know what modifications do I have to make in order to get the headlines with the flair as well
@chasengonzales85
@chasengonzales85 Жыл бұрын
This is really awsome thank you for taking the time to put this together.
@anny23108
@anny23108 3 жыл бұрын
Could you do a tutorial for mining historical data as well? thank you
@fernandosantos3576
@fernandosantos3576 3 жыл бұрын
Yes, I woul love if you publish a video on a complete project. Thank you.
@patloeber
@patloeber 3 жыл бұрын
Ok 👌🏻
@ElectroCoderEC
@ElectroCoderEC Жыл бұрын
woooow amazing. You save my life. very useful. Thanks a lot! :)
@mealone007
@mealone007 3 жыл бұрын
Great video! Quick question, how to scrape the historical headlines with date stamp?
@bitsinbytes9002
@bitsinbytes9002 3 жыл бұрын
The UTC attribute will give you the Unix Timestamp, then you just have to convert it. Getting historical headlines may be a little trickier, as the PRAW API allows you to iterate through the following "submission" types: controversial, gilded, hot, new, rising, top.
@miaoinperth680
@miaoinperth680 2 жыл бұрын
Thanks so much for your video. Will you share the codes in github or somewhere?
@JackFrost1206
@JackFrost1206 3 жыл бұрын
Maybe you can scrape the subreddit wallstreetbets :D
@patloeber
@patloeber 3 жыл бұрын
Good idea
@nathanielpetruska1305
@nathanielpetruska1305 3 жыл бұрын
@@patloeber Scrape to see the stocks that are rising in popularity! haha
@tazrinkhan1297
@tazrinkhan1297 3 жыл бұрын
Thank you for this video. This is really helpful. I am trying to get data for a particular time period (March 2020- November 2020). Can you please tell me how to write the code for this?
@varinderjitkaur3656
@varinderjitkaur3656 2 жыл бұрын
great video, i am trying to get the historical daily number of members on a subreddit. Is it possible using praw?
@prod.kashkari3075
@prod.kashkari3075 3 жыл бұрын
Wow push and praw!
@wasgeht2409
@wasgeht2409 3 жыл бұрын
Hey, danke für das Video :) Habe unten lesen können, dass du aus Deutschland bist. Ich hätte da mal eine Frage und zwar ist es auch möglich über LDA kommende Textnachrichten in Themengebiete zuzuordnen ?
@basemgoueli
@basemgoueli 3 жыл бұрын
I have a project I could use the help of someone of your caliber with. I want to determine the five stocks mentioned most frequently on Reddit's WallStreetBets page on a given day. from January 2022-August 2022 (I have the CSV file for this). After that I want to take the five most commonly mentioned stocks based on number of days in the top 5 from the aforementioned analysis. I would like to plot the number of mentions of the given stock per day against its stock price for the designated time frame. Any help you can offer would be greatly apprecaited.
@prajjwalsinha1187
@prajjwalsinha1187 Жыл бұрын
How do I scrape comments from reddit posts?
@Probly
@Probly 2 жыл бұрын
Do you know how to scrape in a specified time period so I can compare sentiment towards a stock within r/wallstreetbets or r/investments against the historical stock price of the same period
@gsom2000
@gsom2000 3 жыл бұрын
great tutorial! Thanks a lot! is there any opportunity to do the same with twiiter data?
@patloeber
@patloeber 3 жыл бұрын
I already have 2 tutorials using the twitter API (tensorflow NLP and flask Twitter bot). Maybe you can apply the knowledge from these videos here
@gsom2000
@gsom2000 3 жыл бұрын
@@patloeber nice! apparently i just missed them! Danke!
@blancaherrerosdetejada7160
@blancaherrerosdetejada7160 Жыл бұрын
Is it a way to automatically scrape any new posts in a subreddit? (without having to re-run program)
@limjuroy7078
@limjuroy7078 3 жыл бұрын
Why the user_agent is not "Example"?
@selcukturk3550
@selcukturk3550 Жыл бұрын
how can i get this code?
@samarendrapradhan5067
@samarendrapradhan5067 2 жыл бұрын
I"m using python 3.9,so older vesion may differ for my below comment.Thanks
@fintech1378
@fintech1378 Жыл бұрын
why is it always 401?
@gardnmi
@gardnmi 3 жыл бұрын
Just went to that politics subreddit. It's laughably bias. Thanks for the tutorial.
@patloeber
@patloeber 3 жыл бұрын
Haha yeah
@knowledgeshack5040
@knowledgeshack5040 3 жыл бұрын
First!
@JackFrost1206
@JackFrost1206 3 жыл бұрын
Are you german?
@patloeber
@patloeber 3 жыл бұрын
Yes I am
@samarendrapradhan5067
@samarendrapradhan5067 2 жыл бұрын
Please import followings import matplotlib.pyplot as plt import seaborn as sns nltk.download('vader_lexicon') Use from vaderSentiment.vaderSentiment import SentimentIntensityAnalyzer as SIA in place of from nltk.sentiment.vader import SentimentIntensityAnalyser as SIA Please suggest witdth =100 is showig error as "width' is an invalid keyword argument for print()"
How-to Use The Reddit API in Python
23:21
James Briggs
Рет қаралды 56 М.
Wait for it 😂
00:19
ILYA BORZOV
Рет қаралды 11 МЛН
Hoodie gets wicked makeover! 😲
00:47
Justin Flom
Рет қаралды 80 МЛН
Human vs Jet Engine
00:19
MrBeast
Рет қаралды 195 МЛН
PRAW - Using Python to Scrape Reddit Data!
28:31
BitsInBytes
Рет қаралды 6 М.
Building a Chatbot with ChatGPT API and Reddit Data
27:35
Thu Vu data analytics
Рет қаралды 52 М.
How is this Website so fast!?
13:39
Wes Bos
Рет қаралды 932 М.
Always Check for the Hidden API when Web Scraping
11:50
John Watson Rooney
Рет қаралды 643 М.
HuggingFace Crash Course - Sentiment Analysis, Model Hub, Fine Tuning
38:12
This is How I Scrape 99% of Sites
18:27
John Watson Rooney
Рет қаралды 159 М.
Awesome Python Automation Ideas
11:50
Patrick Loeber
Рет қаралды 112 М.
Scraping comments and posts from reddit in Python from scratch
13:36
Generative AI in a Nutshell - how to survive and thrive in the age of AI
17:57
Wait for it 😂
00:19
ILYA BORZOV
Рет қаралды 11 МЛН