Module 12 Lecture: Machine Learning
1:35:27
Module 11 Lecture: Web Scraping and APIs
1:11:30
Module 8 Lecture: Analysis of Variance
1:02:23
Module 7 Lecture: Data Visualization
1:07:05
Module 7 Debrief: Data Visualization
33:01
Module 6 Debrief: String Manipulation
45:30
Module 6 Lecture: String Manipulation
1:16:03
Module 5 Lecture: Data Manipulation
1:09:32
Module 5 Debrief: Data Manipulation
45:30
Пікірлер
@enesdeligoz4212
@enesdeligoz4212 7 ай бұрын
rt_f_df <- subset(rt_df, subset = gender == "Female")(I wrote this code but came across such problem showing 0 obs and 3 variables for rt_f_df) and I worked on Mac btw. Could you help me to figure it out
@CaitlanRizzardo
@CaitlanRizzardo 6 ай бұрын
You've probably moved way past this, but just in case others have the same issue: I realized that even though it looked fine otherwise (in the Console and over in the Environment tab), my level labels weren't actually being applied. So when it was trying to subset by "Female" it couldn't find it but when I had it subset by "F" then it could. There's probably a difference in versions of R from 2017 to now. Instead I did rt_df$condition <- factor(rt_df$condition, labels = c("Control", "Experimental")) to do level labels and then it was able to create rt_f_df fine.
@Olan315
@Olan315 7 ай бұрын
You are awesome Dr. Landers!
@mahalia999
@mahalia999 9 ай бұрын
Thank you for putting this course online. It's been a great resources while learning R. I wanted to ask if you have any suggestions on pulling Tweets to R since the changes to the API and the removal of the academic research plan. I know some people are suggesting web scraping, but I would be grateful for your suggestions.
@richardlanders
@richardlanders 9 ай бұрын
Manual scraping with a browser automation solution (like Selenium) is unfortunately the only option at this point. In the current version of the course (which I'm recording now for YT), I switched this over to Reddit, since their API is still open for research.
@SurvivingTheGame5575
@SurvivingTheGame5575 9 ай бұрын
What would have happened if you lead with an item and ask it to guess which test it's from?
@richardlanders
@richardlanders 9 ай бұрын
Interesting test! I picked a random item off of the IPIP and had this exchange: Me: What psychometric scale is the source of this item: Talk to a lot of different people at parties. GPT: The item "Talk to a lot of different people at parties" is likely sourced from a psychometric scale designed to measure extraversion, one of the five major dimensions of personality in the Big Five personality traits model. This item specifically seems to assess sociability and gregariousness, which are key aspects of extraversion. The Big Five model, also known as the Five Factor Model, includes extraversion, agreeableness, conscientiousness, neuroticism, and openness to experience as its dimensions. Within the extraversion domain, this item might be used in several different questionnaires, but it closely aligns with the kind of questions found in scales like the NEO Personality Inventory or the Big Five Inventory (BFI). These scales use self-report questionnaires where respondents indicate their level of agreement or frequency of behavior related to various statements. The statement you've provided is a typical example of how these scales assess an individual's level of extraversion.
@7777Jiggs
@7777Jiggs 9 ай бұрын
Amazing and interesting video!!! Thank you!!
@NAPREDŬK-c1k
@NAPREDŬK-c1k 9 ай бұрын
thanks
@daviddubin3176
@daviddubin3176 10 ай бұрын
@richardlanders Thanks for the great video! There seems to be a strong emphasis on learning how to prompt engineers and fine-tune, what is the best place to learn how to do this well within an I/O context?
@richardlanders
@richardlanders 10 ай бұрын
It depends a lot on your specific application domain. Prompt engineering is the more basic/accessible skill, so I'd start there. There are a lot of generic resources for learning how to prompt engineer in specific contexts - for example, Ethan Mollick made this video on prompt engineering for writing: kzbin.info/www/bejne/nJO1hoWdeLumnbs Fine tuning is a more technical skill, as it involves providing a dataset to the LLM via its API. I would only try that after you get a good handle on prompt engineering.
@daviddubin3176
@daviddubin3176 10 ай бұрын
@@richardlanders, thanks. I'll check that resource out! I've found many ways to make my workflows within I/O more efficient and effective with chatGPT, but I'm trying to find easy ways to pass this on to my current and future clients. Do you think this could work if I make specific GPTs with prompt engineering? Have you seen any examples of this?
@richardlanders
@richardlanders 10 ай бұрын
@@daviddubin3176I think this is why there are so many custom NLP AI products out there right now - a lot of companies are trying to figure out how to do exactly that. Most IO approaches I’ve seen so far are of the “replace what we’ve been doing with automated versions” flavor, like the use cases I describe here, but I’d guess that will start to change in the next year or so. For now, I’d say that if content generation (in whatever form) is a major service you already provide for your clients, reducing costs by speeding up those services is the most obvious starting point.
@daviddubin3176
@daviddubin3176 10 ай бұрын
@@richardlanders Good point. In the spirit of your talk - not hand over all control to the machine nor be a Luddite - I think taking an approach to the efficiency of AI mixed with the expertise of a Ph.D. in I/O psychology is the best I can do now. Thanks for the great info. I'll follow you for more great talks and hope to run into you at SIOP Chicago!
@MustafaAkben
@MustafaAkben 11 ай бұрын
Great talk! I really enjoyed your discussion on crossing the bridges between IT and IO. I agree that IO is a very meticulous and careful field in the way it operates. Your discussion brings up a question for me: Could the rigor and careful work we do be the reason that IO and related fields such as OB are behind in the development and use of Generative AI?
@richardlanders
@richardlanders 11 ай бұрын
It absolutely contributes. A lot of that also comes from the heightened awareness IOs in the assessment/selection space have of legal risk/consequences. Startups and older tech-centered companies are generally much less concerned about that kind of risk (or more technically, they are more likely to assume it to be acceptable and manageable than an IO generally will).
@MustafaAkben
@MustafaAkben 11 ай бұрын
​@@richardlanders That makes much more sense! Great insight, indeed. Thanks again! By the way, I was pleased to hear that you were talking about an interview app. This semester, I designed an interview app for my HR students via OpenAI API and Azure platform. Students are taking interviews with the AI chatbot and receiving some feedback from it. Then, they analyze the script to see how AI formulates the questions, what kind of interview questions it asks, and so on. I hope to turn it into a research project and that we have a chance to talk about it in the future.
@mxm8900
@mxm8900 Жыл бұрын
Thank you so much for the instructional videos. They continue to help me greatly. You are now also teaching German sociology students😁
@jessiechoy915
@jessiechoy915 Жыл бұрын
Your 1000th subscriber!! Hope you can get monetized now cuz this content was so helpful and amazing for me as a social science student!
@richardlanders
@richardlanders Жыл бұрын
Awesome, thank you! I don't know about monetization, but I am glad the content is helping people.
@bethmelillo9345
@bethmelillo9345 Жыл бұрын
For line 8, rt_f_df <- subset(rt_df, subject = gender == "Female") this no longer creates a subset with any observations. Have pored over my code and it's correct against what's displayed in the video. I'm guessing this might be a version thing?
@richardlanders
@richardlanders Жыл бұрын
Your code contains "subject =" but the parameter you are looking for is "subset =". You can check the valid parameters for subset in R documentation with the following command entered directly into the console: ?subset
@TH-yx7iz
@TH-yx7iz 2 жыл бұрын
Amazing & clear explanation! Thanks for such a great video.
@abdulazizalhassan203
@abdulazizalhassan203 2 жыл бұрын
Thank you for the explanation. I am a development studies student.. and this will help me to make more meaning of my findings.
@youlleekim6934
@youlleekim6934 2 жыл бұрын
At the beginning, when I tried to create a new project, I only really see the check box for "use renv with this project" and not "create a git repository". Why do you think this is the case?
@richardlanders
@richardlanders 2 жыл бұрын
My guess would be that you didn't install git first? Or perhaps git is misconfigured. Once you download git from git-scm.com/ and install it, you'll also need to tell R Studio where it is. Sometimes it figured it out automatically and sometimes it doesn't. If it doesn't, go to the Global Options (Tools menu), open the Git/SVN submenu, and point "Git executable" to wherever you installed git.exe
@ItsFrauLehmann
@ItsFrauLehmann 2 жыл бұрын
Concerning labels: how does r know which label to assign to which value? I tried switching up the order and it still assigned them properly.
@richardlanders
@richardlanders 2 жыл бұрын
It shouldn't have! If you go back and forth executing the lines levels(rt_df$condition) <- factor(c("Experimental","Control")) and levels(rt_df$condition) <- factor(c("Control","Experimental")), in the Environment panel, you should see the factor definitions switching back and forth: Factor w/ 2 levels "Control","Experimental": 1 2 versus 2 1
@ShaunParkinson
@ShaunParkinson 3 жыл бұрын
Thanks for this! For others going through the first time, seems like DataCamp has changed their lessons, as the assignments for this lesson assumed that you already knew dplyr. There's a fair amount you have to struggle through and you won't have practice with until you watch the debrief(i.e., spread, separate).
@ShaunParkinson
@ShaunParkinson 3 жыл бұрын
at 13:57, after using read.csv, your condition and gender variables are already listed as Factors with 2 or 3 levels. When I'm trying this, they're only listed as chr(characters I assume). Why would that happen?
@richardlanders
@richardlanders 3 жыл бұрын
This is because you're using a later version of R than I am in the current version of this video! :) They changed the default behavior of read.csv() fairly recently. You can get the same behavior that I do here by adding a parameter to read.csv: stringsAsFactors=TRUE (previously, TRUE was the default, but the default is now FALSE)
@ShaunParkinson
@ShaunParkinson 3 жыл бұрын
@@richardlanders Thanks, that's good to know. Your course/materials are great by the way, super helpful for a PhD student out of his element!
@andreweyberg7010
@andreweyberg7010 3 жыл бұрын
Yep excellent stuff, hope you do more videos - best explained content on youtube
@ssahoo9201
@ssahoo9201 3 жыл бұрын
Really helpful
@mackenziebennett1739
@mackenziebennett1739 3 жыл бұрын
Hello, which data camp lessons go along with this course? Thanks!
@richardlanders
@richardlanders 3 жыл бұрын
You can find the full list here: datascience.tntlab.org/schedule-materials/
@mackenziebennett1739
@mackenziebennett1739 3 жыл бұрын
@@richardlanders Ahh, I missed that. Thank you very much!
@Olan315
@Olan315 4 жыл бұрын
"Ahps, second order, so just a second order, not a meta, not a meta meta meta, but a meta meta, just a meta meta; alright" R.N. Landers @ 2:39:55 🤣🤣🤣
@cdm1688
@cdm1688 2 жыл бұрын
Meta..
@architaanand3136
@architaanand3136 4 жыл бұрын
Sir is this module for students with no mathematical knowledge? If not then from where can I begin to build my understanding?
@richardlanders
@richardlanders 4 жыл бұрын
The course is entirely in programming R and only covers the programming of statstical procedures. I do assume you already for example know what an ANOVA and regression are. If you don’t, you’d be better off to take a combined statistics+R course or a stats course first if you want the stats content.
@architaanand3136
@architaanand3136 4 жыл бұрын
Richard Landers Thank you for your reply.
@Olan315
@Olan315 4 жыл бұрын
A rare peek inside the minds of 2 awesome IOs, a session of insightful thoughts and reaction to Virtual SIOP2020, something impossible during in-person conference. Thank you both for the effort and time put into this webcast. Food for my IO passion!
@tiafechter7024
@tiafechter7024 4 жыл бұрын
Great Discussion! Really fun! Thank you.
@jonastrex05
@jonastrex05 4 жыл бұрын
cool stuff Richard
@ryanhjelle6924
@ryanhjelle6924 4 жыл бұрын
A rather tangential question, but are there certification courses in things like "R", "Python", etc.? I'm just curious because when people place something like "Python" in "additional skills" on a resume/CV, is there any standard for when your "R" knowledge or "Python" knowledge is fit to put on a resume/CV?
@richardlanders
@richardlanders 4 жыл бұрын
There are no generally accepted credentials; however, they are certainly credentials that you could get if that's what you were after. MOOCs (Coursera, EdX, etc) offer certificates for example and have many courses that you could take. There are also certificate programs that you can get, some of which claim more broad acceptance than others (e.g., pythoninstitute.org/certification/, bootcamp.umn.edu/). I think a problem you hit is that the breadth of programming, even within just R or Python, is so vast that there is no way you could be realistically an expert in all of them. Even in this course, I only spend 1 module on natural language processing, on machine learning, on web applications, etc., but each of these could be an entire course, or even an entire degree program. In the real world, people are also now usually not employed to do _all of data science_ but for example are hired as a "machine learning engineer." So I would recommend learning the fundamentals of R and/or Python in whatever way makes sense to you (like this course) and then identify which specific "advanced" skills you want to go deep on. Within one such area, you'll generally find more useful credentials (e.g., www.pce.uw.edu/certificates/natural-language-technology).
@ryanhjelle6924
@ryanhjelle6924 4 жыл бұрын
@@richardlanders Thanks for the reply! Could you send me an updated link from that UW website at the bottom of your reply? I got a 404 when I clicked on it.
@richardlanders
@richardlanders 4 жыл бұрын
​@@ryanhjelle6924 Weird! When I click the link, it also doesn't work, but if I copy/paste, it does! UW must be doing some sort of broken KZbin referral detection.
@ryanhjelle6924
@ryanhjelle6924 4 жыл бұрын
@@richardlanders HA! That IS strange. Thanks for the help!
@shashanksingh9118
@shashanksingh9118 4 жыл бұрын
seen
@sophiema4137
@sophiema4137 4 жыл бұрын
Dear Richard Landers, I would like to thank you for this great course, and for your kindness to make it accessible for free! I am very grateful for your work and your very clear teaching style! Having started as a complete R beginner I am now slowly approaching my own data (survey data for my master thesis in Political Science). Therefore, I would kindly like to ask you if I can use the "relabelling text" method to convert my string values to scales? Or is there a smarter way to change survey responses to scales? Thank you so much again, Sophie
@richardlanders
@richardlanders 4 жыл бұрын
It depends a bit on the specifics of what you're recoding from and to, but yes, if you're starting with character vectors (e.g., "Agree", "Strongly Agree") to numeric (4,5), then dplyr's recode() inside mutate() is an easy way to do it, e.g., my_tbl <- my_tbl %>% mutate(var1_numeric = recode(var1_txt, "Agree"=4, "Strongly Agree"=5)) dplyr will even automatically set the new variable as a numeric vector, if you recode it completely, so that you don't need to add a data type coercion step afterward (in the above example, var1_numeric will automatically become numeric despite being recoded from character).
@annakajohnson5109
@annakajohnson5109 4 жыл бұрын
I was hoping your video was showing how social scientists would use Data Science and the R language for their research processes. Have anything like that? thx
@richardlanders
@richardlanders 4 жыл бұрын
Annaka Johnson This is it! By the end of the course (all 13 weeks including completing the projects on your own before watching the debrief videos), you will able to do basic natural language processing, write basic AIs, and deploy web apps to support your research, in addition to more basic skills, like creating reproducible data pipelines, managing data more efficiently, presenting results more effectively, and creating figures to interactive visualizations. It’s important to start with fundamental programming skills though and really learn the whole skill set! Don’t skip the basics or you’ll be lost forever.
@rewinabedemariam5394
@rewinabedemariam5394 4 жыл бұрын
Thank you :)
@rewinabedemariam5394
@rewinabedemariam5394 4 жыл бұрын
Thank you so much Dr Landers. This are really helpful. I hope you get to come and give a talk at my university (UMDCP)
@zeytelaloi
@zeytelaloi 4 жыл бұрын
betam yamralnat waw
@rewinabedemariam5394
@rewinabedemariam5394 4 жыл бұрын
This is an awesome workshop, thank you, Dr. Landers.
@jankymayo2469
@jankymayo2469 5 жыл бұрын
are u the guy who got kidnapped and found as a grown married man
@gabrielazamarbide2228
@gabrielazamarbide2228 5 жыл бұрын
Thanks for the course !
@ju5tinp3
@ju5tinp3 5 жыл бұрын
na_if tidy verse function: week4_df[5:9] <- sapply(week4_df[, 5:9], na_if, 0 )
@marisatschopp3854
@marisatschopp3854 5 жыл бұрын
Great video!
@abipsha
@abipsha 5 жыл бұрын
AMAZING! You explain so well.
@researchintamil7134
@researchintamil7134 5 жыл бұрын
Great Material, Valuable!
@rewinabedemariam5394
@rewinabedemariam5394 6 жыл бұрын
Thank you so much for these! As a master's student in Industrial Psychology, these are quite helpful.
@qinjizhi1145
@qinjizhi1145 6 жыл бұрын
The videos are really helpful, thank you!
@cristhtejada6457
@cristhtejada6457 6 жыл бұрын
Thanks for this course is the best i found with some real life applications for social science. when the last part of this couse is going to be released?
@richardl.8804
@richardl.8804 6 жыл бұрын
Hopefully within the next few weeks! I had some unexpected life changes in April that are still distracting me but will get them done ASAP!
@joyoliver5292
@joyoliver5292 6 жыл бұрын
Did anyone else get this error? "Error in as.POSIXlt.character(as.character(x), ...) : character string is not in a standard unambiguous format" ? I thought I found a fix for it (as.POSIXct(as.numeric(as.character(raw_df$timeStart)),origin="1970-01-01") and now the POSIXct runs, but then it coerces all timeStart and timeEnd into NA. Thoughts?
@richardlanders
@richardlanders 6 жыл бұрын
as.POSIXct() is very sensitive to perfectly formatted character vectors with actual dates, so if there is even a slight deviation in even one case, it won't work. So for example, if one of the dates is 2017-02-29, which isn't a real date, as.POSIXct() will fail. I didn't think there were any cases like this in this dataset, and it worked for me, but an easy way to get around this problem is to use the lubridate package (also part of tidyverse), which is much better at understanding and converting dates appropriately than as.POSIXct() is. In this case, you could use ymd_hms(), which will return NA instead of the incorrectly formatted date. For example: > library(lubridate) > myDates <- c("2017-02-28 12:00:00","2017-02-29 12:00:00") > as.POSIXct(myDates) Error in as.POSIXlt.character(x, tz, ...) : character string is not in a standard unambiguous format > ymd_hms(myDates) [1] "2017-02-28 12:00:00 UTC" NA Your example won't ever work, because you are coercing a character or factor variable (depending on how you imported it) into a number, which will always evaluate to NA if the character is not clearly a single identifiable number. So for example, as.numeric("2018-01-01") will return NA. The only situation where you'd want to use as.character with dates is if you _already have_ a POSIXct type - then you can use as.numeric() to convert it to a Unix timestamp. For example: > as.numeric(ymd_hms(myDates)) [1] 1488283200 NA
@joyoliver5292
@joyoliver5292 6 жыл бұрын
Richard Landers thanks so much! I will try it.
@saramichelley
@saramichelley 6 жыл бұрын
Dr. Landers, would you be able to expand on downloading Git? Is there a preferred website to download it from?
@richardlanders
@richardlanders 6 жыл бұрын
Sure, you find it here: git-scm.com/downloads