Dimensionality reduction with tidymodels for the Billboard Top 100

  Рет қаралды 5,468

Julia Silge

Julia Silge

Күн бұрын

Пікірлер: 29
@517127
@517127 3 жыл бұрын
I'm sure that your videos make a greather impact in the quality of my work than any paid course
@enicay7562
@enicay7562 3 ай бұрын
Thank you
@marcogelsomini7655
@marcogelsomini7655 2 жыл бұрын
16:50 very interesting point!
@mochardhikurniawan1009
@mochardhikurniawan1009 3 жыл бұрын
That's a great video, thank you for uploading this video :D Hope someday, you can make a video about time series using tidymodels with hyperparameter tuning
@mkklindhardt
@mkklindhardt 3 жыл бұрын
Once again. Thank you Julia!
@tighthead03
@tighthead03 3 жыл бұрын
This is what I've been looking for thank you very much. Now to figure out how to quantify which approach provides the most accurate model performance. Thanks again.
@carvalhoribeiro
@carvalhoribeiro 2 жыл бұрын
Awesome. Thank you so much !
@aallstar414
@aallstar414 3 жыл бұрын
awesome video as usual!
@janetfigueroa3288
@janetfigueroa3288 2 жыл бұрын
Do you need to use tests such as Bartlett's sphericity test and the KMO index (Kaiser-Mayer-Olkin) before doing PCA?
@JuliaSilge
@JuliaSilge 2 жыл бұрын
I wouldn't say "need to" per se; you can read more here: stats.stackexchange.com/questions/92791/why-does-sphericity-diagnosed-by-bartletts-test-mean-a-pca-is-inappropriate
@janetfigueroa3288
@janetfigueroa3288 2 жыл бұрын
@@JuliaSilge Thanks Julia! I'll take a look. Do you test these assumptions when diving into PCA? Or what approach do you take if that makes sense? So many nuances to these methods/assumptions can definitely bog down some steps.
@alextantos658
@alextantos658 2 жыл бұрын
Excellent tutorial and demonstration! However, there is one thing I don't really get. Why would one define an outcome variable for PCA? I mean isn't the point of this type of models to reveal hidden dimensions that express the variability of the data without having an outcome variable?
@JuliaSilge
@JuliaSilge 2 жыл бұрын
Oh for sure, it is not necessary to specify an outcome. You can see how to set up a recipe step with no outcome in the examples here: recipes.tidymodels.org/reference/step_pca.html#ref-examples The reason I included an outcome here was to show how you could use dimensionality reduction as a preprocessing step before fitting models, like this: www.tmwr.org/dimensionality.html#bean-models
@alextantos658
@alextantos658 2 жыл бұрын
@@JuliaSilge Thank you for the links! Chapter 16 of the book uses the "class" variable as the outcome variable for all four types of PCA that are exemplified and it was not clear to me whether I would need to add an outcome variable while preparing the recipe. You reply here clears that out. In the mean time, I also watched yesterday an earlier video of yours that uses "~." for ignoring the outcome variable, while conducting PCA. Keep up the good work!
@xaviercasas100
@xaviercasas100 3 жыл бұрын
Very cool 👍 wish my teachers did videos like this
@srdjanobradovic66
@srdjanobradovic66 3 жыл бұрын
Superb, thank you.
@Stoney-g1o
@Stoney-g1o 3 жыл бұрын
I am only able to set the resolution to 360p. Do you still have the original recording at a higher resolution? It is difficult to read the text in the IDE. thak you for the excellent tutorials. I know it is a lot of work.
@JuliaSilge
@JuliaSilge 3 жыл бұрын
Can you try again? I can set it all the way to 1080p.
@Stoney-g1o
@Stoney-g1o 3 жыл бұрын
@@JuliaSilge Yes, different resolutions up to 1080p are available with the default at 720p
@guberney
@guberney 3 жыл бұрын
Thank you. An excellent video. Do you have any suggestions for multidimensionality reduction using tabular data as input?
@JuliaSilge
@JuliaSilge 3 жыл бұрын
Yes, these will all work for tabular data. Unless you mean something I am not understanding?
@guberney
@guberney 3 жыл бұрын
@@JuliaSilge By definition PCA is a method for quantitative variables. My question is about handle tabular data to apply multidimensional reduction as PCA, PLS or UMAP.
@JuliaSilge
@JuliaSilge 3 жыл бұрын
@@guberney I'm not sure what you mean here; "tabular data" typically means data arranged in a table form with rows and columns like what I have used here, like what you would find in a CSV or database or an R dataframe. Maybe you mean something else?
@guberney
@guberney 3 жыл бұрын
@@JuliaSilge I mean qualitative variables, nominal or ordinal. I am sorry for the “tabular data” expression.
@JuliaSilge
@JuliaSilge 3 жыл бұрын
@@guberney Ah gotcha. You can use `step_dummy()` to create dummy/indicator variables for any nominal/factor/string variables, before normalizing: recipes.tidymodels.org/reference/step_dummy.html
Predict ratings for chocolate with tidymodels
37:35
Julia Silge
Рет қаралды 5 М.
Lasso regression with tidymodels and The Office
44:49
Julia Silge
Рет қаралды 11 М.
If people acted like cats 🙀😹 LeoNata family #shorts
00:22
LeoNata Family
Рет қаралды 43 МЛН
To Brawl AND BEYOND!
00:51
Brawl Stars
Рет қаралды 16 МЛН
PCA and UMAP with tidymodels and cocktail recipes
43:53
Julia Silge
Рет қаралды 13 М.
Dimensionality reduction of United Nations voting patterns
25:13
Julia Silge
Рет қаралды 4,5 М.
Resampling to understand gender in art history textbooks
27:13
Julia Silge
Рет қаралды 3 М.
Predict giant pumpkin weights with tidymodels workflowsets
37:19
Principal component analysis for hip hop songs with tidymodels
1:00:02
Why Does Diffusion Work Better than Auto-Regression?
20:18
Algorithmic Simplicity
Рет қаралды 396 М.
Predict injuries for Chicago traffic crashes with tidymodels
53:43
If people acted like cats 🙀😹 LeoNata family #shorts
00:22
LeoNata Family
Рет қаралды 43 МЛН