How to interpret GSEA results and plot - simple explanation of ES, NES, leading edge and more!

Рет қаралды 21,939

Күн бұрын

In this video, I will focus on how to interpret the results from Gene Set Enrichment Analysis (GSEA) and to interpret the plots.
Learn what are the main statistics given by GSEA and how to use them to make the most of your pathway enrichment analysis results, including how to interpret the Enrichment Score (ES), Normalised Enrichment Score (NES), p-values, FDR...
We will go through basic GSEA terms like the ranking metric, the leading edge subset and more!
Hope you like it!
--------------------------------------------------------------------------------------------------------------------
Watched it already?
If you liked this video or found it useful, please let me know! Your comments and feedback are very much appreciated😊
If you have questions, don't hesitate to leave me a comment down below, I will answer as soon as I can:)
--------------------------------------------------------------------------------------------------------------------
Are you into biostatistics and computational analysis?
For more biostatistics tools and resources, you can visit:
biostatsquid.com/
Follow me on Instagram at @biostatsquid:
/ biostatsquid
For more
• simple and clear explanations of biostatistics methods
• computational biology tools
• easy step-by-step tutorials in R and Python
to analyse and visualise your biological data!
Don’t forget to subscribe if you don’t want to miss another video from me! --------------------------------------------------------------------------------------------------------------------
Other interesting resources for GSEA:
Original publication: www.pnas.org/doi/10.1073/pnas...
You can conduct your own Gene Set Enrichment Analysis with GSEA Software:
www.gsea-msigdb.org/gsea/inde...
or if you want to program your way through it, I recommend the fgsea or clusterProfiler packages:
bioconductor.org/packages/rel...
bioconductor.org/packages/rel...

Пікірлер: 46

@fadingawayyyyy Жыл бұрын

Thank you! Really helped in my understanding so much better compared to trying to read articles :') your hard work is much appreciated by us all here!

@user-mp2qj3ip3d Жыл бұрын

Im so appreciated for this video that simplify the basic concept of enrichment analysis. Im look forward to topology-based method.

@biostatsquid Жыл бұрын

Thank you for your comment! I'm glad you liked the video. Some more coding tutorials coming up but I will definitely write down topology-based methods in my to-do list:)

@shreyarao7032 5 ай бұрын

Your videos are a lifesaver! Thank you for making these

@bioinforbricker Жыл бұрын

This is very impressive video for better understanding the GSEA results, thank you for your effort

@bllimbyeee Жыл бұрын

amazing teaching!! best gsea tutorial on KZbin omg this helped me so much, thank u!

@mocabeentrill Жыл бұрын

BiostatSquidee!!! My enduring gratitude as always! You're the best.

@aleon9166 Жыл бұрын

this is simply amazing, cant wait for new videos!

@Tearr 3 ай бұрын

Thanks a bunch! Wonderful video describing how to interpret GSEA!

@mihacerne7313 Жыл бұрын

I love mountains but i love the ones in your video even more!

@Andrew-oq3fs Жыл бұрын

Thank you for this video! Helped me out alot!

@HH-ew5pd Ай бұрын

Thank you for the clear explanation!! Great help!! Looking forward to upcoming videos:)

@thomaskizzar3403 2 ай бұрын

This is such a great video thank you so much!!!

@faezehrafiee6257 Жыл бұрын

Thank you! you are the best!

@constanzarodriguez3165 Жыл бұрын

thank u so much for this videos!! 😍

@denizkortik1716 Жыл бұрын

you are amazing. please keep doing what you do. ı am grateful.😍

@zazoudunet5756 Ай бұрын

Thank you, very useful !

@Delios90 Жыл бұрын

Thanks a lot for this!

@xelaldaero9339 Жыл бұрын

Amazing!! Thank you

@cintiapalu1929 2 ай бұрын

Beautifully explained! Keep up the good work, I'm a fan and will be spread your tutorials :

@gerardhoeltzel4690 5 ай бұрын

elite explanation. ELITE I TELL YOU. thanks very much

@kayondofadhir8923 Жыл бұрын

Great. Have u ever tried to plot the p-value distribution just to get a relationship of the p-value with the corresponding FDR out put?

@amrsalaheldinabdallahhammo663 Жыл бұрын

You are genius !!

@efstratioskirtsios298 9 ай бұрын

Amazing!

@hossein37 Жыл бұрын

Great thanks

@jgk9111 Жыл бұрын

This is the best video ever

@biostatsquid Жыл бұрын

Thank you! Glad you found it useful:)

@juhijaiswal4441 2 ай бұрын

Amazing

@Muuip 10 ай бұрын

Thanks! 🙂👍

@juliachristiaanse2985 7 ай бұрын

thank you!

@nikelElegance 4 ай бұрын

thank you alot

@danielgladish2502 Ай бұрын

<a href="#" class="seekto" data-time="134">2:14</a> I am not clear on what contributes to the magnitude of the increase/decrease of the running statistic (i.e. what number specifically is the input for the running statistic calculation). Is it the rank value? In the video you focus explicitly on fold change, but in the previous video you mentioned that rank is determined by both fold change AND significance. Great video by the way :)

@biostatsquid Ай бұрын

Hey Daniel, thanks for your comment, great question! I tend to use -log10(pval)*sign(FC), to get a combination of both, but there's not a consensus in the community as far as I know. There's a few blogs/papers that discuss it: www.biostars.org/p/375584/

@danielgladish2502 Ай бұрын

@@biostatsquid ah makes sense! So it sounds like there are a number of different ways of doing this. Thanks for clarifying and the quick reply! I will have a look at the link.

@shizhaocheng4422 Жыл бұрын

I have a question? Is image with negative enrichment score wrongly placed? At 3 minutes and 41 seconds, the graph on the right side of the display is a list of genes with no specific distribution.

@biostatsquid Жыл бұрын

Hi Shizhao, thanks for your question! I am not sure which graph are you referring to. If you mean the orange graph in front of the ES graph (with the fish), perhaps I could have made it more obviously distributed towards the lower part (negative diff expression), yes!

@shizhaocheng4422 Жыл бұрын

@@biostatsquid Thank you for your reply. I have learned a lot from your video. cant wait for new videos! Thank you ！

@biostatsquid Жыл бұрын

@@shizhaocheng4422 I'm happy to hear that:) thanks for your question and feedback!

@jessehines4044 Жыл бұрын

Im confused at the end of the video. You said the q-value is the probability of the p-value for the test being wrong, ok but which p-value? The nominal or the adjusted one? Also, isnt the q-value just the adjusted p-value for multiple testing?

@biostatsquid Жыл бұрын

Hi Jesse, it is a great question, p-values, q-values and p-adjusted values can be confusing. Yes, as you say, the q-value is an adjusted p-value for multiple testing. So, in simple words: p-val = chance of a false positive (i.e., if you use a p-val cut off of 0.05, it means you are taking a chance that there are 5% of false positives --> calling something significant when it is actually not) Problem of multiple testing - the more tests, the more chance of observing at least one significant result, even if it is actually not significant. We need to correct for this - for which we can use different methods: p-adjusted values: p values corrected using the (most commonly) Bonferroni correction. Usually too stringent. q--values: p-values corrected based on the False Discovery Rate (FDR) - now we are not taking about 5% of all results being false positives, but 5% of SIGNIFICANT results being false positives. Hopefully that made sense. If you want a more exhaustive explanation you might want to check my video on multiple testing correction: kzbin.info/www/bejne/goeufayqZqdma9k&embeds_euri=https%3A%2F%2Fbiostatsquid.com%2F&source_ve_path=Mjg2NjY&feature=emb_logo or blog post: biostatsquid.com/multiple-testing-correction-fdr/

@jessehines4044 Жыл бұрын

@@biostatsquid Thank you for the indepth reply. Ok, if the q-value is the adjusted p-value then the only thing I don't get is why is there a column for the adjusted p-value and a column for the q-value in the chart near the end of the video? Furthermore, their values are different for each row? Ohhh wait I just accidently skipped over the adjusted p-value section of your reply, sorry. Ok I see so difference between the q and adjusted p-values is the method used to correct for multiple testing(adjusted p-value yielded from the Bonferroni correction and the q-value from FDR). Thanks, that really clears it up!

@biostatsquid Жыл бұрын

@@jessehines4044 Exactly! Glad it helped:)

@chusty93 7 ай бұрын

@@biostatsquidHold on. Now I'm even more confused. I thought that the adjusted p-value was a correction using Benjamini-Hochberg (as said in the video), not Bonferroni. Besides, I thought that Benjamini-Hochberg was a FDR-controlling method. So that means that both adjusted-p-val and q-val are FDR corrected? Please, help

@biostatsquid 7 ай бұрын

Hi @@chusty93 thanks for pointing that out! So p-adjusted values are just p-values corrected for multiple testing. This adjustment can be made using various methods: Bonferroni, BH.... q-values are adjusted p-values that control the False Discovery Rate. There are several FDR-controlling methods but the most common one is Benjamini Hochberg. Therefore, when you see "p-adjusted," it often implies FDR correction, just because BH is much more common than Bonferroni, but it's essential to check the specific method used. Same way, if you have q-values, you know they are FDR-corrected p-values, and chances are they will have been corrected using BH since it's the most commmon one, but you should always check. Hope this clarified things, let me know!