Gene Set Enrichment Analysis (+ R tutorial)

  Рет қаралды 51,001

LiquidBrain Bioinformatics

LiquidBrain Bioinformatics

Күн бұрын

Пікірлер: 57
@omairshariq6444
@omairshariq6444 3 жыл бұрын
I'm in the closing stages of my PhD and needed to get my head around GSEA and clusterprofiler in a short period of time, and this video was EXACTLY what I needed. Thank you!
@LiquidBrain
@LiquidBrain 3 жыл бұрын
Glad that it helps! All the best to your PhD :)
@AqleemAbbas
@AqleemAbbas 2 жыл бұрын
@@LiquidBrain Hello Great, lecture Give me your wechat id
@AqleemAbbas
@AqleemAbbas 2 жыл бұрын
Great What is enrichment? and Enrichment score? What are samples? You mean RNA from different plants spp or same spp
@AqleemAbbas
@AqleemAbbas 2 жыл бұрын
What is difference between false discovery rate and type errors
@marijohnson5212
@marijohnson5212 Жыл бұрын
I'm exactly the same haha, bless this vid
@ashleyhoped
@ashleyhoped 2 ай бұрын
I have been trying to figure out this part of my analysis for so long and was struggling with where to even begin but this video was perfect! So easy to understand and straight to the point, you're amazing. Thank you!
@clutch3171
@clutch3171 5 ай бұрын
very kind of you to explain this sort of material. it could easily take a book probably for some to get through it.
@maltam3873
@maltam3873 4 ай бұрын
Thumbs up to you guys, what a wonderful channel, looking forward to see more bioinformatic enthusiasts in Malaysia
@Unaimend
@Unaimend 2 жыл бұрын
Wow I watched your WGCNA Tutorial a few weeks ago and how I have to analyse my results and jsut saw you also have a GO video. Thanks
@player90nurken85
@player90nurken85 Жыл бұрын
Thank you very much for making this video It was very clear and straight to the point
@SuperMixedd
@SuperMixedd Жыл бұрын
having subtitles would be nice
@user-yh4tx6ol9e
@user-yh4tx6ol9e 2 жыл бұрын
Thank you very much. I like your presentation and explanation.
@c.p.8689
@c.p.8689 Жыл бұрын
very helpful. Thanks for sharing the knowledge!
@MrRamaeri
@MrRamaeri 3 жыл бұрын
thank you so much for your presentation
@gianni_drm
@gianni_drm Жыл бұрын
What if I don't have DEGs but instead I have co-expression modules. I have a gene list with associated GO terms, but I don't have any LFC or values. Is it still possible to use clusterProfiler?
@LiquidBrain
@LiquidBrain Жыл бұрын
Yes, you can use enrichplot package for the visualization :)
@johirislam8174
@johirislam8174 2 жыл бұрын
Great vedio.But I would have to clear something suppose i analysed a data set by deseq2 and found DEG .So now i have to convert the DEG symbol to entrez id for functional analysis.So will i now download the file into .csv and then cut the gene symbol from the file and save into a new excel sheet and import the R studio for the entrez id?After getting the id i again download the file and pick the entrez id and paste it in the first download file and remove the symbol?After that import the file and generate that.That'll about you mean to say the lecture??Am I write?Another thing is that My gene symbol is about Homo sapiens but you should there mmusculus.So what will be the code for homosapiens??
@LiquidBrain
@LiquidBrain 2 жыл бұрын
Hi Johir, for converting the gene symbol to entrez ID, you can do it in R using biomaRt package: library(biomaRt) gene_id
@johirislam8174
@johirislam8174 2 жыл бұрын
@@LiquidBrain i am not clear about your data code.and gene id means my selected gene id?Yourdata
@stretch8390
@stretch8390 2 жыл бұрын
@@johirislam8174 See how in defining the object "gene_id" attributes was listed with 3 things: ensembl_gen_id, external_gene_name, and entrezgene_id. The c(2,3) you are querying is columns two and three in gene_id. That is because of the comma in the square brackets means the selection is on columns and not rows, see: gene_id[ ,c(2,3)] I believe you will understand the x and y by looking at the documentation for function merge() in r. Hope this helps!
@whobugui
@whobugui Жыл бұрын
Thanks for the video! I think the table accompanying the hypergeometric testing is not accurate in a sense that the row and column names should be switched to match the definition though it is mathematically equivalent.
@Tiagomed96
@Tiagomed96 Жыл бұрын
Thanks for these nice videos, your tutorials are a great resource! I was wondering, for the GSEA analysis, should you not filter out noise? Like genes that are not DEGs based on p value alone (not fold change)? Otherwise, your input will be full of genes that are just noise, but GSEA only takes into account the ranked list of logFC. If a gene has a high logFC but it is not reproducible in your samples, it won't have a low pvalue and would be filtered out as noise in a DEG analysis. But if you input just a ranked list of all genes without filtering for pvalues prior, you might end up with many genes which are just noise? Thanks in advance!
@LiquidBrain
@LiquidBrain Жыл бұрын
Hi thanks for liking the video! Yes, for GSEA analysis, I usually filter and use only the genes with significant pval from their differential expression analysis output. -Lind
@javierhernando5063
@javierhernando5063 Жыл бұрын
Is it possible to apply this approach to RT-qPCR analysis? Obtaining relative quantification values (Ct or cycle threshold) normalized by its endogenous controls? Or just RNA-Seq?
@LiquidBrain
@LiquidBrain Жыл бұрын
Yes, it is the downstream analysis to look for the enrichment of a set of gene-of-interest, as long as you supply the correct gene IDs and/or the ranking values (log2FC in my example), hope this helps :) - Lind
@Gabrielowns321
@Gabrielowns321 2 жыл бұрын
So informative and helpful thank you ☺️
@user-yh4tx6ol9e
@user-yh4tx6ol9e 2 жыл бұрын
Hi, I wrote a question about the argument of gseGO used in the script, but I found it deleted... can you please answer me if the argument "padjMethod" is neccessary in the analysis or not? I found several papers skipped with the argument, on the other hand, they used BH correction I would appreciate it for your advice. Thanks
@LiquidBrain
@LiquidBrain 2 жыл бұрын
Hi it is a statistical way to perform the multiple test correction, BH is commonly applied , you may change to other method or use the default one as most people do :) Here’s for better understanding: www.rdocumentation.org/packages/stats/versions/3.5.0/topics/p.adjust
@mortezahadizadeh9637
@mortezahadizadeh9637 3 жыл бұрын
Thanks for your valuable tutorials.
@nitinshukla6532
@nitinshukla6532 2 жыл бұрын
I am stuck how to do same thing for non model organism such labeo rohita which is present on NCBI. It doesn't have any ensemble id or anything Please help
@LiquidBrain
@LiquidBrain 2 жыл бұрын
Hi Nitin, yes, you need to create the gene annotation reference list yourself, here's the tutorial by the author GuangChuang Yu guangchuangyu.github.io/2015/05/use-clusterprofiler-as-an-universal-enrichment-analysis-tool/ hope it helps. -Lind
@chrislee8408
@chrislee8408 Жыл бұрын
is it possible to do a gene set enrichment analysis without doing a DEG? In my lab, we have just started doing NGS and we are still setting up our QCs. What we have in mind right now for one of our QC is to see if we can guess which sample (there are 4 samples) came from which tissues (heart, liver, kidney, diluent) by doing a gene set enrichment analysis to see if we can identify overexpressed genes which may only be expressed in specific tissues. Do you think it's feasible? Thank you.
@LiquidBrain
@LiquidBrain Жыл бұрын
Oh you need to perform DGE. Differential gene expression analysis will come out a list of genes that the expression is distinguished from other samples, then GSEA is to understand what are terms/pathways that enriched in the DEGs. Hope this helps - Lind
@abebemisganaw7377
@abebemisganaw7377 7 ай бұрын
great
@ntsopoul
@ntsopoul 2 жыл бұрын
before doing gse are you filtering for pvaladj from your list of differential expressed genes?
@LiquidBrain
@LiquidBrain 2 жыл бұрын
Yes, based on what I understood, we can filter with adjusted p value -Lind
@trishakrishna939
@trishakrishna939 Жыл бұрын
Really hard to catch up with what she is saying.
@setarehsohail5422
@setarehsohail5422 2 жыл бұрын
Perfect!
@jasonlee416
@jasonlee416 Жыл бұрын
Hi, thank you very much for the video. I am really new to R studio, and this maybe a stupid question, but could you kindly expand a bit more on the GOplot (circle_dat and GOChord) please? For example, I have an RNA-seq data (species: rat) with EMSEMBL IDs and log2fold changes, and would like to generate a circular plot as you did in this video. Thank you!
@LiquidBrain
@LiquidBrain Жыл бұрын
Hi thanks for your comment! Too bad that I afraid may not have time to demo properly, but I found the GOplot creator, Wencke Walter, has a very nice tutorial about it wencke.github.io/ hope it is useful for you :)
@chinhhoang2375
@chinhhoang2375 2 жыл бұрын
Thank you for your useful video. I used to run clusterProfier smoothly, but recently it has informed that "fail to download KEGG data... Error in download.KEGG.Path(species) " even though I used the same script that worked before. I would be grateful if you show me how to fix it. Thanks!
@LiquidBrain
@LiquidBrain 2 жыл бұрын
Hi it’s hard to diagnose with little info…could you let me to have a look on the script ? You may send it to liquidbrain.r@gmail.com
@amtd3808
@amtd3808 6 ай бұрын
I dont understand...
@supriyaawasthi2944
@supriyaawasthi2944 2 жыл бұрын
Hi, Really an awesome presentation...thanks a lot...can you please make video on analysing somatic mutation data from exome data analysis...or else suggest some good tools for somatic mutation data analysis in form of network analysis
@LiquidBrain
@LiquidBrain 2 жыл бұрын
Hi may I know more what kind of somatic mutation analysis do you refer to?
@shadabali9476
@shadabali9476 Жыл бұрын
difficult to understand whether you are speaking chinese or english
@martbull12
@martbull12 2 жыл бұрын
Very helpful!
@LiquidBrain
@LiquidBrain 2 жыл бұрын
Glad it was helpful!
@kdr9608
@kdr9608 3 жыл бұрын
Thanks!
@palettenpeter331
@palettenpeter331 2 жыл бұрын
Thanks for the great video! When performing gsea: should I only use up- or downregulated genes or all genes? Thanks a Lot! :)
@LiquidBrain
@LiquidBrain 2 жыл бұрын
For GSEA, you can decide based on the aim for the experiment, but I believe the default ranking tries to place upregulated genes on the left and down regulated genes on the right. So you can observe the effects of both in the enrichment analysis. -Brandon
@palettenpeter331
@palettenpeter331 2 жыл бұрын
@@LiquidBrain thank you so much! Really love this channel
@amrsalaheldinabdallahhammo663
@amrsalaheldinabdallahhammo663 2 жыл бұрын
Thank you genius, can you please make mitch analysis on R :)
@AqleemAbbas
@AqleemAbbas 2 жыл бұрын
What is enrichment? Definition
@LiquidBrain
@LiquidBrain 2 жыл бұрын
Hi what do you want to know about the enrichment? Perhaps Google is your good buddy?
@ncedilemankahla9758
@ncedilemankahla9758 Жыл бұрын
sadly your stepwise guidance was not as great here
Gene Set Enrichment Analysis| GSEA algorithm
18:09
LiquidBrain Bioinformatics
Рет қаралды 14 М.
Gene Set Enrichment Analysis (GSEA) - simply explained!
8:14
Biostatsquid
Рет қаралды 28 М.
Will A Guitar Boat Hold My Weight?
00:20
MrBeast
Рет қаралды 112 МЛН
Throwing Swords From My Blue Cybertruck
00:32
Mini Katana
Рет қаралды 11 МЛН
Electric Flying Bird with Hanging Wire Automatic for Ceiling Parrot
00:15
How to use DAVID for functional annotation of genes
12:55
Genomics Guru
Рет қаралды 76 М.
Introduction to Pathway and Network Analysis
56:28
Bioinformatics DotCa
Рет қаралды 37 М.
Single cell transcriptomics - Differential gene expression and Enrichment analysis (8 of 10)
1:06:42
SIB - Swiss Institute of Bioinformatics
Рет қаралды 2,7 М.
How to use ggplot2 in R | A Beginner's RStudio Tutorial
43:38
LiquidBrain Bioinformatics
Рет қаралды 35 М.
Pathway enrichment analysis - simple explanation!
12:53
Biostatsquid
Рет қаралды 22 М.
Gene set enrichment analysis in R
1:29:32
Kim Dill-McFarland
Рет қаралды 28 М.
GSEA with a custom gene list using clusterProfiler
23:07
Aldas_On_Bioinformatics
Рет қаралды 370
Will A Guitar Boat Hold My Weight?
00:20
MrBeast
Рет қаралды 112 МЛН