Find markers and cluster identification in single-cell RNA-Seq using Seurat | Workflow tutorial

  Рет қаралды 30,713

Bioinformagician

Bioinformagician

Күн бұрын

A detailed walk-through of steps to find canonical markers (markers conserved across conditions) and find differentially expressed markers in a particular cell type between conditions using Seurat's find markers functions in R. I hope you find the video informative. I look forward to your comments under the comments section!
1) Data:
drive.google.com/file/d/13I22...
2) Link to code:
github.com/kpatel427/KZbinT...
3) Vignettes:
▸ satijalab.org/seurat/articles...
▸ satijalab.org/seurat/articles...
▸ hbctraining.github.io/In-dept...
4) Marker databases:
1. SCSig: www.gsea-msigdb.org/gsea/msigd...
2. PangloDB: panglaodb.se/
3. CellMarker: bio-bigdata.hrbmu.edu.cn/CellM...
Chapters:
0:00 Intro
0:36 findMarkers(), findAllMarkers(), findConservedMarkers()
4:09 Study design
4:57 Load data
6:20 Visualize by clusters and condition
9:15 findAllMarkers()
12:51 DefaultAssay 'RNA'
14:10 findConservedMarkers() for cluster 3
17:12 Visualize canonical markers in a FeaturePlot
20:15 RenameIdents
21:55 Annotating clusters and marker databases
23:56 Annotating rest of the clusters
26:21 Perform differential expression in CD16 Monocytes between conditions (findMarkers())
30:52 Visualize markers identified by findConservedMarkers() vs findMarkers()
Show your support and encouragement by buying me a coffee:
www.buymeacoffee.com/bioinfor...
To get in touch:
Website: bioinformagician.org/
Github: github.com/kpatel427
Email: khushbu_p@hotmail.com
#bioinformagician #bioinformatics #findmarkers #findallmarkers #findconservedmarkers #deg #seurat #integration #cca #R #genomics #beginners #tutorial #howto #omics #research #biology #ncbi #GEO #rnaseq #ngs

Пікірлер: 65
@yusufali5812
@yusufali5812 2 жыл бұрын
Thanks for this simplified and super informative video!
@nayeemanushrat3174
@nayeemanushrat3174 2 жыл бұрын
Thank you! Looking forward to your next video tutorials!☺
@linus8490
@linus8490 2 жыл бұрын
Very informativet! Thank you! Looking forward to your next video tutorials!
@georgegavriil8951
@georgegavriil8951 2 жыл бұрын
Thank you for this very detailed and informative video, can't wait for the next scRNA-seq videos!
@lukesimpson1507
@lukesimpson1507 Жыл бұрын
Really amazing content. I could have saved myself months if I had found this channel earlier! Keep up the good work!
@janicexu1548
@janicexu1548 Жыл бұрын
Thank for these tutorials :-) I wish I had them earlier! Thanks also for including when you have errors as it is helpful for learning how to troubleshoot.
@poojasavla6240
@poojasavla6240 2 ай бұрын
honestly as a computational biologist who just started working in this industry, you are so awesome
@user-dp7sg7kd7f
@user-dp7sg7kd7f Жыл бұрын
Really helpful tutorial. Thanks for your effort!!
@aigerimk692
@aigerimk692 Ай бұрын
God bless you and your videos! Thanks a lot!
@demetronix
@demetronix 2 жыл бұрын
thank you for these videos. Very helpful!!
@pragnyarishika5661
@pragnyarishika5661 Жыл бұрын
Most useful channel for single cell RNA seq. Thank you so much for excellent explanation. Please make videos on building neural network models for single cell RNA seq data.
@tushardhyani3931
@tushardhyani3931 2 жыл бұрын
Thank you for this video !!
@siankangchong3617
@siankangchong3617 2 жыл бұрын
Thanks for the video! It is very helpful, I'm looking forward to seeing a video explaining the steps of performing GO enrichment analysis, appreciate your hard work!!
@Bioinformagician
@Bioinformagician 2 жыл бұрын
I shall make a video on GO enrichment analysis soon :) Thanks!
@user-mb5ld7re8m
@user-mb5ld7re8m 2 жыл бұрын
brilliant work!
@abdou-samadkone6397
@abdou-samadkone6397 11 ай бұрын
THANK you very much. You are amazing 🤩🤩🤩🤩🤩🤩🤩🤩
@user-gg1js5kg1p
@user-gg1js5kg1p 9 ай бұрын
Thanks a lot. I've been following your tutorial for the last 8/9 months. It helped a lot with my M.S thesis and my bioinformatics Knowledge. I appreciate your time and would like to request you that it would be helpful if you make a tutorial for Cell-cell communications for scRNAseq data analysis in R.
@sunghyoukpark7423
@sunghyoukpark7423 Жыл бұрын
Your videos are just awesome! I am looking forward to the cell type identification video. Without cell type identification, all the painstaking previous steps do not have much meaning, I guess.
@Bioinformagician
@Bioinformagician Жыл бұрын
Absolutely, working on it. Hopefully should be able to come out with it soon.
@kitdordkhar4964
@kitdordkhar4964 2 жыл бұрын
New commands learn today, q10. It would be great if you show some datasets on the mouse model. Due to the lack of mouse atlas, it is a long road to annotate the cells. I believe that you will find something easy for us to do. I will be waiting for the pipeline. Thanks again! Great video as always!
@Bioinformagician
@Bioinformagician 2 жыл бұрын
I shall consider using data from mouse models for some of my upcoming single-cell videos. Thanks for the suggestion! :)
@yukaizhang2675
@yukaizhang2675 Жыл бұрын
Nice content. Really helps me start from the beginning. Thank you! May I ask that how to fetch the relative expression of given genes of each animals/ conditions?
@user-ck3ki9hq9t
@user-ck3ki9hq9t 9 ай бұрын
Your tutorials make me feel like a first year grad student getting schooled by a 5th year. Nothing better than that! Thank you. Did you ever work in the Satija lab?
@chriskuo
@chriskuo Жыл бұрын
This is extremely helpful. If i am interested to see if there’s a cell that expressed both cd163 and cd45 how do i do that?
@fabiohbcosta
@fabiohbcosta 10 ай бұрын
Thanks for the amazing tutorials! One question: how do I perform this exact analysis starting from my filtered matrix.h5 files? I have two files, for two conditions, and wanted to do the same thing you did here. Thanks !
@efstratioskirtsios298
@efstratioskirtsios298 11 ай бұрын
Lovely video! Many thanks. Do you prefer using the DEseq2 option as test.use instead of the default in the DEG analysis? Is edgeR also compatible with Seurat? Sorry, I am new to this
@domenicoalessandrosilvestr7829
@domenicoalessandrosilvestr7829 Жыл бұрын
Hi, what is in your opinion the best test to use in the findmarkers or findallmakers function when comparing two cell populations with very different cell numbers?
@xiaoliu6964
@xiaoliu6964 Жыл бұрын
Your videos are super helpful and informative! Could you make a tutorial for how to integrate and analyze single-cell ATAC-seq and RNA-seq data? Thank you!!!!
@Bioinformagician
@Bioinformagician Жыл бұрын
That’s definitely in the pipeline. Please stay tuned :)
@xiaoliu6964
@xiaoliu6964 Жыл бұрын
@@Bioinformagician You are awesome 🤩!
@marcelohurtadocastillo3982
@marcelohurtadocastillo3982 Жыл бұрын
Great video, thank you so much for doing it! Sorry if I missed something but I didn't finish understanding why you choose to use FindConservedMarkers() to find markers differentially expressed between one clusters and all the others. As far as I knew, this is accomplished with FindAllMarkers() and FindConservedMarkers() will give you the markers that are conserved between two groups. Maybe the reason will be that you are calculating the differentially expressed markers from one cluster versus the other groups, but with similar expressions between the two conditions (treated and untreated)?. If this is the reason, you are not supposed to have the same result using FindAllMarkers()? Thanks again and hope you can help me :)
@junxiao7009
@junxiao7009 2 жыл бұрын
Thanks for your informative video! I have a question. Your last video had mentioned that the batch correction method 'harmony' would not change the original expression data (included 'count' or 'data' in seurat data), but add a dimensionality reduction data. However, when we use the 'FindAllMarkers' to identify the different expression genes bewteen the 'STIM' and 'CTRL', this function will use the 'count' or 'data' in our seurat data. Dose it mean we actually compared the expression bewteen 'STIM' and 'CTRL' arcoss the data without batch correction?
@Bioinformagician
@Bioinformagician 2 жыл бұрын
Great question! No matter which integration method you use, the one which returns a corrected expression matrix or the one which do not (like Harmony), we always perform differential expression test on 'unintegrated' data. That is the reason we make sure our default assay is set to 'RNA' (the assay that stores unintegrated data) prior to performing this analysis. The integration procedure inherently introduces dependencies between data points. This violates the assumptions of the statistical tests used for differential expression. So the 'count' or 'data' slot is from RNA assay that stores unintegrated data.
@abhilashdasari
@abhilashdasari 6 ай бұрын
0:18: 🔍 The video discusses finding differentially expressed features and cluster identification in single cell RNA seq data using the seurat package. 4:34: 🧬 The video discusses identifying gene expression changes in samples treated with interferon beta and the control group in a particular cell type. 9:59: ⚙ The video discusses the parameters for testing genes in clusters and populations. 14:35: 🔍 The video discusses the process of identifying cell clusters based on gene expression and grouping variables. 19:23: 📊 The video explains how to use quantiles to divide data and rename cell identities in a biological dataset. 24:20: ⚙ The video demonstrates how to perform cluster identification and find differential gene expression using pre-annotated cell data. 29:14: 🔬 The presentation discusses comparing gene expression in cd16 monocyte cells between stimulated and control groups. Recapped using Tammy AI
@abdou-samadkone6397
@abdou-samadkone6397 11 ай бұрын
This is extremely useful. What about using the FindconservedMarkers function to separate our cells, ie high/low PD1 expression, rather than control/treatment. Is it the same method? thanks
@sumankundu762
@sumankundu762 Жыл бұрын
Great presentation. simple, clear and to the point. Application and interpretation of many functions in Seurat package are now clear to me. Just wondering did you make any video how the processed dataset: ifnb_harmony.rds was constructed using the source data? This is just to appreciate the R codes better for my own understanding as I am relatively new in this space. Thank you.
@Bioinformagician
@Bioinformagician Жыл бұрын
This is the video - kzbin.info/www/bejne/sHbYopufqphnZ6s where I explain how ifnb_harmony.rds was generated.
@patrickmellors8445
@patrickmellors8445 Жыл бұрын
When we have used SCT to normalize data, I assume with should use the SCT assay for FindMarkers?
@KellyBlust
@KellyBlust Жыл бұрын
Hi, Thanks for your great videos! You mentioned during this video that you want to make a new video about using the automatic cell annotation tools. Has this already been done?
@Bioinformagician
@Bioinformagician Жыл бұрын
No it hasn't been done yet, however it is very much on my list of videos to make, and hopefully I should be able to create one soon. Thanks for following up!
@luiseduardogoncalves2228
@luiseduardogoncalves2228 2 жыл бұрын
Hi, thank you so much for your videos and for this topic specifically. I was trying to run it myself and I came across through this error: Error in findconservedmarkers(seurat_loom, ident.1 = 3, grouping.var = "Patient") : could not find function "findconservedmarkers" So it suggested to install these packages install.packages('BiocManager') BiocManager::install('multtest') install.packages('metap') After installing these packages, the same error keeps poping. Do you have any suggestions of what I should do?
@Bioinformagician
@Bioinformagician 2 жыл бұрын
Did you load the libraries after installing these packages?
@stacygenovese1761
@stacygenovese1761 2 жыл бұрын
This videos have saved me! I have three conditions: KO/WT/DBLKO. How do I do FindMarkers() on the integrated data? I can only specify ident.1 and ident.2. There is no ident.3. Any ideas???
@Bioinformagician
@Bioinformagician 2 жыл бұрын
One way I can think is you can make pairwise comparisons and then intersect the DE genes from both comparisons.
@chrisdoan3210
@chrisdoan3210 Жыл бұрын
Hi Bioinformagicain, I try to run FindConservedMarkers() but I got this message: Warning: Identity: 8 not present in group B. Skipping VVWarning: Identity: 8 not present in group A. Skipping NCError in marker.test[[i]] : subscript out of bounds. This error appears in many clusters I chose. Would you have a suggestion to troubleshoot this error? Thank you so much!
@singhh5050
@singhh5050 2 жыл бұрын
Hi! At what stage of the analysis workflow can you utilize GSEA?
@Bioinformagician
@Bioinformagician 2 жыл бұрын
GSEA gives you an idea on what pathways are differentially enriched. It could be after you identify markers for each cluster if you are trying to understand the biological mechanism of certain cells or it could be used to help you with cluster identification. If it is latter, then it would be used after you cluster your cells. So really depends on what your goal is.
@singhh5050
@singhh5050 2 жыл бұрын
@@Bioinformagician Thank you for your response! So does this mean that you can use GSEA to find enriched gene sets between different clusters in the same dataset/condition? Like can you compare different cell type clusters in one graph using GSEA? I’m used to thinking about it as something that you can only utilize when you have a specific control dataset and another experimental dataset and you compare similar cell types between the two conditions. I’m really new to the field of scRNA-seq analysis so any thoughts would be super helpful :)
@user-pn5gn8lw2o
@user-pn5gn8lw2o 6 ай бұрын
How to annotation other species other than mouse, like ferret?
@wasima4463
@wasima4463 Жыл бұрын
@Bioinformagician did you already make a video on automatic cell annotation tools (23:15)?
@sayantidey6368
@sayantidey6368 Жыл бұрын
@Bioinformagician ...Yes please if you have then that would be great for people who are struggling with the unbiased annotation using packages like SingleR. Thanks in advance.
@Bioinformagician
@Bioinformagician Жыл бұрын
That's next on my list. Hopefully should be able to come up with a video soon. Please stay tuned :)
@c.p.8689
@c.p.8689 Жыл бұрын
@@Bioinformagician SingleR is a mess. I just used the scType. Not that great either.
@menu1006
@menu1006 Жыл бұрын
Hello thx for making such informative videos plz create video on automated cell annotation using different packages in R.. this will be a great help thx
@Bioinformagician
@Bioinformagician Жыл бұрын
That will hopefully be published on my channel soon. Please stay tuned :)
@ifeoluwaemmanuel5093
@ifeoluwaemmanuel5093 26 күн бұрын
What if the seurat ident has not been given?
@derejejima9420
@derejejima9420 Жыл бұрын
How do you integrate and find markers for more than two conditions?
@Bioinformagician
@Bioinformagician Жыл бұрын
You could integrate data and follow the pseudo-bulking approach by aggregating counts for all cells to sample level.
@shubhrajitbarman3006
@shubhrajitbarman3006 2 жыл бұрын
I want to find number of cells present in each cluster? Please help me
@Bioinformagician
@Bioinformagician Жыл бұрын
table(seu.obj$seurat_clusters)
@bioinfo3
@bioinfo3 Жыл бұрын
You RenameIdents() on the Idents of the seurat object and then instead of renaming the remaining, just change the ident column to the existing annotations in the seurat object. This makes it very unclear of how someone would manually change the name of each cluster.
@HahaHub-gd4nz
@HahaHub-gd4nz 7 ай бұрын
Please talk slower
@chrisdoan3210
@chrisdoan3210 Жыл бұрын
Thank you for your video! Would you please tell me why you choose only top gene in b.interferon.response. Could we choose more genes such as top 5 genes in this list of 1273 genes?
Single Cell Sequencing - Eric Chow (UCSF)
24:37
iBiology Techniques
Рет қаралды 234 М.
Super gymnastics 😍🫣
00:15
Lexa_Merin
Рет қаралды 108 МЛН
Luck Decides My Future Again 🍀🍀🍀 #katebrush #shorts
00:19
Kate Brush
Рет қаралды 8 МЛН
I’m just a kid 🥹🥰 LeoNata family #shorts
00:12
LeoNata Family
Рет қаралды 19 МЛН
Men deeply respect women who apply these 4 rules
9:50
Brian Nox
Рет қаралды 1 МЛН
The Beginner's guide to bulk RNA sequencing vs single-cell RNA Sequencing
12:24
Standard scRNAseq preprocessing workflow with Seurat | Beginner R
31:16
Single cell transcriptomics - Cell type annotation (7 of 10)
46:02
SIB - Swiss Institute of Bioinformatics
Рет қаралды 2,4 М.
Super gymnastics 😍🫣
00:15
Lexa_Merin
Рет қаралды 108 МЛН