Applying random forest classifiers to single-cell RNAseq data

  Рет қаралды 6,771

Sanbomics

Sanbomics

Күн бұрын

Пікірлер: 24
@jsm640
@jsm640 2 жыл бұрын
Thanks for your helpful and technological video! And looking forward to some videos about scATAC seq.
@sanbomics
@sanbomics 2 жыл бұрын
Sometime in the future! I have a few more planned before that and not enough free time. But one day!
@muhammadjamalahmed2273
@muhammadjamalahmed2273 2 жыл бұрын
Love your work..
@sanbomics
@sanbomics 2 жыл бұрын
Thank you!
@calvinchen1081
@calvinchen1081 6 ай бұрын
Hey,bro,love your work!And please show me more machine learning.God, I am so tired of the slow speed of R when it runs machine learning with no use of GPU.
@sanbomics
@sanbomics 2 ай бұрын
I have so many grand plans but not enough time. I'm sure I will have some more soon!
@shilpasy
@shilpasy Жыл бұрын
Thank you so much, amazing video. Can you please tell me where can I get this kind of dataset to try this?
@sanbomics
@sanbomics Жыл бұрын
Any single-cell paper should have a data availability or equivalent section that contains links to the raw data or counts tables. Or you can search something like NCBI geo directly. Or you can look at the list of publications on the 10x genomics website.
@jamilaiqbal202
@jamilaiqbal202 10 ай бұрын
Cool video again. Would you be able to make a video on neural network applied to scRNAseq
@minus21334
@minus21334 2 ай бұрын
Hi, i have a question. Wouldn't the machine learning detect sample names pattern, and use cov/ctrl as prediction basics?? Should we instead remove the cov/ctrl sample names for training set??
@sanbomics
@sanbomics 2 ай бұрын
The model will never see the names. Just the expression data and the right/wrong label
@minus21334
@minus21334 2 ай бұрын
@@sanbomics silly me. thank you for your patience haha
@코모도-y5u
@코모도-y5u 2 жыл бұрын
Thank you so much!
@sanbomics
@sanbomics 2 жыл бұрын
You're welcome!
@mst63th
@mst63th 2 жыл бұрын
That was cool. Do you use your PC to run ML tasks, or are you using HPC systems?
@sanbomics
@sanbomics 2 жыл бұрын
Usually just my PC, but sometimes an AWS EC2 with Nvidia GPUs. Simple models like RF don't take much processing power at all. My PC is decently beefy too with Nvidia gpu, 128 gb memory, 24 cpu
@shreyaslabhsetwar6083
@shreyaslabhsetwar6083 Жыл бұрын
Amazing video! Are there any existing pre-trained models which we can directly use to auto-annotate cell types given cell clusters?
@sanbomics
@sanbomics Жыл бұрын
Not sure about pre-trained models. There are simple models like SingleR or CellTypist. But if you have a reference dataset you can train a model with SCANVI. I have a video on that
@shreyaslabhsetwar6083
@shreyaslabhsetwar6083 Жыл бұрын
@@sanbomics Thanks!
@garyhoward8198
@garyhoward8198 2 жыл бұрын
This is amazing! So helpful! I'm looking at applying some of these to publicly available data. How would this workflow change for k-nearest neighbour classification ? What would one need to change to do this ?
@sanbomics
@sanbomics 2 жыл бұрын
I haven't tried KNN for classification in single-cell, but neighborhood graphs are used all the time for unsupervised sc clustering. I'm not sure how well KNN would work without dimension reduction first but you could definitely try it. But dim reduction, like PCA, will require processing of your train/test together. Maybe there is a better way to do dim reduction but keep the train/test independent. RF is pretty flexible with the number of features. TLDR, I don't know, you should try it with only the variable features and see how accurate it is. Please let me know because I am curious!
@garyhoward8198
@garyhoward8198 2 жыл бұрын
@@sanbomics I tried it how can I send you the code ? Trying to do a ROC curve with it as well but the kernel keeps dying (even when I'm running it on the cluster).
@sanbomics
@sanbomics 2 жыл бұрын
You can upload it to a public github repository. Were you able to fix it? (sorry i just saw this, I don't get notifications for responses to my response)
@savparker9743
@savparker9743 2 жыл бұрын
🌹 【promosm】
Easy RNAseq volcano plot with one line of code
5:28
Sanbomics
Рет қаралды 6 М.
Single-cell gene set activity with AUCell
11:53
Sanbomics
Рет қаралды 6 М.
“Don’t stop the chances.”
00:44
ISSEI / いっせい
Рет қаралды 62 МЛН
Beat Ronaldo, Win $1,000,000
22:45
MrBeast
Рет қаралды 158 МЛН
Гениальное изобретение из обычного стаканчика!
00:31
Лютая физика | Олимпиадная физика
Рет қаралды 4,8 МЛН
Introduction to single cell ATAC data analysis in R
17:36
Sanbomics
Рет қаралды 16 М.
Random Forest Classification | Machine Learning | Python
17:29
Stats Wire
Рет қаралды 36 М.
Applying Random Forests and Decision Trees in R pt. 1
18:22
Spencer Pao
Рет қаралды 1,7 М.
Random Forests : Data Science Concepts
15:56
ritvikmath
Рет қаралды 49 М.
Single-cell analysis with scVI machine-learning toolkit
13:00
Sanbomics
Рет қаралды 10 М.
StatQuest: A gentle introduction to RNA-seq
18:26
StatQuest with Josh Starmer
Рет қаралды 516 М.
“Don’t stop the chances.”
00:44
ISSEI / いっせい
Рет қаралды 62 МЛН