How to use DESeq2's variance stabilizing transformation with microbiome data (CC195)

  Рет қаралды 4,645

Riffomonas Project

Riffomonas Project

Күн бұрын

Пікірлер: 22
@kylegervers1731
@kylegervers1731 2 жыл бұрын
Another great video, and I'm so excited that you're going to be tackling alpha diversity next! I'm assuming that the same benchmarking approach you've been using can also be applied, but the difference in richness, Shannon index, etc. can be plotted on the y-axis against the same difference in sequencing depth on the x-axis that we've been plotting. Along these lines, I'd be interested in testing how the iNEXT sample-size or coverage-based alpha diversity estimates (Hsieh et al., 2016, Methods in Ecol. and Evo.) compare to same rarefaction approach we've been using, which requires some samples to be dropped due to insufficient depth to estimate a parameter.
@Riffomonas
@Riffomonas 2 жыл бұрын
Thanks! Same approach. I’ll be sure to check out those other methods
@Mmitish
@Mmitish 10 ай бұрын
Please correct me if I am wrong -> (The VST in DESeq2 is commonly applied to raw counts (after filtering and handling of zero counts) to address overdispersion and stabilize the variance across the mean. This transformation is typically performed before downstream analyses that assume homogeneous variances, including distance matrices/ Euclidean or Bray or ...)
@Riffomonas
@Riffomonas 10 ай бұрын
That's my understanding. As used by the WNWN paper and elsewhere, a pseudocount of 1 was added to all values. You can see I did this here: github.com/riffomonas/distances/blob/c748a4086e9d1811810385ee2b11e270bbf1825a/code/variance_stabilization.R#L47
@Mmitish
@Mmitish 9 ай бұрын
Thanks!@@Riffomonas
@krishmen
@krishmen 2 жыл бұрын
Thank you for the wonderful series! These videos are very useful for digging into microbial ecology. What do you think about the assumption that uneven read count might reflect the true abundances of organisms/features? Thus, if the reads are rarefied to a common number, some important data about diversity can be lost. Or am I confusing something?
@Riffomonas
@Riffomonas 2 жыл бұрын
Hey Kriss - in general the read count doesn’t mean much because we do some type of normalization when we pool the PCR products. I have seen it matter when comparing things like lung and mouth when a ton of non specific amplicons are tossed bringing down their read count
@АлександрБолбат-ы1у
@АлександрБолбат-ы1у 2 жыл бұрын
Thank you. Are you going to publish these results? I would love to reference them, but referencing a KZbin video would likely be frowned upon. :)
@Riffomonas
@Riffomonas 2 жыл бұрын
I’m working on it! Hopefully I’ll have a preprint up in the next month or so
@leocadio_blanco
@leocadio_blanco 2 жыл бұрын
@@Riffomonas Hi Pat - I am again dealing with the same question from reviewers - Aitchison + Centerlog... Over the summer I was not sure if I read somewhere you had summitted the preprint? By now I just sending people the link to your youtubes....
@Riffomonas
@Riffomonas 2 жыл бұрын
Very sorry for the delay! I should have the preprint up by the end of the calendar year and hopefully much sooner
@1973vgc
@1973vgc 2 жыл бұрын
it is not boring! I am learning a lot from you! T H A N K Y O U !
@Riffomonas
@Riffomonas 2 жыл бұрын
Ha! Thanks for watching 🤓
@BannerOfBlasfemy
@BannerOfBlasfemy 2 ай бұрын
thanks a lot Pat. Would this stabilisation approach be appropriate prior to beta diversity analysis by NMDS of Bray-Curtis distances?
@Riffomonas
@Riffomonas 2 ай бұрын
Hi! No, I'd never encourage anyone use DESeq2. Use rarefaction of Bray-Curtis distances and then plug that in to NMDS
@BannerOfBlasfemy
@BannerOfBlasfemy 2 ай бұрын
@@Riffomonas Thanks Pat. I guess you refer to Episode CC200. From this episode's concluding remarks I took that DESeq2 is the better one at stablising variance in a microbiome dataset....My problem is a virome dataset with relatively even seq-depth, but huge variance in absolute (remapped-to-contig) read counts (100-7M virus reads per sample), obviously with many 0s in every sample. I have spent some time with the github discussions around DESeq and phyloseq, and although I learned some things, for my objective, I have left more confused.
@JimtheEvo
@JimtheEvo 2 жыл бұрын
One thing I found recently was trying to use deseq2 on a new Apple silicon Mac requires you use the intel version of r. Also I’ve always said no when asked if I want to compile from source, I think this works better if you don’t have a c++ library installed on your Mac. I might just have had 1 bad experience and stuck with not compiling.
@Riffomonas
@Riffomonas 2 жыл бұрын
Good to know - thanks for sharing!
@huikl6562
@huikl6562 2 жыл бұрын
Wonder if it is feasible to generate a mean rarefied count table so we can treat the data just the same for easy implementation to common tools and functions. There got to be a "centroid" which its vegdist() results is the same as doing avgdist() on original data, or are there are some complication that compel us to do the iteration right before calculating these metrics?
@Riffomonas
@Riffomonas 2 жыл бұрын
That would be the relative abundance
@gimanibe
@gimanibe 2 жыл бұрын
Thanks Pat, as always. Have you tried the normalization implemented in metagenomeSeq? I want to compare with your NULL model ;)
@Riffomonas
@Riffomonas 2 жыл бұрын
No but I’m not sure I’d expect it to be any different. Let me know what you find!
Is normalization an acceptable alternative to rarefaction? Nope. (CC190)
22:10
Don’t Choose The Wrong Box 😱
00:41
Topper Guild
Рет қаралды 62 МЛН
How to treat Acne💉
00:31
ISSEI / いっせい
Рет қаралды 108 МЛН
Каха и дочка
00:28
К-Media
Рет қаралды 3,4 МЛН
СИНИЙ ИНЕЙ УЖЕ ВЫШЕЛ!❄️
01:01
DO$HIK
Рет қаралды 3,3 МЛН
Multifactor Designs in DESeq2
21:58
DnA lab short read sequencing workshop
Рет қаралды 10 М.
An introduction to Snakemake tutorial for beginners (CC248)
30:12
Riffomonas Project
Рет қаралды 19 М.
Variance Stabilizing Transformations | Theory and Application in R
18:20
Why Negative Binomial is used in DESeq2?
8:09
LiquidBrain Bioinformatics
Рет қаралды 14 М.
Statistics for Genomics: Normalization
42:34
Rafael Irizarry
Рет қаралды 12 М.
How to rarefy community data in R with vegan and the tidyverse (CC200)
23:00
Data Transformation With Example | Box-Cox Transformation
16:36
LEARN & APPLY : Lean and Six Sigma
Рет қаралды 21 М.
Don’t Choose The Wrong Box 😱
00:41
Topper Guild
Рет қаралды 62 МЛН