Demo: bamSliceR: Estimation of Transcripts origin of Variants by a Bayesian Approach Using RNA-seq

  Рет қаралды 20

Bioconductor

Bioconductor

Күн бұрын

bamSliceR: Estimation of Transcripts origin of Variants by a Bayesian Approach Using RNA-sequencing data.
Understanding the transcript architecture of genes is crucial for annotating variants determined by raw read counts from RNA-sequencing data. Overlooking or biasing the transcript-specific context may lead to the misinterpretation of the disease-associated effects of these variants. For example, in 2020, Schoch et al. Genet Med highlights a epilepsy patient harboring pathogenic mutation in transcript NM_001323289.1 in exon 17, which were previously treat as at intronic region pass exon 18 of transcript NM_003159. Furthermore, identifying genetic variants within specific transcripts is highly relevant for understanding biological consequences, such as how these alternative mRNAs or their products interact with disease-related perturbations.
Cloud-based repositories like NCI Genomic Data Commons (GDC) harmonized DNA/RNA sequencing data from large translational research consortia provide resources for us to estimate the Variant Allele Frenquency (VAF) of variants in different stats from genome to specific-transcripts being expressed.
We previously developed the bamSliceR package to perform coordinate- or range-based extraction of genomic aligned DNA and RNA reads from the NCI Genomic Data Commons (GDC), followed by downstream analysis of variant effects using the robust Bioconductor ecosystem. We also demonstrate that this all-in-one toolset can be applied on public available or local sequencing data to reduce time, space, and computational burden. Here we want to highlight the new utilities we implemented in the bamSliceR to facilitate transcript-aware variants annotation using transcriptome BAM that generated by aligning reads from RNA-seq to reference transcripts sequences (GENCODE v36). For each variants, raw read counts of reference and alternative allele are tallied for all compatible transcripts of the gene. Read names of the variant that support each transcripts are also collected to provide weighted evidence based on degree of reads mapped to multiple transcripts. We then use a bayesian model comparison approach to estimate the transcript-specific VAF.
PeterHuang

Пікірлер
Short Talks 3, Day 2
49:24
Bioconductor
Рет қаралды 81
Cute
00:16
Oyuncak Avı
Рет қаралды 10 МЛН
Amazing Parenting Hacks! 👶✨ #ParentingTips #LifeHacks
00:18
Snack Chat
Рет қаралды 16 МЛН
Short Talks 5, Day 3
52:23
Bioconductor
Рет қаралды 28
AI can't cross this line and we don't know why.
24:07
Welch Labs
Рет қаралды 469 М.
Fitting Non-identifiable Models using Bayesian Inference
1:57:59
Demo: Unraveling Immunogenomic Diversity in Single Cell Data
47:35
GEOMETRIC DEEP LEARNING BLUEPRINT
3:33:23
Machine Learning Street Talk
Рет қаралды 180 М.
Short Talks 1, Day 1
52:23
Bioconductor
Рет қаралды 26
Errichto Stream, POI 22/1
3:55:08
Errichto Algorithms
Рет қаралды 155 М.
Short Talks 2, Day 1
56:52
Bioconductor
Рет қаралды 10
Cute
00:16
Oyuncak Avı
Рет қаралды 10 МЛН