Cancer Somatic Mutation Analysis | MAFtools R Package

  Рет қаралды 9,820

LiquidBrain Bioinformatics

LiquidBrain Bioinformatics

Күн бұрын

Пікірлер: 21
@md.naiemhossain483
@md.naiemhossain483 Жыл бұрын
Thanks for making such kinds of videos. Could you please make video of tumor and normal sample variant calling in vcf files and conversion of vcf files to maf filed?
@prabirsaha688
@prabirsaha688 3 ай бұрын
Thanks for the information. How to visulaize CNV amplification data like how can i add amplification status like Amplification or Loss inside the input file? and amplificaation status want to see along with SNV.
@Dezzy7model
@Dezzy7model Жыл бұрын
Hey, Thanks for your work and efforts...But pls can you give a detailed explanation on how to convert vcf to maf file (maybe make a video and attach link to the reply). The link you suggested uses another programming language(?) Perl, and some of us are not familiar with that. And yes i have looked on how to run perl scripts in R studio but it crashes...when i attempt. Thanks in advance.
@taiwankyh
@taiwankyh 2 жыл бұрын
Thanks for the tutorial, and it is really helpful. Since I am a beginner, could you provide some more information for reference?
@LiquidBrain
@LiquidBrain 2 жыл бұрын
For reference do you mean this package specifically? I think their documentation is the best way to under the logic behind each of their function www.bioconductor.org/packages/devel/bioc/vignettes/maftools/inst/doc/maftools.html
@yasmeendodin3506
@yasmeendodin3506 2 жыл бұрын
Thanks a lot for the tutorial. I have a question related to another TCGA cohort which is TCGA-LUAD. I am interested in exploring differences between KRAS+ LUAD and KRAS wild-type LUAD. I have downloaded the MAF file and filtered it to contain only KRAS mutations and I got 163 KRAS mutated samples. Then I downloaded the gene expression data for the whole cohort (around 537 samples). To perform the analyses I need to divide the 537 samples into two groups; KRAS+ and KRAS wild type. I know that from the 537 samples I have 163 KRAS mutated samples (from the MAF file), but what about the remaining 374 samples? Can I consider them as KRAS wild type because they are not available within the MAF file? or does this mean that we don't know their mutation status because they are not tested for the KRAS mutation? is there any way through which I can get the samples that don't contain the KRAS mutation (wild type)? Many thanks in advance
@LiquidBrain
@LiquidBrain 2 жыл бұрын
To be really honest I am not very familiar with this cohort and this specific mutation you are working on. In theory those that have not shown the mutation can be treated as wildtype since it is not detected in the sequencing data from the patients. For KRAS in TCGA-LUAD however, I am not too confident to say, just that, in general this should be the case.
@yasmeendodin3506
@yasmeendodin3506 2 жыл бұрын
Thank you for the reply. Much appreciated!
@MrEgol
@MrEgol 2 жыл бұрын
Very good, thank you!
@YM-wo9pl
@YM-wo9pl 2 жыл бұрын
Thanks for great tutorial. This is an example of reading TCGA-LAML data from maftools. laml.maf = system.file('extdata', 'tcga_laml.maf.gz', package = 'maftools') laml.clin = system.file('extdata', 'tcga_laml_annot.tsv', package = 'maftools') laml = read.maf(maf = laml.maf, clinicalData = laml.clin) This example worked well. I want to analyze melanoma data, so I changed skcm.maf = system.file('extdata', 'tcga_skcm.maf.gz', package = 'maftools') but it does not work well. Please let me know if you know how to solve this problem.
@LiquidBrain
@LiquidBrain 2 жыл бұрын
system.file() only extract files that exists in the package, if you need the TCGA-SKCM, you might need to download the file directly from TCGA using the TCGAbiolinks package. I have paste an example code here, but you will still need to check the GDC portal for the exact file available on the database I have also another video explaining how to download the data here (kzbin.info/www/bejne/sGSZl6hnbJ2df7M), check the preparing data section. query
@adamnagy4544
@adamnagy4544 2 жыл бұрын
Thank you for your work! It is possible to upload the R file and link for the data?
@LiquidBrain
@LiquidBrain 2 жыл бұрын
oops, you can check out the script here: github.com/brandonyph/LiquidBrain_Scripts
@adamnagy4544
@adamnagy4544 2 жыл бұрын
@@LiquidBrain thank you!! :3
@sinugeorge9485
@sinugeorge9485 2 жыл бұрын
oral
@LiquidBrain
@LiquidBrain 2 жыл бұрын
Hi it seems the "oc.maf.gz" does not exist in the package I downloaded from Bioconductor. did you download this maf from external sources?
@sinumolgeorge9681
@sinumolgeorge9681 2 жыл бұрын
@@LiquidBrain I already have this maf file in the working directory.
@LiquidBrain
@LiquidBrain 2 жыл бұрын
system.file() will load the file that are contained in the package. since you already have the maf files in the current dir, you can simply load in the maf tools using the normal readmaf() function, try this (just make sure your filename is exactly "oc.maf.gz" oral
@divyaagrawal6740
@divyaagrawal6740 Жыл бұрын
againt the GDC is upgraded so your code is not supportive now
@sinugeorge9485
@sinugeorge9485 Жыл бұрын
Hi, i am using the below commands: >icgc =system.file("extdata", "ssm_orca_In.tsv.gz", package="maftools") >maf
@krushnachChandra
@krushnachChandra 10 ай бұрын
may be update the library
Finding Somatic Mutations in Cancer Genome Sequence Data
1:50:01
Bioinformatics DotCa
Рет қаралды 4 М.
Running Basic Statistical Analysis in R
22:40
LiquidBrain Bioinformatics
Рет қаралды 83 М.
Car Bubble vs Lamborghini
00:33
Stokes Twins
Рет қаралды 45 МЛН
How To Choose Mac N Cheese Date Night.. 🧀
00:58
Jojo Sim
Рет қаралды 82 МЛН
За кого болели?😂
00:18
МЯТНАЯ ФАНТА
Рет қаралды 2,7 МЛН
How to check the frequencies of gene mutations in TCGA cancer database [R]
23:12
LiquidBrain Bioinformatics
Рет қаралды 6 М.
5 genomics file formats you must know
19:10
OMGenomics
Рет қаралды 25 М.
How to read Kaplan-Meier plots
46:36
Vinay Prasad MD MPH
Рет қаралды 32 М.
Gene Set Enrichment Analysis| GSEA algorithm
18:09
LiquidBrain Bioinformatics
Рет қаралды 15 М.
Genomic Mutation Analysis in GDC 2.0 - April 22, 2024 GDC Monthly Webinar
52:48
NCI Scientific Events and Resources
Рет қаралды 208
Descriptive Statistics in R
25:51
David Caughlin
Рет қаралды 54 М.
Survival Analysis on Cancer data | RStudio Tutorial
35:15
LiquidBrain Bioinformatics
Рет қаралды 14 М.
STAT115 Chapter 23.3 Cancer Mutation Patterns
9:02
Xiaole Shirley Liu
Рет қаралды 2 М.
Genomic Files Formats and IGV
2:35:24
BRC RU
Рет қаралды 2,2 М.
Car Bubble vs Lamborghini
00:33
Stokes Twins
Рет қаралды 45 МЛН