STAT115 Chapter 3.6 SAM and BAM files

  Рет қаралды 14,766

Xiaole Shirley Liu

Xiaole Shirley Liu

Күн бұрын

Пікірлер: 11
@c.p.8689
@c.p.8689 Жыл бұрын
Thank you! You are great!
@bhayj
@bhayj 4 жыл бұрын
thanks a lot! very informative and fast explanation
@elliottkillian
@elliottkillian 4 жыл бұрын
Great info!
@tinacole1450
@tinacole1450 2 жыл бұрын
Like your explanation. Do you know how to annotate a sam file? Can you send a link?
@rezomgeladze5750
@rezomgeladze5750 2 жыл бұрын
It will be great if you add links for appropriate information in comments. Thanks!
@haroonzeb7087
@haroonzeb7087 3 жыл бұрын
hi , how to create .bam and .bai files
@loganchen7889
@loganchen7889 3 жыл бұрын
Usually, aligners will create the '.bam' for you, as well as the '.bai' file. BWA as an example, that only generate 'sam' file, which you can use the SAMTOOLS to covert it to the '.bam' file, also used the SAMTOOLS to create the '.bai' file. bwa xxxxxx (map command) | samtools view -BST - -o xxxx.bam then, samtools index xxxx.bam Hope this helps.
@haroonzeb7087
@haroonzeb7087 3 жыл бұрын
@@loganchen7889 thanks my 2nd question is how to create vcf file from fast or fastq files .is it necessary to first go through bcf tools or direct way to create vcf file or is it . mandatory first to have bcf file then vcf . please elaborate the answer with syntax and example
@loganchen7889
@loganchen7889 3 жыл бұрын
@@haroonzeb7087 I think what you have encountered is about variants calling. The VCF format was used to store the variants information, including contig (chromosome), location, reference, alternative base, and other related information. The simplest way to get vcf file from the raw fastq/fasta file should include two processes. 1. Mapping: align the sequences in the fasta/fastq file to the genome. 2. Variants calling: use a variant calling algorithm, deepvariant (mentioned by Shirely), GATK (widely used) to call the variants. The default output format is VCF. What you mentioned, bcf, is a binary format of VCF, if I remember correctly. Maybe, the examples of this process will be presented in the further videos, I am not sure, as I am also an audience of the course. Hope this message helps.
@haroonzeb7087
@haroonzeb7087 3 жыл бұрын
@@loganchen7889 absolutely i know VCF .but how to create VCF file is it necessary for the creation only BCF tools or BCF syntax is used as mandatory .and could you shed light on if VCF is created via bcf tools then or any other yours recommendationfor the creation of VCF file thanks in advance
@loganchen7889
@loganchen7889 3 жыл бұрын
@@haroonzeb7087 I am not sure if I understand correctly. I think many other tools, other than bcftools, which you mentioned, gatk, deepvariant, which I mentioned before, could create vcf/bcf. "The relationship between BCF and VCF is similar to that between BAM and SAM." (evomics.org/vcf-and-bcf/). I am not familiar with the bcftools, I don't know if anybody still used it to call variants. There are best-practice pipelines for GATK on both somatic and germline variants calling, you can refer (gatk.broadinstitute.org/hc/en-us/articles/360035894731-Somatic-short-variant-discovery-SNVs-Indels-) and (gatk.broadinstitute.org/hc/en-us/articles/360035535932-Germline-short-variant-discovery-SNPs-Indels-).
2020 STAT115 Lect3.1 RNA-seq Experimental Design
24:33
Xiaole Shirley Liu
Рет қаралды 7 М.
Я сделала самое маленькое в мире мороженое!
00:43
Кушать Хочу
Рет қаралды 4,7 МЛН
Don't look down on anyone#devil  #lilith  #funny  #shorts
00:12
Devil Lilith
Рет қаралды 47 МЛН
🕊️Valera🕊️
00:34
DO$HIK
Рет қаралды 9 МЛН
Life hack 😂 Watermelon magic box! #shorts by Leisi Crazy
00:17
Leisi Crazy
Рет қаралды 80 МЛН
Understanding SAM/BAM file specifications
18:20
LiquidBrain Bioinformatics
Рет қаралды 19 М.
An Introduction to HDF5
10:24
hdf5
Рет қаралды 29 М.
FastQC tool for read data quality eval
9:36
Loren Launen
Рет қаралды 26 М.
STAT115 Chapter 3.2 FASTQ and FASTQC
13:13
Xiaole Shirley Liu
Рет қаралды 17 М.
5 genomics file formats you must know
19:10
OMGenomics
Рет қаралды 25 М.
Understanding File Formats in Bioinformatics: VCF and gVCF
25:40
Bioinformagician
Рет қаралды 12 М.
Bioinformatics Coffee Hour: Samtools
22:57
Harvard FAS Informatics
Рет қаралды 5 М.
5 Python Bioinformatics Libraries YOU Should Know
13:34
Base Call
Рет қаралды 3,7 М.
ADS1: Reads in FASTQ format
4:40
Ben Langmead
Рет қаралды 34 М.
Paired End vs. Single Run Sequencing
4:58
MakeTheBrainHappy - Scientific Exploration
Рет қаралды 22 М.
Я сделала самое маленькое в мире мороженое!
00:43
Кушать Хочу
Рет қаралды 4,7 МЛН