UoA ML Seminar: William Stafford Noble - DL applications in MS proteomics and single-cell genomics

  Рет қаралды 859

Machine Learning Group - University of Auckland

Machine Learning Group - University of Auckland

2 ай бұрын

William Stafford Noble - Deep learning applications in mass spectrometry proteomics and single-cell genomics
William Stafford Noble is a Professor in the Department of Genome Sciences and in the Paul G. Allen School of Computer Science and Engineering at the University of Washington. He received the Ph.D. in computer science and cognitive science from University of California, San Diego in 1998. Dr. Noble’s research applies statistical and machine learning methods to the analysis of complex biological data sets. He is the author of more than 300 peer reviewed publications and has advised 34 postdoctoral fellows and 24 PhD students. William is the recipient of the International Society for Computational Biology Innovator award, is on the Clarivate Analytics list of “Highly cited researchers,” and is a Fellow and former member of the Board of Directors of the ISCB.
In this talk, I will describe several recent and ongoing projects that apply deep neural networks to the analysis of large compendia of biological data. The first two projects operate on protein tandem mass spectrometry data. We first show how a Siamese architecture can be trained in a supervised fashion to embed individual mass spectra into a 32-dimensional space, yielding a compact representation that enables large-scale, highly accurate clustering of the spectra and significantly enhancing our ability to assign observed spectra to their corresponding peptide sequences. The second project uses a model with a transformer architecture to perform de novo peptide sequencing, by translating directly from a mass spectrum (a sequence of peaks) to a peptide (a sequence of amino acids). The resulting model, trained from 30 million spectra, outperforms existing methods and enhances our ability to interpret various types of mass spectrometry data. Finally, the third project aims to jointly analyze two types of single-cell genomics data, one measuring gene expression (scRNA-seq) and a second measure local chromatin structure (scATAC-seq). The model is trained using a combination of co-assay data and traditional “single-assay” data, first learning an autoencoder for each data modality and then using just the co-assay data to train a translator between the embedded representations learned by the autoencoders. The resulting model is able to translate between modalities with improved accuracy relative to state-of-the-art translation techniques and also produces a matching of cells across modalities.

Пікірлер: 1
UoA ML Seminar: Xia Ning - Modof: A Deep Generative Model for Molecule Optimization
1:02:19
Machine Learning Group - University of Auckland
Рет қаралды 589
I Built a Shelter House For myself and Сat🐱📦🏠
00:35
TooTool
Рет қаралды 25 МЛН
Clustering and Markers Identification for ScRNA-Seq | Seurat Package Tutorial
23:16
LiquidBrain Bioinformatics
Рет қаралды 25 М.
Lee Hartwell (Fred Hutch): Understanding the Cell Cycle Regulators
21:54
iBiology Science Stories
Рет қаралды 7 М.
AI vs ML vs DL vs Generative Ai
16:00
Krish Naik
Рет қаралды 27 М.
Calling Bullshit 7.4: A Replication Crisis
7:23
UW iSchool
Рет қаралды 19 М.
Calling Bullshit 5.1: Big Data
11:17
UW iSchool
Рет қаралды 16 М.
MIT Introduction to Deep Learning | 6.S191
1:09:58
Alexander Amini
Рет қаралды 235 М.
Difference Between Bioinformatics and Computational Biology
9:06
Main filter..
0:15
CikoYt
Рет қаралды 1,2 МЛН
Очень странные дела PS 4 Pro
1:00
ТЕХНОБЛОГ ГУБАРЕВ СЕРГЕЙ
Рет қаралды 430 М.