Design of highly functional genome editors by modeling the universe of CRISPR-Cas sequences

  Рет қаралды 705

ML for protein engineering seminar series

ML for protein engineering seminar series

Ай бұрын

Design of highly functional genome editors by modeling the universe of CRISPR-Cas sequences
Tuesday May 7th, 4-5pm EST | Jeff Ruffolo, PhD and Stephen Nayfach, PhD
Abstract: Gene editing has the potential to solve fundamental challenges in agriculture, biotechnology, and human health. CRISPR-based gene editors derived from microbes, while powerful, often show significant functional tradeoffs when ported into non-native environments, such as human cells. Artificial intelligence (AI) enabled design provides a powerful alternative with potential to bypass evolutionary constraints and generate editors with optimal properties. Here, using large language models (LLMs) trained on biological diversity at scale, we demonstrate the first successful precision editing of the human genome with a programmable gene editor designed with AI. To achieve this goal, we curated a dataset of over one million CRISPR operons through systematic mining of 26 terabases of assembled genomes and meta-genomes. We demonstrate the capacity of our models by generating 4.8x the number of protein clusters across CRISPR-Cas families found in nature and tailoring single-guide RNA sequences for Cas9-like effector proteins. We experimentally tested 209 of our novel Cas9-like proteins in human cells with a hit-rate of 63%. Our most performant gene editor, denoted as OpenCRISPR-1, demonstrates comparable activity and markedly improved specificity relative to SpCas9, the prototypical gene editor, while being 400+ mutations away in sequence. Our results highlight the power of LLMs for generating novel proteins with complex functional activities beyond the scope of structure-based design methods.
Preprint: www.biorxiv.org/content/10.11...

Пікірлер
Protein generation with evolutionary diffusion: sequence is all you need
59:46
ML for protein engineering seminar series
Рет қаралды 3,7 М.
Sequence modeling and design from molecular to genome scale with Evo
1:06:46
ML for protein engineering seminar series
Рет қаралды 1 М.
Sprinting with More and More Money
00:29
MrBeast
Рет қаралды 180 МЛН
I Built a Shelter House For myself and Сat🐱📦🏠
00:35
TooTool
Рет қаралды 34 МЛН
Designing and Executing Prime Editing Experiments in Mammalian Cells
1:11:12
Bio-protocol Webinars
Рет қаралды 3,9 М.
But what is CRISPR-Cas9? An animated introduction to Gene Editing. #some2
10:02
Powerhouse of the Cell
Рет қаралды 302 М.
HSV Update February 2, 2024
58:45
Fred Hutch
Рет қаралды 56 М.
16. Recombinant DNA, Cloning, & Editing
52:00
MIT OpenCourseWare
Рет қаралды 65 М.
PTM-Mamba: A PTM-Aware Protein Language Model with Bidirectional Gated Mamba Blocks
46:47
ML for protein engineering seminar series
Рет қаралды 424
Jennifer Doudna: CRISPR Basics
48:26
Innovative Genomics Institute – IGI
Рет қаралды 355 М.
Protein Design with Guided Discrete Diffusion
58:05
ML for protein engineering seminar series
Рет қаралды 1,8 М.
SAMPLE: A Self-Driving Laboratory System for Protein Engineering
46:35
ML for protein engineering seminar series
Рет қаралды 1,4 М.