MusICA Seminar: Julian Parker - Recent advances in generative modelling of musical audio

849 views

AAG Edinburgh


Comments: 3
@MattJackson808 7 months ago
nice presentation, Julian
@MattJackson808 7 months ago
Silly question, but how does vector quantization, which reduces the encoded floats to quantized ints, give a representation of music that can be treated more like text? In other words, speaking abstractly, how is a constant stream of ints more text-like than a vector of floats? Do you mean that because we already use ASCII, i.e. a stream of ints, for language on computers, that is the natural form, rather than encoding language into tokens? - Matt Jackson
@julian_d_parker 7 months ago
@MattJackson808 Yes, I probably should have explained that distinction better. Text (at least in the context of transformer-based models like LLMs) is usually represented as a stream of integer 'tokens', which are derived from the input text using a tokenizer (usually not with a 1:1 correspondence to characters, but rather to common groups of characters). LLMs learn the categorical probability distribution over this discrete set of tokens given the previous tokens, very much like an advanced autocomplete. You could do the same with float vectors, but it doesn't usually work as well, because you have to make assumptions about the continuous distribution, which results in a much less expressive model. There are also a bunch of nitty-gritty architectural reasons why integer tokens work well with transformers. So, ignoring the deeper philosophical aspect, the answer is basically "ints work better for text in practice".
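To make the distinction in the reply above concrete, here is a minimal sketch in plain Python. It is not taken from the talk: the codebook, the frame vectors, and the logits are all made-up values, and `quantize` and `softmax` are illustrative helper names. It shows float vectors being mapped to integer tokens by nearest-codebook lookup, and a categorical distribution over those tokens that can hold two equally likely options at once.

```python
import math


def quantize(vec, codebook):
    """Return the index of the codebook entry nearest to vec (squared Euclidean distance)."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(range(len(codebook)), key=lambda i: sq_dist(vec, codebook[i]))


def softmax(logits):
    """Turn unnormalised scores into a categorical probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical 2-D codebook with four entries; a real audio codec would learn these.
codebook = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]

# Hypothetical encoder outputs: one float vector per audio frame.
frames = [(0.1, -0.2), (0.9, 0.1), (0.2, 0.8), (1.1, 0.9)]

# Vector quantization: each float vector becomes one integer token.
tokens = [quantize(f, codebook) for f in frames]

# Made-up next-token scores in which tokens 0 and 3 are equally plausible.
logits = [2.0, -1.0, -1.0, 2.0]
probs = softmax(logits)

# A single-Gaussian guess over the float vectors would centre on the mean of the
# two plausible codebook entries, which matches neither of them.
gaussian_mean = tuple((a + b) / 2 for a, b in zip(codebook[0], codebook[3]))

print(tokens)
print([round(p, 3) for p in probs])
print(gaussian_mean)
```

The categorical distribution puts most of its mass on tokens 0 and 3 and very little elsewhere, whereas the mean of the two corresponding float vectors lands between them. This is one simple way to read "you have to make assumptions about the continuous distribution"; richer continuous models exist, so treat it as an illustration rather than the full argument.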