Neural audio timbre transfer with RAVE trained on Amen breaks using nn~ in Pure Data

2,247 views

MARTSM_N /// martsman

A day ago

Comments: 16
@marvbordello6047 9 months ago
dope
@ShihWeiChieh 9 months ago
Hey, I really want to install nn~ in Max and learn how to train my model on gigabytes of data. Do you use Google Colab, or is there a way to avoid it? Thank you so much, I'm super envious!
@martsm_n 9 months ago
Colab or Kaggle are solutions if you can't or don't want to train on your local machine. I've set up notebooks for these platforms, you can find them here: github.com/devstermarts/Notebooks
@ShihWeiChieh 9 months ago
@martsm_n Thanks a lot! Super neat notebook, and I think the training started successfully! It's at epoch 22 now. Do you know how to find out the maximum number of epochs?
@martsm_n 9 months ago
@ShihWeiChieh Training time is highly dependent on dataset, parameters and purpose. I recommend joining the RAVE Discord channel and asking around there for hints and intuitions.
@ShihWeiChieh 9 months ago
@martsm_n OK, I will. Thank you, and keep up the good work!
@ShihWeiChieh 9 months ago
I got some help from the Discord and I have my first .ts file trained. I guess I have to train my prior model next for Max/MSP and Pd usage (because the first .ts file is not compatible with my nn~ object). So I started your MSPRIOR notebook, but I got stuck at the training cell: "ValueError: cannot reshape array of size 131072 into shape ()". Can you help me with this?
@MFRSIAM 6 months ago
How are you doing the latent space manipulation? I see you are using a ResNet, but I'm not sure what's going on here.
@martsm_n 6 months ago
This is actually footage from very early and cautious interventions. As far as I can recall, it's not much more than altering the spread of the latents in each given dimension; in Pd, that amounts to amplitude modulation of a signal value. You can find more videos and techniques of latent intervention in my Neural Audio playlist: kzbin.info/aero/PLmNfQif1XJQ0aNdrqk8HxvwUxq90xc9al
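For readers who think in code rather than patches, the per-dimension "spread" change described in the reply above can be sketched roughly as follows. This is only an illustration, not the patch from the video: it assumes the latents are roughly centered on zero (so a plain multiplication widens or narrows their spread), and the function name `scale_latents`, the gain values and the sample data are all made up for the example.

```python
def scale_latents(latents, gains):
    """Multiply every value of each latent dimension by that dimension's gain.

    latents: list of dimensions, each a list of float values over time
             (hypothetical stand-in for the latent signals nn~ outputs).
    gains:   one multiplier per dimension; 1.0 leaves a dimension unchanged,
             values above 1.0 widen its spread, values below 1.0 narrow it.
    """
    if len(latents) != len(gains):
        raise ValueError("need exactly one gain per latent dimension")
    return [[gain * z for z in dim] for dim, gain in zip(latents, gains)]


# Made-up example: two latent dimensions, three time steps each.
latents = [[0.5, -1.0, 0.25], [1.0, 2.0, -2.0]]
gains = [2.0, 0.5]  # widen dimension 0, narrow dimension 1
scaled = scale_latents(latents, gains)
print(scaled)
```

In a Pd patch the same idea would be a signal multiplication placed on each latent stream between the model's encoder and decoder, with the gain driven by a slider or another signal.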
@davidevitturini5837 5 months ago
Is the model available anywhere?
@martsm_n 5 months ago
No. I didn't release it. You can train it quite easily yourself, however.
@davidevitturini5837 5 months ago
@martsm_n Did you use V1 or V2? On V2, with a dataset of around 6 hours and two 3060 12 GB cards, it's taking around 1 week for 1M epochs (I'm using a batch size of 36). [Do you have a Discord for discussing this?]
@martsm_n 5 months ago
@davidevitturini5837 This one is a V2 model with default regularization. A different model, trained with V1, was used in this video: kzbin.info/www/bejne/gpixnoyLf8aGn5Y
@davidevitturini5837 5 months ago
@martsm_n Thanks, I really enjoyed your creativity.