Thorsten-Voice

Thorsten-Voice

Guude! (hi, nice to see you) 👋,

i'm Thorsten 😊.

You like open source, privacy aware and local running voice technology? Me too 😎. You'll find cooking recipe like tutorials on TTS, STT, Voice Assistants, AI, ML and way more cool stuff here. So, hop on and join my amazing community 🥰.

#opensource #voice #cloning #technology #news #tutorial #local #privacy #tech #tts #stt #voiceassistant #raspberrypi #smarthome #homeassistant

* My project website: www.Thorsten-Voice.de
* Me on GitHub: github.com/thorstenMueller

🎙️ Home Assistant Voice Preview Edition (VPE) #02 | First Setup & Connection 🔌

9:10

🎙️ Home Assistant Voice Preview Edition (VPE) #02 | First Setup & Connection 🔌

7 сағат бұрын

🎙️ Home Assistant Voice Preview Edition (VPE) #01 | Unboxing & Tech Specs 📦

10:14

🎙️ Home Assistant Voice Preview Edition (VPE) #01 | Unboxing & Tech Specs 📦

7 сағат бұрын

F5 Text to Speech Tutorial | Hit "Refresh" on Your AI Voice!

24:45

F5 Text to Speech Tutorial | Hit "Refresh" on Your AI Voice!

Ай бұрын

3 steps to run HuggingFace 🤗 "Parler TTS" AI Voice on your local machine

18:42

3 steps to run HuggingFace 🤗 "Parler TTS" AI Voice on your local machine

2 ай бұрын

Best AI Voice Generator | 2024.08

44:54

Best AI Voice Generator | 2024.08

4 ай бұрын

Automate Voice Dataset Creation Using Whisper AI

10:23

Automate Voice Dataset Creation Using Whisper AI

5 ай бұрын

TTS Voice Dataset | LJSpeech | Voice Cloning

13:02

TTS Voice Dataset | LJSpeech | Voice Cloning

6 ай бұрын

Unlock AI Superpowers with NVIDIA CUDA: Boost Performance in Python!

27:03

Unlock AI Superpowers with NVIDIA CUDA: Boost Performance in Python!

6 ай бұрын

Home Assistant ❤️ Voice - Tutorial 05 - Wyoming protocol

11:53

Home Assistant ❤️ Voice - Tutorial 05 - Wyoming protocol

9 ай бұрын

Home Assistant ❤️ Voice - Tutorial 04 - Piper TTS

10:12

Home Assistant ❤️ Voice - Tutorial 04 - Piper TTS

9 ай бұрын

Home Assistant ❤️ Voice - Tutorial 03 - Conversation / NLP

6:54

Home Assistant ❤️ Voice - Tutorial 03 - Conversation / NLP

9 ай бұрын

Home Assistant ❤️ Voice - Tutorial 02 - Text Assist

9:20

Home Assistant ❤️ Voice - Tutorial 02 - Text Assist

9 ай бұрын

Home Assistant ❤️ Voice - Tutorial 01 - Basic setup & demo entities

7:07

Home Assistant ❤️ Voice - Tutorial 01 - Basic setup & demo entities

9 ай бұрын

Running a local Piper TTS server with Python on Linux

17:35

Running a local Piper TTS server with Python on Linux

10 ай бұрын

🔥 Voice interview Michael Hansen | HA | Raspberry | Piper | Rhasspy

1:04:38

🔥 Voice interview Michael Hansen | HA | Raspberry | Piper | Rhasspy

10 ай бұрын

Local voice cloning with 6 seconds audio | Coqui XTTS on Windows

20:22

Local voice cloning with 6 seconds audio | Coqui XTTS on Windows

Жыл бұрын

🇩🇪 Künstliche Sprachausgabe uff Hessisch | Kostenlos und OHNE CLOUD !

16:01

🇩🇪 Künstliche Sprachausgabe uff Hessisch | Kostenlos und OHNE CLOUD !

Жыл бұрын

TEXT TO SPEECH | Piper TTS on Windows 🚀 AI voice 10x faster Realtime!

16:31

TEXT TO SPEECH | Piper TTS on Windows 🚀 AI voice 10x faster Realtime!

Жыл бұрын

XTTS FAQ | Interview with Josh Meyer from Coqui AI

41:50

XTTS FAQ | Interview with Josh Meyer from Coqui AI

Жыл бұрын

Python virtual environment / venv | Windows, Linux & Mac OS X

24:15

Python virtual environment / venv | Windows, Linux & Mac OS X

Жыл бұрын

Free voice recording for BEST voice cloning | Piper-Recording-Studio | Windows

18:10

Free voice recording for BEST voice cloning | Piper-Recording-Studio | Windows

Жыл бұрын

Is Mycroft Mark 2 the better Alexa?! | Private | Voice Assistant

17:26

Is Mycroft Mark 2 the better Alexa?! | Private | Voice Assistant

Жыл бұрын

Create your AI digital voice clone locally with Piper TTS | Tutorial

27:43

Create your AI digital voice clone locally with Piper TTS | Tutorial

Жыл бұрын

Increase Text to Speech pronunciation quality with eSpeak | Tutorial

13:40

Increase Text to Speech pronunciation quality with eSpeak | Tutorial

Жыл бұрын

Talk locally (no ChatGPT) with your documents 😄 | PrivateGPT + Whisper + Coqui TTS

11:56

Talk locally (no ChatGPT) with your documents 😄 | PrivateGPT + Whisper + Coqui TTS

Жыл бұрын

Raspberry Pi | Local TTS | High Quality | Faster Realtime with Piper TTS

5:03

Raspberry Pi | Local TTS | High Quality | Faster Realtime with Piper TTS

Жыл бұрын

Thorsten-Voice TTS in Windows nutzen | DDC / VITS

16:10

Thorsten-Voice TTS in Windows nutzen | DDC / VITS

Жыл бұрын

Thorsten-Voice TTS in Linux nutzen | DDC / VITS / Piper

13:20

Thorsten-Voice TTS in Linux nutzen | DDC / VITS / Piper

Жыл бұрын

Thorsten-Voice TTS in Mac OS X nutzen | DDC / VITS

13:07

Thorsten-Voice TTS in Mac OS X nutzen | DDC / VITS

Жыл бұрын

Пікірлер

@alx8439 10 сағат бұрын

Yeah would be interesting to see if it can be used with other software as well

@font_net 3 күн бұрын

Hello, I want to make a text-to-sound conversion model for Farsi, which videos should I watch now, where can I contact you?

@font_net 3 күн бұрын

Oh, I found your LinkedIn

@alx8439 3 күн бұрын

What is the device with microphone, button and led round light is on your table? Can you please give some more details?

@DichtMe 3 күн бұрын

That is what the video is about. Watch his previous videos.

@alx8439 Күн бұрын

@DichtMe thanks a lot. Will do

@alx8439 3 күн бұрын

New nvidia minicomputer (jetson if I'm not mistaken) which was released just recently is a good replacement for regular bulky PC with full sized GPU card.

@ianicius 5 күн бұрын

Can I use it for German?

@ThorstenMueller

@ThorstenMueller 2 күн бұрын

Hello, IMHO currently not. But you can use my german Thorsten-Voice in Piper or Coqui 😉 (thorsten-voice.de/).

@rabeemohammed5351

@rabeemohammed5351 7 күн бұрын

what is language suport and can help me for give me inforation about modle support language arabic

@MuhammadShahid-bl5hh

@MuhammadShahid-bl5hh 7 күн бұрын

@ThorstenMueller I am facing this error please help me: PS C:\Users\Apple Compter\tts> pip install TTS--0.22.0 Defaulting to user installation because normal site-packages is not writeable ERROR: Could not find a version that satisfies the requirement TTS--0.22.0 (from versions: none) ERROR: No matching distribution found for TTS--0.22.0

@MuhammadShahid-bl5hh

@MuhammadShahid-bl5hh 7 күн бұрын

@Thorsten-Voice I am installing latest TTS but it not work for me please help...

@ThorstenMueller

@ThorstenMueller 2 күн бұрын

Good question. Do you run it with admin privileges? Is free diskspace available? Just thinking because of this message "installation because normal site-packages is not writeable".

@herofahimshahriargaming8288

@herofahimshahriargaming8288 7 күн бұрын

is there any way to run this in python code?

@ThorstenMueller

@ThorstenMueller 2 күн бұрын

That's an interesting question, i thougth about too. But last tine i looked at it, it was just an early codebase on python integration. According to this (github.com/rhasspy/piper/tree/master/src/python_run) there's no recent updates on that.

@jez9999 9 күн бұрын

Coqui appears to have folded now. Confusingly there is a community run fork that is sorted but its docs look very similar to the original.

@ThorstenMueller

@ThorstenMueller 2 күн бұрын

Coqui already shut down by beginning of 2024 and imho the code in the original repo is not maintained any more. I heart about a fork too but didn't have time to give it a try.

@mrechbreger 21 сағат бұрын

@@ThorstenMueller the license is garbage and prevents any further interest... why should anyone keep developing it if he cannot use it for further commercial projects...

@font_net 11 күн бұрын

دوستت دارم

@adamrastrand9409

@adamrastrand9409 13 күн бұрын

Hello Torsten I have heard that some languages in Piper TTS sound pretty bad for example the Swedish model like that when you train a new voice like when you find tune from the existing checkpoint mall that exists it sounds quite bad and such is that true because the default Swedish NST voice sounds very monotone but when you find tune from that will it sound like me or will it sound different just with the pronunciation errors and when you find two from scratch How many hours of speech do you need I have an RTX 40 6016 GB card so is that good for AI training and the thing is also that do I need to set up Linux and Windows at the same time and fiddle around with complicated stuff because it’s just easier to have a Windows set up And not worry about Windows for Linux so can I just do it with a command

@ThorstenMueller

@ThorstenMueller 2 күн бұрын

Hello, i only trained my german "Thorsten-Voice" tts piper voice. So i have no experience on other languages, their quality and need for training material. I used multiple hours (around 10 for finetuning my piper model), but i additionally played around with just 1000 phrases and these worked too. It's a little bit of a try'n error.

@mariuszandrzejewski655

@mariuszandrzejewski655 14 күн бұрын

This is exactly what I was looking for. Unfortunately, after I installed "tts" using "pip install tts --use-deprecated=legacy-resolver" and resolved some dependency issues, I encountered a "Bus error." Installing "tts" without additional flags causes it to take a long time because pip tries to resolve all required package versions. I tried everything, even attempting to install it with "pip install tts --no-deps" to install only the "tts" package. However, after running "tts --list_models," I still get a "Bus error." The version I have installed is 0.22.0. I am trying to install it in a Python virtual environment, of course.

@ThorstenMueller

@ThorstenMueller 2 күн бұрын

Which python version are you using? As Coqui TTS isn't actively maintained this might be a problem with too new python versions.

@mariuszandrzejewski655

@mariuszandrzejewski655 2 күн бұрын

@@ThorstenMueller Thank you for your response. I’m using python version 3.9.2. Later, I’ll try changing the python version and also run tts from a Docker container, which I haven’t tried yet, but I’ll get back to that later. I managed to find free audiobooks for the books I’m interested in. Once again, thank you for your response, and greetings from a viewer in central Poland. P. S.: Wait a minute, aren’t you the guy from AutoIT? I used to work with it years ago, and I recall seeing your name in the libraries.

@EdTimTVLive 17 күн бұрын

It is a nice and useful video. Thank you. I am looking at various options right now.

@ThorstenMueller

@ThorstenMueller 16 күн бұрын

Thanks for your nice comment 😊.

@jimmyjam77 17 күн бұрын

The voice quality is OK, but not great. Did you ever figure out a way to make it better?

@ThorstenMueller

@ThorstenMueller 16 күн бұрын

No in xtts, but (just in case you're looking for an english solution) do you know my f5 tutorial? kzbin.info/www/bejne/d4SpoIeEpdCAbtEsi=gyYl6R8W1xuKoZZM

@rvanner 18 күн бұрын

What's the best TTS for use in an Apple and Android app locally (ie no server connecting)?

@ThorstenMueller

@ThorstenMueller 2 күн бұрын

That's a good question. Honestly i have not taken a closer look to tts on smartphones so i can't tell you (yet).

@CarlinComm 20 күн бұрын

Wow, that's great, thanks for showing this! Subscribed :)

@ThorstenMueller

@ThorstenMueller 16 күн бұрын

Thanks for your nice feedback and welcome 😊.

@charlenechen2507

@charlenechen2507 20 күн бұрын

Hello Thorsten, can you have a check and review of PopPop AI text to speech?

@ThorstenMueller

@ThorstenMueller 16 күн бұрын

Thanks for your topic suggestion 😊. I've added it to my todo list.

@TimothyBakerhistorygym

@TimothyBakerhistorygym 21 күн бұрын

tried getting this to work on my own, couldn't came here, watched this twice and it's up and running. Thank you @ThorstenMueller

@ThorstenMueller

@ThorstenMueller 20 күн бұрын

Thanks for your nice feedback, happy you got it working 😊.

@funcSAGE 22 күн бұрын

can piper switch between voices? previously i have used mimic3 server and requested texts with different specific voices, can piper do the same or is it limited to the voice you start server with? Nevermind, i've found where i can make a few adjustments to the script to pass in a speaker together with a text.

@ThorstenMueller

@ThorstenMueller 20 күн бұрын

AFIK is ssml (which you are looking for) not yet supported in piper. Mimic3 was able to do it. As the developer (Mike) is the same for both projects i am optimistic that ssml will come to piper somewhere in the future.

@maraka0100 24 күн бұрын

okay sehr schön. Im Piper adon funktioniert es. In einer Automation sagt eine weibliche Stimme: set public URL in configuration. Muss da noch was rein geschrieben werden in der Config.yaml?

@ThorstenMueller

@ThorstenMueller 20 күн бұрын

Ich bin gerade nicht sicher, ob man Piper TTS Stimmen in Automatisierungen verwenden kann. Gute Frage, aber müsste ich selber erstmal testen.

@BlackOtaku_Edits

@BlackOtaku_Edits 25 күн бұрын

As a Ghanaian i'm happy to hear TTS in Twi and Ewe and Hausa

@ThorstenMueller

@ThorstenMueller 24 күн бұрын

More effort for underrepresented languages is really important to provide open voice technology for everybody. Happy you found a matching tts voice 😊.

@mercuryin1 25 күн бұрын

I tried this morning and the cloned voices are the best I have never used. I wonder if I can use the cloned voices in some way with Home Assistant through I don´t know know..piper might be ? I can´t find if this is possible to do with this software, it is only tts ? is possible to synthesise a dataset with this ? Thanks

@ThorstenMueller

@ThorstenMueller 20 күн бұрын

AFIK you can use piper tts voices in Home Assistant. But for this you have to record way more audio data to train/finetune a piper tts model. Do you know my video about piper tts voice cloning? kzbin.info/www/bejne/mJDalpKgosZlaJI

@beneadie3202 25 күн бұрын

that's really really good quality for open source

@ThorstenMueller

@ThorstenMueller 25 күн бұрын

Absolutely 👍🏻😎

@yahyajohnlorenzen4713

@yahyajohnlorenzen4713 26 күн бұрын

I like the shirt

@ThorstenMueller

@ThorstenMueller 25 күн бұрын

Thank you, i like these type of shirts 😎.

@MuhammadShahid-bl5hh

@MuhammadShahid-bl5hh 7 күн бұрын

@@ThorstenMueller I am facing this error please help me: PS C:\Users\Apple Compter\tts> pip install TTS--0.22.0 Defaulting to user installation because normal site-packages is not writeable ERROR: Could not find a version that satisfies the requirement TTS--0.22.0 (from versions: none) ERROR: No matching distribution found for TTS--0.22.0

@Queztapotel123

@Queztapotel123 28 күн бұрын

Bär-er nicht Bier-er-Token. just fyi

@ThorstenMueller

@ThorstenMueller 27 күн бұрын

Thank you 😊. I try to keep it in mind for upcoming videos.

@kardiokode-g8v

@kardiokode-g8v 28 күн бұрын

hey @ThorstenMueller, great work as always! one thing that caught my eye: you mention that the code is released under MIT licence, which is right. But i think its also important to note that usually inference code and models have different licences (which you covered on other videos!). Here the model itself has a different licence: at 3:13 you can see it on top middle and in the text under it, that the model files are CC-BY-NC-4.0 licenced, which means no commercial use. This means you can not use generated voices for anything commercial like voice overs for youtube channels or in companies. It would be great to have this information as well in your videos, since people using this in any commercial environment or a simple monetized youtube channel can bring you in trouble if the owner enforces the licence. It would be great if you could make a video with an overview of fully open and free TTS/cloning models, that allow also commercial use.. i havent seen such a list anywhere and im sure lots of people would be interested.

@ThorstenMueller

@ThorstenMueller 27 күн бұрын

Thanks for the clarification 😊. I've seen another comment asking for usage as voiceover - did you reply to this? I added your hint to the video description and linked you - hope it's okay for you. I added your video topic suggestion to my list as i think it is a great idea 👍.

@SyamsQbattar Ай бұрын

Is online Huggingface better than local?

@ThorstenMueller

@ThorstenMueller Ай бұрын

The tts model is the same. It's just the question of your local available compute power. In my case huggingface has been more performant.

@smilingstranger100

@smilingstranger100 Ай бұрын

coqui tts is shutting down

@ThorstenMueller

@ThorstenMueller Ай бұрын

Yes, sadly 😞. They already shut down by beginning of the year 2024.

@mahmoedghazy1745

@mahmoedghazy1745 Ай бұрын

first time to see the channel and the video. was talking about this app with a friend of mine who told me about it yesterday while we were in lunch break. Now i'm interested LOL . thanks for the guide

@ThorstenMueller

@ThorstenMueller Ай бұрын

Hehe, you're welcome 😊. "Sorry" for making you interested 😉.

@JubayerAhmed-f5i

@JubayerAhmed-f5i Ай бұрын

can we use it for making KZbin videos and monetize it ? i mean is legal

@PatrickAngwin Ай бұрын

I'm no expert, but from what I understand, no, because although the f5 model itself is open source and available to use commercially, the license for the dataset on which it was trained is restricted and does not allow commercial use. I would love someone to tell me I'm wrong about this as I was getting really excited about f5 until I found this out...

@ThorstenMueller

@ThorstenMueller 28 күн бұрын

I can not give any legal advices. Here (huggingface.co/SWivid/F5-TTS) is written: "2024/10/14. We change the License of this ckpt repo to CC-BY-NC-4.0 following the used training set Emilia, which is an in-the-wild dataset. Sorry for any inconvenience this may cause. Our codebase remains under the MIT license." So i guess @PatrickAngwin seems right.

@vijisrangoli Ай бұрын

OMG, you are life saver for me!! Awesome!!

@ThorstenMueller

@ThorstenMueller Ай бұрын

Wow, thanks for your kind feedback 😊.

@-bret Ай бұрын

I tired this out on a rtx 3600 12gb model and it's fast. Quicker than speaking, maybe 2x faster to process than to listen to. Sounds really good to me.

@ThorstenMueller

@ThorstenMueller Ай бұрын

Thanks for your helpful comment and performance indicator on a 3600 👍🏻.

@-bret Ай бұрын

@ThorstenMueller I should have said it's paired with a 2700 ryzen. It's a pretty cheap rig now, I think you could buy both parts used for about 300 pounds on eBay. 30 pound cpu and 270 for the gpu. Or wait a year and pick up a 3090 24gb for same price, currently sitting around 500. I did pick up a tesla 24gb I forget model number, from China for 245 which is good for really large llm. Thank you for showing me this, I have project I can purposely upgrade now.

@dtesta 7 күн бұрын

Where can I buy a 3600? I've only have a 3060...

@-bret 7 күн бұрын

@@dtesta I'm sorry, I meant rtx 3060 16gb version

@dtesta 7 күн бұрын

@-bret Cool! Where can I find that 16GB version? I only have 12GB.

@Panda_explains-007

@Panda_explains-007 Ай бұрын

can we run piper tts in gpu using cuda ?

@ThorstenMueller

@ThorstenMueller Ай бұрын

Not tried it myself. According to piper community there seems to be some active discussions on gpu/cuda support. github.com/rhasspy/piper/issues?q=is%3Aissue+cuda

@adityapatil6723

@adityapatil6723 Ай бұрын

This error originates from a subprocess, and is likely not a problem with pip. error: subprocess-exited-with-error im getting this error please help someone

@ThorstenMueller

@ThorstenMueller 29 күн бұрын

Which python version are you using? Did you update "pip" first?

@adityapatil6723

@adityapatil6723 28 күн бұрын

@@ThorstenMueller yes sir i did update it. and python is 3.11.9

@ThorstenMueller

@ThorstenMueller 27 күн бұрын

@@adityapatil6723 I originally thought python 3.11 is not supported, but according their github readme 3.11 should work. But as coqui tts isn't under active development, maybe you should try python 3.10, if this is possible for you.

@christoph9620 Ай бұрын

Hello Thorsten, thanks for your great channel. I came about these videos which shows how one can train F5 with different languages kzbin.info/www/bejne/i4CXpqaXhNSdr9U kzbin.info/www/bejne/iIK7eX6Faqtsnsk As you are experienced with training of speech models, I am wondering how much hours material would be required to train a German language model in good quality and what things should be considered in regards to training data. In the referenced youtube video the creator simply takes audiobooks. Can one expect to get a good quality model in this way?

@ThorstenMueller

@ThorstenMueller 29 күн бұрын

Hello Christoph, thanks for your nice feedback on my channel 😊. Currently f5 tts can't be trained in german, but they are working on it. github.com/SWivid/F5-TTS/issues/87#issuecomment-2418043522 For my german "Thorsten-Voice" datasets i recorded over 30k audio files, but this should not be required now.

@ernieprevost6555

@ernieprevost6555 Ай бұрын

Hi Thorsten, thank you for another excellent tutorial. I have installed f5 on a Raspberry Pi 5 and it generates very good quality output but to be expected it is very slow. I am trying to understand how f5 works, does it take a standard model and modify it in some way using the ref_text & audio before generating the desired output (gen_text)? Is there an intermediate stage that could be executed separately? Thanks Ernie

@ThorstenMueller

@ThorstenMueller 29 күн бұрын

Thanks for your nice feedback 😊. As i can't answer your question you might want to ask this question on their github repo to get (useful) responses.

@jayeshkadam14 Ай бұрын

How much time it took to train one model??? And how powerfull is your system?

@ThorstenMueller

@ThorstenMueller Ай бұрын

For my Thorsten-Voice piper models i used an nvidia jetson agx device and training 24/7 took around 2 month.

@jayeshkadam14 16 күн бұрын

@@ThorstenMueller oh my god. thats 2 months. 2 months!!!!! wtf

@edu.33 Ай бұрын

If anybody is encountering errors when installing TTS, try pip install coqui-tts (tts is deprecated as of november 2024)

@FP_95 Ай бұрын

I went down this rabbithole myself last year. Alas! Have I watched this 6min video I would've saved lotsa disk space and time (weeks). Cheers! ✌

@ThorstenMueller

@ThorstenMueller Ай бұрын

Thank you for your nice feedback and welcome at the "rabbithole" 😄.

@stefankargl Ай бұрын

Hi, Thorsten, the community thrives because of people like you - thanks for your work!

@ThorstenMueller

@ThorstenMueller Ай бұрын

Thank you for your very kind words 🥰

@ikarosound2504

@ikarosound2504 Ай бұрын

thanks! it is faesabel to do all of that trought scripted pyton code?

@ThorstenMueller

@ThorstenMueller Ай бұрын

Good point 👍🏻. I took a quick but did not see an obvious solution for native python integration.

@andtrixr3284 Ай бұрын

the greetin comes from hessen in germany right? :D funny intro and exactly what i am searching for :) abo

@ThorstenMueller

@ThorstenMueller Ай бұрын

Ei sicher 😄. Greetings back from Hessen and thanks for joining my community 😊.

@RaminAssadollahi

@RaminAssadollahi Ай бұрын

What GPU do you have on your computer?

@ThorstenMueller

@ThorstenMueller Ай бұрын

An nvidia 1050 ti in this case.

@dtesta 7 күн бұрын

@@ThorstenMueller You need RTX card for this kind of thing. Anything else would be dogshit :)

@dontmindbeingblindd

@dontmindbeingblindd Ай бұрын

May I ask what gpu you are using, or if it is using a gpu?

@RaminAssadollahi

@RaminAssadollahi Ай бұрын

when you start gradio the fist time and the model is downloading, it shows that pytorch loading the models into CPU, i'll investigate on that

@RaminAssadollahi

@RaminAssadollahi Ай бұрын

correction: I'm running it on a 1080ti, it takes 16 sec for 4 sec of speech to synthesise. Don't know, whether it's always re-analysing the reference as well.

@RaminAssadollahi

@RaminAssadollahi Ай бұрын

okay, further investigation: i let the output text the same but uploaded a longer reference, it then also takes longer to synthesise. so, the whole time is comprising reference learning as well as synthesis. would be interesting to see how much time mere synthesis would take...

@ThorstenMueller

@ThorstenMueller Ай бұрын

If you use f5 on huggingface it will use a random gpu that is available in that momoment. If you use it locally without cuda (nvidia gpu) it will use cpu.

@hashtag_ Ай бұрын

For anyone coming recently, the tts repo isn't maintained anymore according to an issue post on the github. It results in an error when running 'pip install tts'. This fork worked for me instead: 'pip install coqui-tts'

@ThorstenMueller

@ThorstenMueller Ай бұрын

Thanks for that fork hint 👍🏻. Maybe an issue with a (too new) python version.

@elexg6982 Ай бұрын

is there a way to use this with nvidia gpu on windows to speed up performance?

@ThorstenMueller

@ThorstenMueller Ай бұрын

Didn't try it myself, but there's some discussion on CUDA (nvidia gpu) on their repository. Maybe you can find additional info there 😊. github.com/rhasspy/piper/issues?q=is%3Aissue+cuda

@mogbattlesapp Ай бұрын

can this be deployed and hosted on a server?

@ThorstenMueller

@ThorstenMueller Ай бұрын

Yes, absolutely 😊.

@SimpleTechAI Ай бұрын

I tried it and it works but it did not sound like me. Nothing close to what you did. Not a fan at this time it really should have done better. Thanks for sharing you got my thumbs up...

@ThorstenMueller

@ThorstenMueller Ай бұрын

Thanks for your "thumb up" and sorry to hear it didn't work for you as expected.

@SimpleTechAI Ай бұрын

@ThorstenMueller not your fault, you laid it out perfectly. Its probably the quality of my samples. Thanks again

@FrankGraffagnino

@FrankGraffagnino Ай бұрын

great stuff!

@ei23de Ай бұрын

Haha the F5 joke😂. The progress is amazing, right? Still waiting for german support for F5... Anyway in english it is now already easy to create synthetic voice datasets for piper for example, just an idea😊

@ThorstenMueller

@ThorstenMueller Ай бұрын

H(ei) 👋, thanks for your nice comment 😊 and yes, progress is really impressive.