This guy has single handedly allowed me to work on projects and progress my life and hobbies in a way I NEVER would have been able to do without. Thank you sir!
@ZaraTanaka5 ай бұрын
great to know. but what is your hobby?, what do you do?
@Mowgi6 ай бұрын
Love your work, Jarod ❤
@Public-Freakout6 ай бұрын
Is there a way to download pretrained tortois tts models so that we can just plug them in ?
@Public-Freakout6 ай бұрын
Like a place where to download them ?
@BryanFaustino-hy8om5 ай бұрын
Hiya! And thank you for your wonderful work! Just wanted to say, I upgraded to this version and noticed an odd bug where the generated json will remove all periods except those next to a quotation mark. This causes about half the generated audio to hang at the end of a sentence or mispronounce the final word and wasn't an issue in the previous version. The solution so far is to just manually add back in the missing periods if the audio is bad and regenerate.
@Jarods_Journey5 ай бұрын
Odd! I'll take note of this!
@Mowgi6 ай бұрын
I've been having a lot of fun with udio lately. Tried it? I've put in some old unfinished music of mine and it's really surprised me with ways to move the song forward.
@daryladhityahenry6 ай бұрын
Hi Jarod. I want to ask something. Why you choose to still use TortoiseTTS + RVC instead of StyleTTS2? From performance and quality wise, isn't StyleTTS2 better? Or is there anything else you consider? Thankss
@aa-xn5hc4 ай бұрын
Why not combining to M4B ? thank you very much for the update!
@KolikasoMarianos2 ай бұрын
It doesn't work for me. I get an error that there is no file "rvc.json" in the configuration folder.
@Melike-oh1ir4 ай бұрын
The paus doesn't seem to work, everything else is fine. Any clue as to why?
@mauricio95815 ай бұрын
How do you create an audiobook with multiple different voices?
@TheFlexxoo5 ай бұрын
@Jarods_Journey Have you considered working on the code to enable it to run in the background or multithreaded? That way, it won't freeze the program when you regenerate the audio.
@Jarods_Journey5 ай бұрын
I left that in there as you technically shouldn't be doing anything while it's generating audio, it's similar to how I grey out buttons when the main generation block is running. Never went back to make it unblocking
@TheFlexxoo5 ай бұрын
@Jarods_Journey is never a good practice to have a gui freeze like that, it will be better disable the rest of the buttons or add a please wait message box, by the way good work in all you do, just an opinion, thanks.
@duahauvn6 ай бұрын
How to create subtitle for converted audio as .srt file please? Is there any ways to convert audio + subtitle files? Thank you.
@rickyparker29436 ай бұрын
What do I do with this error? UnicodeDecodeError: 'utf-8' codec can't decode byte 0x93 in position 69: invalid start byte
@elliotnewton49376 ай бұрын
Thanks for this Jarod :). Will there ever be a working model of your tortoise build set up in Google Colab? I really like the way it runs, but I simply don't have the processing power. Many thanks for everything you do.
@yoru25306 ай бұрын
It work but a bit hard on my machine. I wonder if there's a way to run my own audio through SVC and record it. Like playing an audio and run it through svc like you do with mic and record it.
@MiniFireball5 ай бұрын
What files do I need to move around to use a voice I trained on the web guide for the audio books?
@09jake125 ай бұрын
how do I make tort stick with one voice instead of doing it randomly? I don't want to make my own voice.
@tigros9992 ай бұрын
will this work with 3080 10 GB? i saw an opened OOM issue on github.
@NFawc4 ай бұрын
RVC makes things sound worse?? Just having a play with this, and the voice pitches are raised/messed around with from the source version from Tortoise? How can you stop the output from being manipulated by Audiobook from what Tortoise generates? ie: The WAV file in the Tortoise results folder sounds fine. But in the Audiobooks output folder, the WAV has been manipulated and sounds worse :( Is it RVC messing it up?
@NFawc4 ай бұрын
Yes! The RVC manipulation of the WAV files makes them sound worse. I've stopped it from working by overwriting the RVC output with the original file and it sounds much better:- orig_audio_path = audio_path audio_path = rvc_convert(model_path=voice_model_path, f0_up_key=f0_pitch, resample_sr=0, file_index=voice_index_path, index_rate=index_rate, input_path=audio_path) shutil.copy2(orig_audio_path, audio_path) # LOSE RVC
@Paul_Wasson6 ай бұрын
The hero we don't deserve!
@nggaknyangka76886 ай бұрын
hello brother, I always get info can't access localhost :7860 (HTTP 1.1 404 Not Found), when activating start_package so I can't access data in the audio book maker.
@gganyuu4 ай бұрын
same
@zonas79156 ай бұрын
Hello Jarod, did you take a look at seed tts?
@Jarods_Journey6 ай бұрын
Yes, it's really really good. However, ain't no way bytedance is releasing their models 😂
@dougmaisner6 ай бұрын
crazy good update!
@keisaboru11556 ай бұрын
wait is this compatible with rvc ?
@Jarods_Journey6 ай бұрын
Yes, it uses RVC models to convert the TTS output from tortoise.
@dolboeb-tz4bw6 ай бұрын
Please review IMS Toucan
@heckyes6 ай бұрын
Windows only?
@titolindj6 ай бұрын
Will it work with other languages?
@Artholos6 ай бұрын
Absolute hero 🎉
@Jarods_Journey6 ай бұрын
🙏🙏
@wnrandom986 ай бұрын
this is awesome! thank you
@giovannif25676 ай бұрын
So useful! Thank you ❤🎉
@vidneypopples5 ай бұрын
I'm getting Error: [WinError 10061] No connection could be made because the target machine actively refused it, retrying... (1/3) Error: [WinError 10061] No connection could be made because the target machine actively refused it, retrying... (2/3) Error: [WinError 10061] No connection could be made because the target machine actively refused it, retrying... (3/3) When trying to select the text file & then start audiobook generation
@gganyuu4 ай бұрын
same
@omarraafat7841Ай бұрын
Hi. Nice work! How can I use Arabic language with this audiobook maker?
@tylerchambliss83795 ай бұрын
Is there a way you can improve the UI to be accessible with screen readers for the blind? The web based gradio interfaces are great but this is almost unusable. As of now if I wanted to make a book I'd have to manually record each sentence from Tortoise in my digital audio workstation sentence by sentence because Tortoise has voice glitches you are all too aware of when using fine tuned models. I hope Style TTS 2 will finally be able to replace Tortoise and sound just as good. Obviously I'd rather something local be as good as Chat GPT4O's voice or 11 labs but I know that's going to be a few years down the line. You know Tortoise would be really great if it wasn't susceptible to the voice glitches.
@tempertephra6 ай бұрын
well done!
@Officialtamilreview6 ай бұрын
levenlabs altrenative create sir please
@donmarshal20706 ай бұрын
Can you integrate pronounciation correction in this (like Balabolka).🧐 If this frontend utility become utilisable like balabolka, [per word realtime pronounciation check, IPA pronounciation/editing, assign shortforms for sentence (AI for Artificial Intelligence)] and inbuilt text editor with pronounciation dictionary. Man, No kidding, I'll become member for lifetime 🙏🏻