Updated AI Audiobook Maker Installation and Bug Fixes

Рет қаралды 4,586

Күн бұрын

Пікірлер: 53

@Dreamcatcher659 5 ай бұрын

This guy has single handedly allowed me to work on projects and progress my life and hobbies in a way I NEVER would have been able to do without. Thank you sir!

@ZaraTanaka 5 ай бұрын

great to know. but what is your hobby?, what do you do?

@Mowgi 6 ай бұрын

Love your work, Jarod ❤

@Public-Freakout 6 ай бұрын

Is there a way to download pretrained tortois tts models so that we can just plug them in ?

@Public-Freakout 6 ай бұрын

Like a place where to download them ?

@BryanFaustino-hy8om 5 ай бұрын

Hiya! And thank you for your wonderful work! Just wanted to say, I upgraded to this version and noticed an odd bug where the generated json will remove all periods except those next to a quotation mark. This causes about half the generated audio to hang at the end of a sentence or mispronounce the final word and wasn't an issue in the previous version. The solution so far is to just manually add back in the missing periods if the audio is bad and regenerate.

@Jarods_Journey 5 ай бұрын

Odd! I'll take note of this!

@Mowgi 6 ай бұрын

I've been having a lot of fun with udio lately. Tried it? I've put in some old unfinished music of mine and it's really surprised me with ways to move the song forward.

@daryladhityahenry 6 ай бұрын

Hi Jarod. I want to ask something. Why you choose to still use TortoiseTTS + RVC instead of StyleTTS2? From performance and quality wise, isn't StyleTTS2 better? Or is there anything else you consider? Thankss

@aa-xn5hc 4 ай бұрын

Why not combining to M4B ? thank you very much for the update!

@KolikasoMarianos 2 ай бұрын

It doesn't work for me. I get an error that there is no file "rvc.json" in the configuration folder.

@Melike-oh1ir 4 ай бұрын

The paus doesn't seem to work, everything else is fine. Any clue as to why?

@mauricio9581 5 ай бұрын

How do you create an audiobook with multiple different voices?

@TheFlexxoo 5 ай бұрын

@Jarods_Journey Have you considered working on the code to enable it to run in the background or multithreaded? That way, it won't freeze the program when you regenerate the audio.

@Jarods_Journey 5 ай бұрын

I left that in there as you technically shouldn't be doing anything while it's generating audio, it's similar to how I grey out buttons when the main generation block is running. Never went back to make it unblocking

@TheFlexxoo 5 ай бұрын

@Jarods_Journey is never a good practice to have a gui freeze like that, it will be better disable the rest of the buttons or add a please wait message box, by the way good work in all you do, just an opinion, thanks.

@duahauvn 6 ай бұрын

How to create subtitle for converted audio as .srt file please? Is there any ways to convert audio + subtitle files? Thank you.

@rickyparker2943 6 ай бұрын

What do I do with this error? UnicodeDecodeError: 'utf-8' codec can't decode byte 0x93 in position 69: invalid start byte

@elliotnewton4937 6 ай бұрын

Thanks for this Jarod :). Will there ever be a working model of your tortoise build set up in Google Colab? I really like the way it runs, but I simply don't have the processing power. Many thanks for everything you do.

@yoru2530 6 ай бұрын

It work but a bit hard on my machine. I wonder if there's a way to run my own audio through SVC and record it. Like playing an audio and run it through svc like you do with mic and record it.

@MiniFireball 5 ай бұрын

What files do I need to move around to use a voice I trained on the web guide for the audio books?

@09jake12 5 ай бұрын

how do I make tort stick with one voice instead of doing it randomly? I don't want to make my own voice.

@tigros999 2 ай бұрын

will this work with 3080 10 GB? i saw an opened OOM issue on github.

@NFawc 4 ай бұрын

RVC makes things sound worse?? Just having a play with this, and the voice pitches are raised/messed around with from the source version from Tortoise? How can you stop the output from being manipulated by Audiobook from what Tortoise generates? ie: The WAV file in the Tortoise results folder sounds fine. But in the Audiobooks output folder, the WAV has been manipulated and sounds worse :( Is it RVC messing it up?

@NFawc 4 ай бұрын

Yes! The RVC manipulation of the WAV files makes them sound worse. I've stopped it from working by overwriting the RVC output with the original file and it sounds much better:- orig_audio_path = audio_path audio_path = rvc_convert(model_path=voice_model_path, f0_up_key=f0_pitch, resample_sr=0, file_index=voice_index_path, index_rate=index_rate, input_path=audio_path) shutil.copy2(orig_audio_path, audio_path) # LOSE RVC

@Paul_Wasson 6 ай бұрын

The hero we don't deserve!

@nggaknyangka7688 6 ай бұрын

hello brother, I always get info can't access localhost :7860 (HTTP 1.1 404 Not Found), when activating start_package so I can't access data in the audio book maker.

@gganyuu 4 ай бұрын

same

@zonas7915 6 ай бұрын

Hello Jarod, did you take a look at seed tts?

@Jarods_Journey 6 ай бұрын

Yes, it's really really good. However, ain't no way bytedance is releasing their models 😂

@dougmaisner 6 ай бұрын

crazy good update!

@keisaboru1155 6 ай бұрын

wait is this compatible with rvc ?

@Jarods_Journey 6 ай бұрын

Yes, it uses RVC models to convert the TTS output from tortoise.

@dolboeb-tz4bw 6 ай бұрын

Please review IMS Toucan

@heckyes 6 ай бұрын

Windows only?

@titolindj 6 ай бұрын

Will it work with other languages?

@Artholos 6 ай бұрын

Absolute hero 🎉

@Jarods_Journey 6 ай бұрын

🙏🙏

@wnrandom98 6 ай бұрын

this is awesome! thank you

@giovannif2567 6 ай бұрын

So useful! Thank you ❤🎉

@vidneypopples 5 ай бұрын

I'm getting Error: [WinError 10061] No connection could be made because the target machine actively refused it, retrying... (1/3) Error: [WinError 10061] No connection could be made because the target machine actively refused it, retrying... (2/3) Error: [WinError 10061] No connection could be made because the target machine actively refused it, retrying... (3/3) When trying to select the text file & then start audiobook generation

@gganyuu 4 ай бұрын

same

@omarraafat7841 Ай бұрын

Hi. Nice work! How can I use Arabic language with this audiobook maker?

@tylerchambliss8379 5 ай бұрын

Is there a way you can improve the UI to be accessible with screen readers for the blind? The web based gradio interfaces are great but this is almost unusable. As of now if I wanted to make a book I'd have to manually record each sentence from Tortoise in my digital audio workstation sentence by sentence because Tortoise has voice glitches you are all too aware of when using fine tuned models. I hope Style TTS 2 will finally be able to replace Tortoise and sound just as good. Obviously I'd rather something local be as good as Chat GPT4O's voice or 11 labs but I know that's going to be a few years down the line. You know Tortoise would be really great if it wasn't susceptible to the voice glitches.

@tempertephra 6 ай бұрын

well done!

@Officialtamilreview 6 ай бұрын

levenlabs altrenative create sir please

@donmarshal2070 6 ай бұрын

Can you integrate pronounciation correction in this (like Balabolka).🧐 If this frontend utility become utilisable like balabolka, [per word realtime pronounciation check, IPA pronounciation/editing, assign shortforms for sentence (AI for Artificial Intelligence)] and inbuilt text editor with pronounciation dictionary. Man, No kidding, I'll become member for lifetime 🙏🏻