First off, great video. I do not know anything about coding and I tried to follow along but I have no idea what I am doing. All I want to do is take a text file and use Watson TTS to create a mp3 file. I realize that this is a video showing how to do just that but I just can't do it. I followed along using the Jupyter webpage but I couldn't get it to work. I just wish I could copy and paste the text into a box and press convert file.
@NicholasRenotte3 жыл бұрын
Heya @Alan, I think there used to be a standalone demo that allowed you to download the voice, there is this: www.ibm.com/demos/live/tts-demo/self-service but you're unable to download the speech as a file!
@PaulSchneider-bp2ic2 жыл бұрын
Did you ever get it to work? I don't know if this will help, but an acquaintance of mine told me not to copy and paste any code which includes quotation marks therein because the pasting thereof fails to guarantee that the quotation marks will be identical to those generated when one presses the quotation mark key on one's keyboard. They look like they are identical, but they are not. Therefore they are not recognized.
@ArshaansEdits3 жыл бұрын
It gives me the following error: [Errno 11001] getaddrinfo failed on_close() takes 1 positional argument but 3 were given.
@NicholasRenotte3 жыл бұрын
Got a full error for me?
@ArshaansEdits3 жыл бұрын
@@NicholasRenotte This is the only thing it gives me everytime I run transcribe.py. :(
@tobys469810 ай бұрын
You have to register with IBM to use Watson. To register you have to give them your credit card details, which I am not prepared to do.
@saalemrafiq26612 жыл бұрын
hey Nic, thaks for the video plz let me know if there's anyway that we can play the audio file directly Instead of saving it locally and then playing
@NicholasRenotte2 жыл бұрын
Take a look at pydub!
@saalemrafiq26612 жыл бұрын
@@NicholasRenotte ok thnx One last question Is there a way to do all these by just using javascript ?
@incrementis28602 жыл бұрын
Thank you for the video. Any chance to make the speaker sound less robotic?
@tiktacktoes3 жыл бұрын
Hi! First i just want to say thankyou this video is helping my homework :). But I found an error while saving the mp3 file to my computer. The message say "Error: Provided API key could not be found, Code: 400" even when i use the same code as yours. Can u help me with this? Thx a lot btw :)
@NicholasRenotte3 жыл бұрын
Sweet! Oh, did you generate your own apikey @woolfing paca? The one in the video would have been deleted that's why it's probably throwing errors.
@BettersizeInstruments3 жыл бұрын
Great video. How can I resolve the error while executing with open(‘ ‘) statement? ApiException: Error
@NicholasRenotte3 жыл бұрын
What's the full error?
@Just-Relax. Жыл бұрын
Great tutorial, but how can you change where the file goes? Right now its saving to my desktop, but I want it to save to another folder. How can I do that?
@michpo144510 ай бұрын
You skipped the whole part about the watson account creation. ITs supposed to be free, but they're asking for my credit card number. Is this right?
@enesturksal75289 ай бұрын
btw also it does not accept any cart
@pravenkumar57397 ай бұрын
Yeah bro it's not free
@farahfekih68023 жыл бұрын
Thank you so much that was useful and super simple . Keep it up !!!
@AdinanBrito3 жыл бұрын
Thank you so much for this video! Is there a way to get the audio file link instead of the audio itself?
@NicholasRenotte3 жыл бұрын
Oooh, I don't believe so, the data isn't stored in the service. Could throw it up into dropbox or something and grab that link?
@josephpark6264 жыл бұрын
I can't find the folder that supposed to have the mp3 files. I followed your instructions on jupyter. Maybe it is because I am using windows or maybe I need to download jupyter, not just use the website version?
@NicholasRenotte4 жыл бұрын
Heya @Joseph Park, it should be in the same folder that your Jupyter Notebook is in 😊. Easiest way to find the folder is to: 1. Go back into the command line you started the notebook from 2. Stop the notebook 3. Type the command "start . " This should open up the folder that your notebook is executing from.
@josephpark6264 жыл бұрын
@@NicholasRenotte thank you
@NicholasRenotte4 жыл бұрын
Anytime @Joseph Park!
@evolve1013 жыл бұрын
Seems awesome! Is it open source? Can people use it on youtube videos commercially or while making money with ads? If so.. Much respect if IBM keeps stuff like that free for users while keeping it open source. It's always the open source ones that gets to cutting edge brilliance in technology. =) It's the best TTS i have ever heard..
@NicholasRenotte3 жыл бұрын
Heya @evolve, the models are consumed via API but there's a super flexible free tier that allows it to be used for a whole bunch of experimentation!
@NicholasRenotte3 жыл бұрын
@@evolve101 yah!
@LpARTURO3 жыл бұрын
LOVE IT! hello nicholas, im new to NLP and ML/AI and just started learning it , im about to work on project of a movil app translator of text to sign language, is any api out there than can help me speed up the process of the project? or any advice on how to approach it? with the little i know i was planing on a sort of clasification model from text to img, but i think it wont be the best thing to do if i want to add also the option to convert the img(sign language image) to text to make the translator more flexible. keep the good work, this is rlly a great content!
@NicholasRenotte3 жыл бұрын
Heya @Arturo 😀, Text to Sign might not require AI/ML. You could just parse the different parts of speech and render video! Are you trying to go the other way as well? Sign to text?
@LpARTURO3 жыл бұрын
@@NicholasRenotte yeah!, my goal is to do an sign language movil app, i would like to approach both, the non-sign language speakers understanding what a sign language user wants to comunicate and viceversa, but im just starting to get use to this 😅and seen many ways i could posibbly approach it, but your channel so far is the greatest source material i've found for this particular topic, im reconsidering the aproach of the app, real time sign detec. + speech to text and then try to get every word to it's equivalent in sign language to cover both way of comunication, all based on topics i've found on your channel, again thnx! your channel is a gold mine.😊
@NicholasRenotte3 жыл бұрын
Awesome stuff @@LpARTURO , let me know how you go! Sounds like there's some architecture stuff to model first, but let me know if you need any pointers!
@MonkeyDLuffy-cq2lo4 жыл бұрын
Thank you sooo much. I have a question, is there any method to integrate IBM TTS to the apple mac speech. I want to call the TTS to read the selected screen text, it is possible?
@NicholasRenotte4 жыл бұрын
Ooooh, I think Apple's models are proprietary. I haven't seen that done yet @da da.
@MonkeyDLuffy-cq2lo4 жыл бұрын
@@NicholasRenotte There is a Firefox/Chrome Extension called read aloud. It allows inputing the IBM Watson credential to use IMB TTS. But it can only read the webpage.
@NicholasRenotte4 жыл бұрын
@@MonkeyDLuffy-cq2lo woah, I've never seen that before but it's super cool. Will take a look!
@Van0883 жыл бұрын
theres an error when I play the mp3, "This file isn't playable. That might be because the file type is unsupported, the file extension is incorrect, or the file is corrupt." i tried installing other media player but still error.
@NicholasRenotte3 жыл бұрын
Hmmm, tried playing it on another device?
@asherageh53303 жыл бұрын
Really like the video. Very clear and easy to understand. But is there a way to convert the text to speech live? So when I give the program something to say it immediately says it without converting to an mp3 first?
@NicholasRenotte3 жыл бұрын
I haven't seen this for TTS but have seen it for STT: github.com/IBM/watson-streaming-stt
@asherageh53303 жыл бұрын
@@NicholasRenotte Thank you for this. I notice that you have a video on speech to text. could I follow that video but use my microphone as the source?
@NicholasRenotte3 жыл бұрын
@@asherageh5330 hmmm, without using sockets you would have to save the audio from your mic first then convert. I might do a streaming demo as a live stream this week or next if you wanted a tutorial. Thoughts?
@stateofbeing88643 жыл бұрын
Sir,I could find the speech.mp3 file and its showing that the size if of 0B.and I'm unable to play that also..Could you help me out?
@NicholasRenotte3 жыл бұрын
Heya @Aruthra, do you get any output from the API when you run the command?
@SantoshLLC3 жыл бұрын
Hey thank you for this video! I wondered if there is a way to directly play the voice without creating an mp3 file?
@NicholasRenotte3 жыл бұрын
There's some audio players available for Python, could take a look at those!
@SantoshLLC3 жыл бұрын
@@NicholasRenotte thanks tried playsound library but didn’t work. Anyway it’s fine, keep going with this incredible content, you are incredible!
@tianqionggao95784 жыл бұрын
OMG, super details ! Thank you !
@NicholasRenotte4 жыл бұрын
👋 glad you enjoyed it @amy gao!
@sarrasalhi23603 жыл бұрын
hey ,haw can i convert the sign language detection in your previous video to speech with same model mobilenet ?Is that possible
@NicholasRenotte3 жыл бұрын
Heya @Sarra, once you've extracted the right you can pass the text to the TTS service and play the audio using something like PyAudio!
@alvarosaez95113 жыл бұрын
Hi Nic, it was a nice tutorial. I've just tried the code and found this problem: "It is required that you pass in a value for the "algorithms" argument when calling decode()".
@NicholasRenotte3 жыл бұрын
Heya @Alvaro, credit goes to one of the other subscribers who figured out this was an issue with PyJWT, try installing 1.7.1 and it should fix the issue. Example install: pip install PyJWT==1.7.1
@rutwikudgata81693 жыл бұрын
Can we use it for youtube vdos? I mean will my channel be monetized if I use this TTS? The new guidelines is against use of TTS I guess! Please help me out.
@NicholasRenotte3 жыл бұрын
Oooo, I'm not too sure about that @Rutwik. I'm not super familiar with the YT guidelines on TTS but everything I've read makes it sound like it'll be tough to get monetised if using TTS.
@andylacron64703 жыл бұрын
This project is stunning, i would like to do one similar to one to this. What can i do in the case that the language i want as a voice output isent included in the list of languages and voices for example Luganda or Lugisu . How can i go about this!! Thank you.
@NicholasRenotte3 жыл бұрын
Hiya @andy, you could try adapting one of the base languages and fine tuning for those languages!
@andylacron64703 жыл бұрын
@@NicholasRenotte Thank you for the reply but what i asking was what can i do get that fine tuning moving. Is there platform or a book any guideline on how i can go about it because am new to nlp and not very sure of what to do to make that idea work. Thanks.
@NicholasRenotte3 жыл бұрын
@@andylacron6470 check this out: cloud.ibm.com/docs/speech-to-text?topic=speech-to-text-customization
@shuvrajyotibose90022 жыл бұрын
we want to build only tts using gaze tracking ... identification of hard to read words from a passage using gaze tracking and it will ( sytem ) will dictate the word to assist reader .
@AlexandertheGlande3 жыл бұрын
Hello friend, I am Brazilian, I would like to know if it is possible to convert the voice from another language to PT-BR.
@NicholasRenotte3 жыл бұрын
Could convert the text from PT-BR first then run it through the TTS pipeline.
@gutandrew3 жыл бұрын
after typing !pip install imb_watson I got ERROR: Could not find a version that satisfies the requirement imb_watson (from versions: none)
@NicholasRenotte3 жыл бұрын
Try !pip install ibm_watson
@santosh22naik3 жыл бұрын
thanks nick. this video been a great help. when I m trying do TTS in Spanish, Spanish text is not being identified correctly. As a result incorrect Spanish audio is getting generated
@NicholasRenotte3 жыл бұрын
Hmmm, what's the audio quality like?
@Pyxhel3 жыл бұрын
Hi, how to "link" Jupyter and our computer (to have localhost in the adress bar instead of Jupyter's url) ? Thanks
@NicholasRenotte3 жыл бұрын
Heya @Cochon, normally you can use localhost:8888 or the link available from the command line. Alternatively you can delve a little deeper with something like this: jupyter-notebook.readthedocs.io/en/stable/public_server.html
@AceStJohn4 жыл бұрын
Im getting an error message "ModuleNotFoundError Traceback (most recent call last) in 1 from ibm_watson import TextToSpeechV1 ----> 2 from IBM_cloud_sdk_core.authenticators import IAMAuthenticator ModuleNotFoundError: No module named 'IBM_cloud_sdk_core' " Can you help me with this?
@NicholasRenotte4 жыл бұрын
Definitely @Ace St. John! That error is because the cloud sdk library is not installed. Just run this line of code in your notebook in a new cell to install the cloud SDK. !pip install ibm-cloud-sdk-core
@AceStJohn4 жыл бұрын
@@NicholasRenotte Awesome! thank you! Last question... What if I have a LARGE text file that I want to convert to MP3... is there a size limit that you know of?
@NicholasRenotte4 жыл бұрын
No problemo @@AceStJohn , there's an 8KB limit for a single synthesis (GET) request. If you hit that limit you can chunk it up and combine them back together using ffmpeg. This video shows how to do it with STT but it would be a similar process with TTS: kzbin.info/www/bejne/kIbWknyFia-pg5Y
@MostHolyPlace24 жыл бұрын
I just completed this tutorial and it worked perfectly. This was a big help for me.
@NicholasRenotte4 жыл бұрын
Hell yes! Awesome to hear @Mike!
@johanvandemerwe90503 жыл бұрын
That was a very useful tutorial. Thank you.
@NicholasRenotte3 жыл бұрын
✌thanks so much @Johan!
@gotavideo3 жыл бұрын
I still have a difficult time doing this. I can't understand how you get 2 in "In [ ]" and so on. I thought you suppose to run all the cells, so that what I find so confusing.
@NicholasRenotte3 жыл бұрын
Heya @Jonathan, this is standard jupyter notebook functionality, check this out for a crash course: realpython.com/jupyter-notebook-introduction/
@Ricocase2 жыл бұрын
Can AI interpret numerical trends then convert it speech? E.g. video game, sports commentary at halftime.
@naturaudaxds2 жыл бұрын
Hello, thanks for the video, it was excellent! One question, how can I add this to an IBM Watson Chat bot? How can I make the chat bot speak to the user? it was the best video!! Greetings from Peru!!
@meirgoldenberg56383 жыл бұрын
Thank you! It would be great to add something on using SSML.
@NicholasRenotte3 жыл бұрын
Coolio, will add it to the list of videos @Meir!
@BettersizeInstruments3 жыл бұрын
Thank you @Nicholas Renotte for this amazing video. I'm new at this, I have the same request. Please please give a demo on how to add SSML in the audio file you named churchill. Much love
@NicholasRenotte3 жыл бұрын
@@BettersizeInstruments been wanting to get to this for a while now, will bump it up!
@PaulSchneider-bp2ic2 жыл бұрын
What programming tools could produce the following result? 1.) Play the musical note which was assigned by me to each character of a text file, as it is read aloud by an appropriately coded program. 2.) Or, as each individual key on a QWERTY keyboard is pressed, cause its assigned musical note to play. As a (partial) example consider: The string "B V G C T S L" , etc., Letter ..........Actual Musical Tone generated B ........................ C4 V ........................ D4 G ........................ E4 C ........................ F4 T ........................ G4 S ........................ A5 L ........................ B5 etc.,
@TheDemolitionmech3 жыл бұрын
Hi Nick, thank you for the helpful video. What if I wanted to make each line a separate audio file? (dividing up the paragraph)
@NicholasRenotte3 жыл бұрын
Would just do each line as a separate transcription, i.e. just run one line at a time.
@gilsmadi36974 жыл бұрын
Thanks for the engaging video. Could you please help me with the error message 2:47 zsh:1: command not found: pip ?
@NicholasRenotte4 жыл бұрын
Heya @GILS & MADI, didn't see this comment til today! Just checking do you have Python installed? pip normally comes prepackaged with the later versions on Python.
@deepakchand30923 жыл бұрын
Great video. How can I resolve DecodeError that I get after executing with open(‘ ‘) statement? DecodeError: It is required that you pass in a value for the “algorithms” argument when calling decode().
@NicholasRenotte3 жыл бұрын
Heya @Deepak, can you share your sample code (excluding API keys) using codeshare.io? Also, was there a larger error?
@deepakchand30923 жыл бұрын
Hi @@NicholasRenotte ,as an absolute novice, from what I read, think it’s due to recent upgrades.Here is the url for share code codeshare.io/al6Yoj
@NicholasRenotte3 жыл бұрын
@@deepakchand3092 one of the other subscribers figured out this was an issue with PyJWT, try installing 1.7.1 and it should fix the issue. Example install: pip install PyJWT==1.7.1
@deepakchand30923 жыл бұрын
@@NicholasRenotte Thank you Nick, and happy 2021 :)
@deepakchand30923 жыл бұрын
Hi Nick, since you have so much experience and insight (at least in Australian context), which specialisation/jobs in computer science you reckon would allow remote working/working from home. Thank you in advance.
@sindugokulapati98343 жыл бұрын
could we take in microphone input rather than audio file
@NicholasRenotte3 жыл бұрын
Check this out: kzbin.info/www/bejne/j3Tcpo2DaZpnlas
@KnowFunOfficial3 жыл бұрын
Wow, this was super helpful. Any idea on how to circumvent the max characters that are allowed to go to IBM Watson ? Right now your solution works if it is a small amount of text for the file, but if the file is large than a certain amount of characters, you get an error when using your method.
@NicholasRenotte3 жыл бұрын
Chunk it up into multiple requests, then concatenate the audio back together.
@kelvinfm37553 жыл бұрын
Great job! Hi Nic, following this video, I converted a text file with two sentences into mp3 file successful. I want to have a pause (1 second or 1.5 seconds) between two sentences, how to do it? do you have another video or sample for doing so? Many thanks.
@NicholasRenotte3 жыл бұрын
Heya @Kelvin, the easiest way to do it is with commas or spaces. TTS interprets these as pauses!
@Wisdomprojectofficial3 жыл бұрын
how to change the pitch of the speaker ??
@NicholasRenotte3 жыл бұрын
Check this out: medium.com/ibm-data-ai/tune-by-example-how-to-tune-watson-text-to-speech-for-better-intonations-bcee8404d927
@Wisdomprojectofficial3 жыл бұрын
@@NicholasRenotte Thankyou so much .
@eternalsun823 жыл бұрын
hello nicholas I need a Turkish speaking voice, how can I find it?
@NicholasRenotte3 жыл бұрын
Heya @Ilker, doesn't look likt there's a Turkish voice available unfortunately.
@linuxtechlab23043 жыл бұрын
hi nicholas, your code works fine, it's great & exactly what i was looking for. NOW I HAVE A QUESTION, HOW TO REDUCE THE SPEECH SPEED BY 20%. CAN YOU SHARE CODE FOR THAT AS WELL. I WILL BE MOST THANKFUL.
@NicholasRenotte3 жыл бұрын
Heya! You can add , or to break up the text and push it out slower! Check this out: cloud.ibm.com/docs/text-to-speech?topic=text-to-speech-timing
@zenmony-dot-com74572 жыл бұрын
@@NicholasRenotte Thanks for the quality tutorial! Is there a way to add the rate SSML property inside python code? Decrease speaking rate by 25%
@itsmemaldy70713 жыл бұрын
Can i use a my Voice and not the predefined voices that i see in the video ?
@NicholasRenotte3 жыл бұрын
I believe you can fine tune it, haven't done a vid on it yet though @Its me Maldy!
@madhav_singh Жыл бұрын
can we use google speech to text for this instead for IBM watson?
@BettersizeInstruments3 жыл бұрын
Thank you @Nicholas Renotte for this amazing video. I'm new at this, I have the same request. Please please give a demo on how to add SSML in the audio file you named churchill. Much love
@NicholasRenotte3 жыл бұрын
You got it! Vid planned!
@BettersizeInstruments3 жыл бұрын
@@NicholasRenotte Thank you!
@BettersizeInstruments3 жыл бұрын
@@NicholasRenotte It will make my day. Thank you so much!
Thanks for a super Tutorial Nicholas. I am stuck with this error message, cant seem to resolve it with the resources available on google. Any help would be greatly appreciated.
@유영재-c9c3 жыл бұрын
Wow! Incredible! what mean accept params?
@NicholasRenotte3 жыл бұрын
It determines which types of parameters to accept in the API request.
@mohammadrezasalemi84353 жыл бұрын
Hi I get this error please help me Decode Error : It is required you pass in a value for the "algorithm" argument when calling decode()
@NicholasRenotte3 жыл бұрын
Heya one of the other subscribers figured out this was an issue with PyJWT, try installing 1.7.1 and it should fix the issue. Example install: pip install PyJWT==1.7.1
@blenderdad Жыл бұрын
Thank you a lot, exactly what i was looking for!!!
@ktrcoyote3 жыл бұрын
Is there a way to use this to create audiobooks? I keep hitting a cap on the text file I'm sending. I guess the best way to go about it would be to break up the text file send it to IBM then combine the output into a single file... I just have no idea what I'm doing.
@NicholasRenotte3 жыл бұрын
With audio books, you'll probably hit the cap pretty quickly. But you're right, you could break it up and get the output :)
@gameon89994 жыл бұрын
Can I follow same procedure to access custom service built in IBM watson?
@NicholasRenotte4 жыл бұрын
Heya @GameON, a custom TTS service?
@gameon89994 жыл бұрын
In my case Im using custom visual recognition service,I have tested it with the watson interface after training.But I want to use the service from python file.. Is it possible? I'm new to this interface and any idea would help,thanks
@NicholasRenotte4 жыл бұрын
Definitely!! I run through how to do it in this video: kzbin.info/www/bejne/imqul4yXiJ6brbM
@gameon89994 жыл бұрын
@Nicholas Renotte exactly what I was looking for..excellent video Thanks a lot!
@NicholasRenotte4 жыл бұрын
Anytime!! Let me know how you go, HMU if you need any help!
@juanricardorivera35913 жыл бұрын
Hi Nicholas. Could you put up a tutorial for image segmentation?
@NicholasRenotte3 жыл бұрын
Yup, working on it as we speak. Code is 80% done just need to finish the viz components!
@juanricardorivera35913 жыл бұрын
@@NicholasRenotte great! Thanks.
@debarghyakundu9089 ай бұрын
sir how can we convert sign language the single letter to WORDS..
@mummyskitchen53114 жыл бұрын
Nice Nick,... haha that beret on you :D :D
@NicholasRenotte4 жыл бұрын
Hahaha, I had to dig it out of my wardrobe this morning!
@mummyskitchen53114 жыл бұрын
@@NicholasRenotte Lol :D
@mikepierce24764 жыл бұрын
Is there a way to have each sentence be saved at it's own mp3 file?
@NicholasRenotte4 жыл бұрын
Heya @Mike Pierce! Definitely, try this: # Open file as per normal with open('churchill.txt', 'r') as f: text = f.readlines() # Remove line break text = [line.replace(' ','') for line in text] # Output each individual file as it's own mp3 counter = 0 for sentence in text: with open('{counter:04d}.mp3'.format(counter=counter), 'wb') as audio_file: res = tts.synthesize(sentence, accept='audio/mp3', voice='en-GB_JamesV3Voice').get_result() audio_file.write(res.content) counter +=1
@nexusloop3 жыл бұрын
hey im new to this i did everything i just cant find that folder in the end where you have your audio file
@NicholasRenotte3 жыл бұрын
Heya @Filip! No stress let's work through it, did you walk through the repository here: github.com/nicknochnack/TextToSpeech
@MarcillaSmith3 жыл бұрын
Where is your Jupyter Notebook located?
@NicholasRenotte3 жыл бұрын
@@MarcillaSmith link's in the description 😊!
@MarcillaSmith3 жыл бұрын
@@NicholasRenotte sorry, I was addressing the OP. I'm happy to report I got it running with no snags (other than my usual typos!). Thanks for UL'ing!
@NicholasRenotte3 жыл бұрын
@@MarcillaSmith oh my bad! Awesome work!!
@kikaFIL4 жыл бұрын
such an easy way to learn it was mighty useful!
@NicholasRenotte4 жыл бұрын
Thanks so much @Clarisse Alvarenga! Anything else you'd like to see or learn?!
@kikaFIL4 жыл бұрын
@@NicholasRenotte yes!!! I'm starting in the business and would love to learn to make an python API integrating STT and NLU
@bottonline4544 Жыл бұрын
Hermanito , usted es un " vergatario " no pares de publicar contenido sobre Ptyhon . 🐍 Muchas gracias
@parthdeshmukh95483 жыл бұрын
I'm getting error :( : Error: No such child resource., Code: 404 , X-global-transaction-id: 0075f199d2533fdac46e7f3d9796ff27
@NicholasRenotte3 жыл бұрын
Heya @Parth, can you double check your APIKey and URL?
@parthdeshmukh95483 жыл бұрын
@@NicholasRenotte hey thanks for the reply Yes i did.
@NicholasRenotte3 жыл бұрын
@@parthdeshmukh9548 hmm, you generated a new service on IBM Cloud and used those credentials?
@parthdeshmukh95483 жыл бұрын
@@NicholasRenotte Yes i generated new credentials, and i was getting error, but when i switched to language translation and identification it worked there.. so no issues now. Thankyou so much man!
@NicholasRenotte3 жыл бұрын
@@parthdeshmukh9548 anytime! Glad you got it up and running!
@the_end_boss2 жыл бұрын
Watson TTS voices are not at a high enough maturity level and still sound over processed like someone is speaking under water. It happens when you put too much of a hard limiter on the sound. Descript voices are further along than this. I wonder if Google and or Amazon have anything in the pipeline to compete?
@henkhbit57483 жыл бұрын
Merci bien Nick😃
@anandsuthar32273 жыл бұрын
Sir , how can I use my own tts voice in text to speech app. Can anybody plz help me..
@NicholasRenotte3 жыл бұрын
Take a look at voice cloners :)
@denpark88872 жыл бұрын
I'm going to trying this... thanks 👍🏻
@oscaregomezc2 жыл бұрын
it would be great to come up with a solution that integrates STT and TTS to create a live translator and adapt it on a device, sorry for the bad english
@kushalg8820 Жыл бұрын
does this project include data mining tooo??
@HusaynTechOfficialChannel3 жыл бұрын
Do I have to learn to coding to do this?
@NicholasRenotte3 жыл бұрын
A little but I take it step by step!
@HusaynTechOfficialChannel3 жыл бұрын
@@NicholasRenotte ok thanks
@eo4wellness3 жыл бұрын
Thanks. This was a great follow-along. I used Google Colab to code along with you and take my own notes. Impressively easy to follow along. I found your video searching for a way to make some KZbin-friendly voices for low-cost voice actors in an upcoming short film I'm working on. One question I have for future ideas for use, how would I go about training a model to be a voice from sound files (I have a ton of sound files of one person's voice--how do I make that a trained model to use like the "voices" here in this example)?
@NicholasRenotte3 жыл бұрын
Heya E. O., check this out: medium.com/ibm-data-ai/tune-by-example-how-to-tune-watson-text-to-speech-for-better-intonations-bcee8404d927
@hautrancong97153 жыл бұрын
thank you so much for a great video!!!
@NicholasRenotte3 жыл бұрын
Thanks for checking it out!
@gravilliarubusta63763 жыл бұрын
How it is working UNICODE?
@NicholasRenotte3 жыл бұрын
Hmmm? Not too sure of the question.
@aakritityagi1393 жыл бұрын
When i am executing 'Hello world' script.. it is giving me certificate verify failed error..
@NicholasRenotte3 жыл бұрын
Double check your URL, might be missing https
@investfoxy3 жыл бұрын
well thats a lot of work to convert text to speech..
@NicholasRenotte3 жыл бұрын
It's all relative @InvestFX! This code can then be embedded into other applications, more often than not that's what the tech is used for.
@loganmurray79923 жыл бұрын
When I run this in PyCharm rather than Jupyter, I just get a corrupt MP3 file. Anyone know how to fix this?
@NicholasRenotte3 жыл бұрын
Heya @Logan, do you get an error on output?
@VirtuelleWeltenMitKhan3 жыл бұрын
7:50 to hear an example ... not great but also not that bad
@VirtuelleWeltenMitKhan3 жыл бұрын
10:21 longer text
@NicholasRenotte3 жыл бұрын
Thanks for annotating @Khan!
@xalidhemidov4 жыл бұрын
Thank you, man! You're awesome!
@NicholasRenotte4 жыл бұрын
Right back at ya, thanks for watching @Xalid Hemidov!
@yasodhar56743 жыл бұрын
How to make a Jarvis can u please tell me
@NicholasRenotte3 жыл бұрын
Like from Ironman?
@atheer21103 жыл бұрын
thank you , it was super easy and understandable 💙
@keerthinettimi8419 Жыл бұрын
how to add emotion to the speech
@keen4e2 жыл бұрын
11:07 ahahah man you are the best!!!
@abhiramisuresh27203 жыл бұрын
I'm getting error. I'm not a coder. Pls help!
@NicholasRenotte3 жыл бұрын
Definitely, whats the error @Abhirami?
@abhiramisuresh27203 жыл бұрын
@@NicholasRenotte How do I share the screenshot of my error? It's not getting pasted over here.
@NicholasRenotte3 жыл бұрын
@@abhiramisuresh2720 hmm, weird, can you paste the text?
@abhiramisuresh27203 жыл бұрын
@@NicholasRenotte !pip install ibm_watson Requirement already satisfied: ibm_watson in c:\users\abhir\appdata\local\programs\python\python39\lib\site-packages (4.7.1) Requirement already satisfied: requests=2.0 in c:\users\abhir\appdata\local\programs\python\python39\lib\site-packages (from ibm_watson) (2.25.0) Requirement already satisfied: python_dateutil>=2.5.3 in c:\users\abhir\appdata\local\programs\python\python39\lib\site-packages (from ibm_watson) (2.8.1) Requirement already satisfied: websocket-client==0.48.0 in c:\users\abhir\appdata\local\programs\python\python39\lib\site-packages (from ibm_watson) (0.48.0) Requirement already satisfied: ibm_cloud_sdk_core==1.7.3 in c:\users\abhir\appdata\local\programs\python\python39\lib\site-packages (from ibm_watson) (1.7.3) Requirement already satisfied: PyJWT>=1.7.1 in c:\users\abhir\appdata\local\programs\python\python39\lib\site-packages (from ibm_cloud_sdk_core==1.7.3->ibm_watson) (1.7.1) Requirement already satisfied: six in c:\users\abhir\appdata\local\programs\python\python39\lib\site-packages (from websocket-client==0.48.0->ibm_watson) (1.15.0) Requirement already satisfied: idna=2.5 in c:\users\abhir\appdata\local\programs\python\python39\lib\site-packages (from requests=2.0->ibm_watson) (2.10) Requirement already satisfied: certifi>=2017.4.17 in c:\users\abhir\appdata\local\programs\python\python39\lib\site-packages (from requests=2.0->ibm_watson) (2020.12.5) Requirement already satisfied: chardet=3.0.2 in c:\users\abhir\appdata\local\programs\python\python39\lib\site-packages (from requests=2.0->ibm_watson) (3.0.4) Requirement already satisfied: urllib3=1.21.1 in c:\users\abhir\appdata\local\programs\python\python39\lib\site-packages (from requests=2.0->ibm_watson) (1.26.2) url=api.eu-gb.text-to-speech.watson.cloud.ibm.com/instances/70a6d0e5-f11f-463f-b175-c8fe45d53259 apikey=vLfMPTX0j6d3R1UD9J9i13zLDgLs02vYL7ltnGOxKZtN File "", line 1 url=api.eu-gb.text-to-speech.watson.cloud.ibm.com/instances/70a6d0e5-f11f-463f-b175-c8fe45d53259 ^ SyntaxError: invalid syntax from ibm_watson import TextToSpeechV1 from ibm_cloud_sdk_core.authenticators import IAMAuthenticator authenticator=IAMAuthenticator(apikey) tts=TextToSpeechV1(authenticator=authenticator) tts.set_service_url(url) --------------------------------------------------------------------------- NameError Traceback (most recent call last) in ----> 1 authenticator=IAMAuthenticator(apikey) 2 tts=TextToSpeechV1(authenticator=authenticator) 3 tts.set_service_url(url) NameError: name 'apikey' is not defined with open('./speech.mp3','wb')as audio_file: res=tts.synthesize('Good Morning!',accept='audio/mp3',voice='en_US_MichaelV3Voice').get_result() audio_file.write(res.content) --------------------------------------------------------------------------- NameError Traceback (most recent call last) in 1 with open('./speech.mp3','wb')as audio_file: ----> 2 res=tts.synthesize('Good Morning!',accept='audio/mp3',voice='en_US_MichaelV3Voice').get_result() 3 audio_file.write(res.content) NameError: name 'tts' is not defined
@adilabdella79863 жыл бұрын
to be specific about the video it is 13:56 min long
@NicholasRenotte3 жыл бұрын
😂 you got me man!
@ashleysami16404 жыл бұрын
oui oui 👨🎨
@NicholasRenotte4 жыл бұрын
Mais oui, bien sur! 👩🏼🎨
@rcarrasquel.u4 жыл бұрын
Very good!!!
@NicholasRenotte4 жыл бұрын
Thanks a million!!
@wkml91752 жыл бұрын
tnx bro it was nice
@pavanramp96742 жыл бұрын
can you help me out
@spliter2003 жыл бұрын
10,000 characters?
@NicholasRenotte3 жыл бұрын
8KB input limit (cloud.ibm.com/apidocs/text-to-speech#getsynthesize) but you can chunk the requests and recombine once processed!
@moviethegame88392 жыл бұрын
life-changing w000t!
@draftcraftarchitecturaldes39654 жыл бұрын
Great...
@NicholasRenotte4 жыл бұрын
Thanks so much @Manoj!
@blackitom14083 жыл бұрын
damn it blows my mind bruh
@NicholasRenotte3 жыл бұрын
Oh man, gotta love what's possible with ML!
@SultanKhan-fm6zb3 жыл бұрын
Thank you very much for such a wonderful video. I am not a coder however I am making an attempt to code for the first time and this is the error that I am getting. Need help with open('churchill.txt', 'r') as f: text = f.readlines() --------------------------------------------------------------------------- FileNotFoundError Traceback (most recent call last) in ----> 1 with open('churchill.txt', 'r') as f: 2 text = f.readlines() FileNotFoundError: [Errno 2] No such file or directory: 'churchill.txt'
@shinchannoharaumra5saal2643 жыл бұрын
wrong coding... such disappointment
@NicholasRenotte3 жыл бұрын
Mmmm, was there an error you got that I can help you with?
@DanielSkorski-o5s Жыл бұрын
This could be contained into 3 minute video, rest is useless talking.