Exactly what I needed. Well explained. One question (maybe I missed this in the video) but what is the relation between the environment versus the folder on your computer. Or in other words, in which folder the files created within environment are stored?
@DalazG12 сағат бұрын
Curious, have you tried using audio-webui instead of elevenlabs? elevenlabs is great but costly, would love to see a tutorial with this as an alternative
@maxyang514313 сағат бұрын
Awesome video, thanks so much!
@jaypee876817 сағат бұрын
Adding weights to inputs creates exponentially accentuates biases at the end results. So there is no escaping biases. Only a different type of bias
@farexBaby-ur8ns18 сағат бұрын
Nice and concise and not an hr long - “Engineering a prompt” has been misconstrued by folks to be another branch of engineering called “prompt engineering” 🫢🫢😅 and has triggered a load of “prompt engineering” courses wherein charlatans are making money. Imho, folks should invest in learning langchain which is like a framework that will be around for long
@plowe675121 сағат бұрын
I wish there was a video on how to record system audio.
@ScriptureConsolidatedКүн бұрын
have you ever seen a dual output layer in line?
@rohanchoudhary672Күн бұрын
How to pass config to RealtimeTranscriber object
@maryamaghili11482 күн бұрын
you are confusing the definitions. in the 2:15 minute you claim a batch with 3 data points is coming to the layer and then instead of having 1mean and var for the entire batch, you calc 3 mean and variance for each neuron, which does not look right. Please revisit your video.
@amitsingh76842 күн бұрын
very nicely explained with clear details
@attilakiss85852 күн бұрын
I wonder why you put your face on the screen in a way to cover many relevant content, instead of putting it on top right corner for example.
@rupakvignesh2 күн бұрын
Title says "if I could start over" which assumes someone has gone through it already. If someone did courses and projects I don't agree with your steps. Here's how I would suggest people to do it if they're starting over. 1. Pick a problem you'd like to solve (say object detection or segmentation) 2. Read the seminal papers in this area. You won't understand everything, that's totally normal. Note down the parts you don't understand. 3. Implement the paper yourself without seeing the GitHub. Go deep into the math too if you like it. 4. Repeat for other problems. 5. Have fun.
@vladsaveluc26593 күн бұрын
Impressive you were able to condense it in 60 seconds. Good job!
@Roopkishore7263 күн бұрын
Assembly AI provide excellent services But some problems to make android application. Please make a video to make application for convert audio file to text in android studio when button click and text show in textview. I am waiting...
@DevenderKumar-hx4dc2 күн бұрын
I face same problem
@HarshaJK3 күн бұрын
This video introduces eight Python libraries for audio processing. 🔊 Displaying and playing audio files using ipython display 00:10 📁 Reading and writing audio files with soundfile 0:40 🔊 Opening and writing WAV files using the wave module 1:18 🎤 Reading microphone input with pyaudio 2:06 🎤 Reading microphone input with sounddevice 03:15 🎵 Manipulating audio with pydub 03:58 🎵 Audio analysis with librosa 04:49 🔥 Audio processing with torch audio 05:47
@HarshaJK3 күн бұрын
Great tutorial!! I am curious to know the VS Code theme that you are using. It looks very pleasing to the eye and want to try it out!!
@nemonemo62853 күн бұрын
High end Neural Networks in their current form do not work. They are not scalable, consume massive amounts of energy and compute power, i.e. you need a datacentre, don't learn after training and are inherently very unsafe etc!!!! Solution, Liquid Neural Networks!!!!!
@jamesomina41193 күн бұрын
Great! I Like the explanation.
@jonron38053 күн бұрын
Not sure whats the adwantage of using a "PromptTemplate" instead of an f-string. I mean we could just create a prompt=f"Question '{question_text}' Lets think step by step" Rt? So whats the advantage of Lang Chain here?
@Roopkishore7263 күн бұрын
Assembly AI provide excellent services But some problems to make android application. Please make a video to make application for convert audio file to text in android studio. I am waiting...❤❤❤
@cesaravalos75914 күн бұрын
2:00 Portugeese 🦆🦆🦆🤣🤣🤣
@ozbekcha_minecraft5 күн бұрын
how can i change accent
@dhruvmehta29515 күн бұрын
TranscriptionException: File does not appear to contain audio. File type is text/html (HTML document, ASCII text, with very long lines (56754)). I am getting this error please resolve this
@AssemblyAI2 күн бұрын
Hi there! If you are using a link to a remote file, the file must be (1) publicly accessible and (2) the download link of the audio file. The error you are getting indicates that you are probably not using a download link. To check this, paste the link you want to use into an incognito tab in your browser. If the audio file starts downloading to your computer, then it is a public working download link!
@zyadbrave95545 күн бұрын
Slides please
@sereneThePity5 күн бұрын
backstreet freestyle
@weebiesoftware62965 күн бұрын
I want to implement a realtime app using voice recognition on python 3 / android 11 on my samsung s22. It's my understanding portaudio is NOT supported on Android 11. Is portaudio your only way to get to the mic?
@user-mv9ul9tz1c6 күн бұрын
Hello, I have a long text that I would like to split into 10 segments. I plan to summarize each segment using an API or assistant and then integrate these summaries. How can I ensure that all these interactions occur within the same conversation thread like in ChatGPT interface, allowing the API to remember the context? Thank you.
@jpsl52816 күн бұрын
My friend we are using your solution but we have a problem , this only works for incoming calls, for outgoing calls the stream from twilio looks like S### and hears like it, How do you handle the outgoing call stream. The way we are doing it only works some times
@nemonemo62856 күн бұрын
Perfect thank you.
@QuintinMassey6 күн бұрын
A Woman, questionable (it is 2024 after all). A Female a little more certain (same reason) 😂
@vivekanade4546 күн бұрын
I heard the only way to get AI/ML job is to get Masters or Phd.
@aeharrison1able6 күн бұрын
is that code available?
@Kunalg00036 күн бұрын
api key kaise milegi
@simonsandeep49776 күн бұрын
The programming is not responding after the first introduction ,as shown in the video ;though even after using the github code. Any alternative with step by step instruction video ?
@fluffykitties90206 күн бұрын
is this muli-lingual?
@skybuck20007 күн бұрын
This extension has disappeared from VS Code.
@noobjok3r6408 күн бұрын
can you help me like i want to a image classification ml model to deploy but in ur video its string version
@user-sj6eu8sp9v8 күн бұрын
Thanks for the awesome tutorial! Is there some way to map Speaker A to a known speaker? I was thinking of something like speaker embeddings? Also, is it possible to use this in a realtime application?
@sameerman118 күн бұрын
nice & thanks
@ducbuivan93788 күн бұрын
thank you
@estelitaribeiro41969 күн бұрын
Thanks! Great information in a very objective way!
@harshitvijay1979 күн бұрын
damn this whole series is like a gold mine ... i was suspicious of how a so well know topic be covered in so less time ... the videos might be not good but happy to be proven the wrong. THESE ARE GOLD ... thank you @AssemblyAI & thank you very much Ma'am for helping.
@kerduslegend26449 күн бұрын
using tensorflow for AI is a beginner plaything. try allocating each and every memory on an ASSEMBLY for each and every weights on the neurons for a gigachad move
@ChrisBrogan9 күн бұрын
I just watched an IBM explanation of vector databases and came away lost. Then I watched yours, and got it right away. Point goes to you. ;)
@nithishreddy76849 күн бұрын
An error occured: Could not connect to the real-time service: [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:997) what to do with this error?
@islamicinterestofficial8 күн бұрын
same error. You found the solution?
@chittisai47Күн бұрын
most likely your microphone is switched off pls check
@valentinleguizamon99579 күн бұрын
❤❤❤❤
@iainhmunro10 күн бұрын
Hi There - I was just looking at the code. Where is the appointment setting details / info coming from ?
@AssemblyAI7 күн бұрын
All that is coming from the LLM we are using, so it's not hard-coded.
@WizardOrdinals10 күн бұрын
Just found you. So happy rn LOL
@WizardOrdinals10 күн бұрын
Just found you. So grateful
@muhammad.hameem10 күн бұрын
Some of the activation functions are : 1) Sigmoid function 2) ReLu 3) Leaky ReLu