Пікірлер
@rmjjanssen2645
@rmjjanssen2645 8 сағат бұрын
Exactly what I needed. Well explained. One question (maybe I missed this in the video) but what is the relation between the environment versus the folder on your computer. Or in other words, in which folder the files created within environment are stored?
@DalazG
@DalazG 12 сағат бұрын
Curious, have you tried using audio-webui instead of elevenlabs? elevenlabs is great but costly, would love to see a tutorial with this as an alternative
@maxyang5143
@maxyang5143 13 сағат бұрын
Awesome video, thanks so much!
@jaypee8768
@jaypee8768 17 сағат бұрын
Adding weights to inputs creates exponentially accentuates biases at the end results. So there is no escaping biases. Only a different type of bias
@farexBaby-ur8ns
@farexBaby-ur8ns 18 сағат бұрын
Nice and concise and not an hr long - “Engineering a prompt” has been misconstrued by folks to be another branch of engineering called “prompt engineering” 🫢🫢😅 and has triggered a load of “prompt engineering” courses wherein charlatans are making money. Imho, folks should invest in learning langchain which is like a framework that will be around for long
@plowe6751
@plowe6751 21 сағат бұрын
I wish there was a video on how to record system audio.
@ScriptureConsolidated
@ScriptureConsolidated Күн бұрын
have you ever seen a dual output layer in line?
@rohanchoudhary672
@rohanchoudhary672 Күн бұрын
How to pass config to RealtimeTranscriber object
@maryamaghili1148
@maryamaghili1148 2 күн бұрын
you are confusing the definitions. in the 2:15 minute you claim a batch with 3 data points is coming to the layer and then instead of having 1mean and var for the entire batch, you calc 3 mean and variance for each neuron, which does not look right. Please revisit your video.
@amitsingh7684
@amitsingh7684 2 күн бұрын
very nicely explained with clear details
@attilakiss8585
@attilakiss8585 2 күн бұрын
I wonder why you put your face on the screen in a way to cover many relevant content, instead of putting it on top right corner for example.
@rupakvignesh
@rupakvignesh 2 күн бұрын
Title says "if I could start over" which assumes someone has gone through it already. If someone did courses and projects I don't agree with your steps. Here's how I would suggest people to do it if they're starting over. 1. Pick a problem you'd like to solve (say object detection or segmentation) 2. Read the seminal papers in this area. You won't understand everything, that's totally normal. Note down the parts you don't understand. 3. Implement the paper yourself without seeing the GitHub. Go deep into the math too if you like it. 4. Repeat for other problems. 5. Have fun.
@vladsaveluc2659
@vladsaveluc2659 3 күн бұрын
Impressive you were able to condense it in 60 seconds. Good job!
@Roopkishore726
@Roopkishore726 3 күн бұрын
Assembly AI provide excellent services But some problems to make android application. Please make a video to make application for convert audio file to text in android studio when button click and text show in textview. I am waiting...
@DevenderKumar-hx4dc
@DevenderKumar-hx4dc 2 күн бұрын
I face same problem
@HarshaJK
@HarshaJK 3 күн бұрын
This video introduces eight Python libraries for audio processing. 🔊 Displaying and playing audio files using ipython display 00:10 📁 Reading and writing audio files with soundfile 0:40 🔊 Opening and writing WAV files using the wave module 1:18 🎤 Reading microphone input with pyaudio 2:06 🎤 Reading microphone input with sounddevice 03:15 🎵 Manipulating audio with pydub 03:58 🎵 Audio analysis with librosa 04:49 🔥 Audio processing with torch audio 05:47
@HarshaJK
@HarshaJK 3 күн бұрын
Great tutorial!! I am curious to know the VS Code theme that you are using. It looks very pleasing to the eye and want to try it out!!
@nemonemo6285
@nemonemo6285 3 күн бұрын
High end Neural Networks in their current form do not work. They are not scalable, consume massive amounts of energy and compute power, i.e. you need a datacentre, don't learn after training and are inherently very unsafe etc!!!! Solution, Liquid Neural Networks!!!!!
@jamesomina4119
@jamesomina4119 3 күн бұрын
Great! I Like the explanation.
@jonron3805
@jonron3805 3 күн бұрын
Not sure whats the adwantage of using a "PromptTemplate" instead of an f-string. I mean we could just create a prompt=f"Question '{question_text}' Lets think step by step" Rt? So whats the advantage of Lang Chain here?
@Roopkishore726
@Roopkishore726 3 күн бұрын
Assembly AI provide excellent services But some problems to make android application. Please make a video to make application for convert audio file to text in android studio. I am waiting...❤❤❤
@cesaravalos7591
@cesaravalos7591 4 күн бұрын
2:00 Portugeese 🦆🦆🦆🤣🤣🤣
@ozbekcha_minecraft
@ozbekcha_minecraft 5 күн бұрын
how can i change accent
@dhruvmehta2951
@dhruvmehta2951 5 күн бұрын
TranscriptionException: File does not appear to contain audio. File type is text/html (HTML document, ASCII text, with very long lines (56754)). I am getting this error please resolve this
@AssemblyAI
@AssemblyAI 2 күн бұрын
Hi there! If you are using a link to a remote file, the file must be (1) publicly accessible and (2) the download link of the audio file. The error you are getting indicates that you are probably not using a download link. To check this, paste the link you want to use into an incognito tab in your browser. If the audio file starts downloading to your computer, then it is a public working download link!
@zyadbrave9554
@zyadbrave9554 5 күн бұрын
Slides please
@sereneThePity
@sereneThePity 5 күн бұрын
backstreet freestyle
@weebiesoftware6296
@weebiesoftware6296 5 күн бұрын
I want to implement a realtime app using voice recognition on python 3 / android 11 on my samsung s22. It's my understanding portaudio is NOT supported on Android 11. Is portaudio your only way to get to the mic?
@user-mv9ul9tz1c
@user-mv9ul9tz1c 6 күн бұрын
Hello, I have a long text that I would like to split into 10 segments. I plan to summarize each segment using an API or assistant and then integrate these summaries. How can I ensure that all these interactions occur within the same conversation thread like in ChatGPT interface, allowing the API to remember the context? Thank you.
@jpsl5281
@jpsl5281 6 күн бұрын
My friend we are using your solution but we have a problem , this only works for incoming calls, for outgoing calls the stream from twilio looks like S### and hears like it, How do you handle the outgoing call stream. The way we are doing it only works some times
@nemonemo6285
@nemonemo6285 6 күн бұрын
Perfect thank you.
@QuintinMassey
@QuintinMassey 6 күн бұрын
A Woman, questionable (it is 2024 after all). A Female a little more certain (same reason) 😂
@vivekanade454
@vivekanade454 6 күн бұрын
I heard the only way to get AI/ML job is to get Masters or Phd.
@aeharrison1able
@aeharrison1able 6 күн бұрын
is that code available?
@Kunalg0003
@Kunalg0003 6 күн бұрын
api key kaise milegi
@simonsandeep4977
@simonsandeep4977 6 күн бұрын
The programming is not responding after the first introduction ,as shown in the video ;though even after using the github code. Any alternative with step by step instruction video ?
@fluffykitties9020
@fluffykitties9020 6 күн бұрын
is this muli-lingual?
@skybuck2000
@skybuck2000 7 күн бұрын
This extension has disappeared from VS Code.
@noobjok3r640
@noobjok3r640 8 күн бұрын
can you help me like i want to a image classification ml model to deploy but in ur video its string version
@user-sj6eu8sp9v
@user-sj6eu8sp9v 8 күн бұрын
Thanks for the awesome tutorial! Is there some way to map Speaker A to a known speaker? I was thinking of something like speaker embeddings? Also, is it possible to use this in a realtime application?
@sameerman11
@sameerman11 8 күн бұрын
nice & thanks
@ducbuivan9378
@ducbuivan9378 8 күн бұрын
thank you
@estelitaribeiro4196
@estelitaribeiro4196 9 күн бұрын
Thanks! Great information in a very objective way!
@harshitvijay197
@harshitvijay197 9 күн бұрын
damn this whole series is like a gold mine ... i was suspicious of how a so well know topic be covered in so less time ... the videos might be not good but happy to be proven the wrong. THESE ARE GOLD ... thank you @AssemblyAI & thank you very much Ma'am for helping.
@kerduslegend2644
@kerduslegend2644 9 күн бұрын
using tensorflow for AI is a beginner plaything. try allocating each and every memory on an ASSEMBLY for each and every weights on the neurons for a gigachad move
@ChrisBrogan
@ChrisBrogan 9 күн бұрын
I just watched an IBM explanation of vector databases and came away lost. Then I watched yours, and got it right away. Point goes to you. ;)
@nithishreddy7684
@nithishreddy7684 9 күн бұрын
An error occured: Could not connect to the real-time service: [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:997) what to do with this error?
@islamicinterestofficial
@islamicinterestofficial 8 күн бұрын
same error. You found the solution?
@chittisai47
@chittisai47 Күн бұрын
most likely your microphone is switched off pls check
@valentinleguizamon9957
@valentinleguizamon9957 9 күн бұрын
❤❤❤❤
@iainhmunro
@iainhmunro 10 күн бұрын
Hi There - I was just looking at the code. Where is the appointment setting details / info coming from ?
@AssemblyAI
@AssemblyAI 7 күн бұрын
All that is coming from the LLM we are using, so it's not hard-coded.
@WizardOrdinals
@WizardOrdinals 10 күн бұрын
Just found you. So happy rn LOL
@WizardOrdinals
@WizardOrdinals 10 күн бұрын
Just found you. So grateful
@muhammad.hameem
@muhammad.hameem 10 күн бұрын
Some of the activation functions are : 1) Sigmoid function 2) ReLu 3) Leaky ReLu