Exploring OpenAI's New GPT-4o Audio Preview Model: The Future of AI Audio Processing

  Рет қаралды 3,560

Bart Slodyczka

Bart Slodyczka

Күн бұрын

Пікірлер: 19
@BartSlodyczka
@BartSlodyczka 2 ай бұрын
🗂 GET ALL THE CODE FILES: bartslodyczka.gumroad.com/l/jeznwq 📋 Take This Quick Survey: forms.gle/otAr1xUamgyYZE5y7 📺Realtime API Tutorial Series: kzbin.info/aero/PLi7jtY2ZZqRYE8Lvw4MuLHTZPYTA4jZHQ&si=7DAE9z7YtQlMrzrd
@AngeloXification
@AngeloXification 2 ай бұрын
Instant subscription, then I saw you build and provide resources. Excellent content.
@BartSlodyczka
@BartSlodyczka 2 ай бұрын
Thanks legend 🤝
@alexanderkingstam5164
@alexanderkingstam5164 2 ай бұрын
You are very pedagogic and explaining very well. Thanks for sharing!
@BartSlodyczka
@BartSlodyczka 2 ай бұрын
thank you very much, appreciate this comment 🙏
@GiovanneAfonso
@GiovanneAfonso 2 ай бұрын
very well structured video and test, great work! hope you do more videos
@BartSlodyczka
@BartSlodyczka 2 ай бұрын
thanks legend! Will do 💪
@derherrdirector
@derherrdirector Ай бұрын
You are an absolute legend! You should have millions of subscribers
@BartSlodyczka
@BartSlodyczka Ай бұрын
haha! thank you my man!
@pixelperfectpravin
@pixelperfectpravin 2 ай бұрын
Most onpoint video 😍 i appreciate you
@BartSlodyczka
@BartSlodyczka 2 ай бұрын
thanks man! I appreciate you too 💪
@Rhiever
@Rhiever Ай бұрын
If you’re just performing audio to text, is it necessary to specify both text and audio modalities? Will the model just ignore the audio file if you don’t specify both modalities?
@BartSlodyczka
@BartSlodyczka Ай бұрын
I haven't tested if the model will ignore it and yeah also not sure if you need to specify both. Made this code a couple weeks back and can't recall from the top of my head 🙏
@vsigal
@vsigal 2 ай бұрын
is it doing diarizarion? separation voices - voice1 - voice2 etc?
@BartSlodyczka
@BartSlodyczka 2 ай бұрын
I just tested using short audio with 2 speakers talking to each other. I asked for a transcript of the convo broken down by speaker and it gave me the below: **Speaker 1:** So, Erin, in your email you said you wanted to talk about the exam. **Speaker 2:** Yeah, um, I've just never taken a class with so many different readings. I've managed to keep up with all the assignments, but I'm not sure how to... how to... **Speaker 1:** How to review everything? **Speaker 2:** Yeah. In other classes I've had, there's usually just one book to review, not three different books. Plus all those other text excerpts and videos...
@vsigal
@vsigal 2 ай бұрын
@@BartSlodyczka wow wow, I will try. thank you
@yurijmikhassiak7342
@yurijmikhassiak7342 2 ай бұрын
Thanks. How is that different from whisper voice to text? For voice to text usecase? The price difference is 10x. Is it faster? Is Quality better? The price looks stull very high. Like 20$/ hour of voice conversation. Almost, the cost of hiring humans for talking).
@BartSlodyczka
@BartSlodyczka 2 ай бұрын
Haven't done any work with whisper voice to text so i cant say, but in the demo I show this new audio model recognise abstract sounds and not just speech. So if whisper is cheaper for now, then you might stick with that for speech to text. Whereas for more dynamic sound recognition, you can use this audio model
OpenAI API Masterclass: Platform, Models & API Explained (Part 1/5)
30:02
15 INSANE Use Cases for NEW Claude Sonnet 3.5! (Outperforms GPT-4o)
28:54
Mom Hack for Cooking Solo with a Little One! 🍳👶
00:15
5-Minute Crafts HOUSE
Рет қаралды 23 МЛН
Каха и дочка
00:28
К-Media
Рет қаралды 3,4 МЛН
coco在求救? #小丑 #天使 #shorts
00:29
好人小丑
Рет қаралды 120 МЛН
How to Code Smarter with ChatGPT Projects
29:39
Bart Slodyczka
Рет қаралды 2 М.
Understanding OpenAI Real Time API With a Python Demo
15:36
AI Researcher & Developer Frank Fu
Рет қаралды 2,6 М.
GPT-Engineer: Your Own Personal Coding Assistant
18:14
NeuralNine
Рет қаралды 10 М.
Career Advice For A World After AI
23:07
Varun Mayya
Рет қаралды 506 М.
Claude has taken control of my computer...
4:37
Fireship
Рет қаралды 1,1 МЛН
We Proved It: AI Mastering Is A Waste Of Money
23:10
Benn Jordan
Рет қаралды 304 М.