OpenAI's Sora Made Me Crazy AI Videos-Then the CTO Answered (Most of) My Questions | WSJ

  Рет қаралды 543,858

The Wall Street Journal

The Wall Street Journal

Күн бұрын

Пікірлер: 1 400
@wsj
@wsj 10 ай бұрын
Watch the full, wide-ranging interview with OpenAI CTO Mira Murati here: on.wsj.com/3TNv2Mu
@elithecomputerguy
@elithecomputerguy 11 ай бұрын
The CTO can't talk about where the training data came from...
@blackspetnaz2
@blackspetnaz2 11 ай бұрын
They cannot talk about the fishy things they did to achieve this.
@tiromandal6399
@tiromandal6399 11 ай бұрын
​@@blackspetnaz2 Lots of cute animals died very painful deaths for the meds that's been keeping everyone you ever knew alive but you're not complaining about that. What a hypocrite!
@neon9165
@neon9165 11 ай бұрын
@@tiromandal6399Not to mention that a lot of their data comes from their partnership with shutterstock, thats why a lot of the videos look so stock footage like but i do imagine there are still some copyrighted things in there so it makes sense she isnt answering a question that could be used in court, they (and maybe google) are perhaps the closest to non copyright infringing ai
@3dus
@3dus 11 ай бұрын
Well...are you comparing dead animals to copyright royalties? Is it OK for a foreign company to steal OpenAI undisclosed science and use it to create a better model that advances AI?
@christiancrow
@christiancrow 11 ай бұрын
Closed ai open public images free for the taking ?
@revestrek1
@revestrek1 10 ай бұрын
I´m so glad this interviewer asked some real questions, Im sick of all the shallow questions over naive smiles. AI can be good, but also dangerous. It´s important we understand how serious it is, and train and use it carefully.
@DevineLuLinvega
@DevineLuLinvega 11 ай бұрын
"I can neither confirm nor deny that we trained on youtube videos"
@Test-ny6uh
@Test-ny6uh 11 ай бұрын
Even if you have been trained, these are public videos. There's nothing wrong.
@krishmav
@krishmav 11 ай бұрын
​@@Test-ny6uhKZbin is many people's livelihood. If they're training on that data, they're stealing their income in the future. We need to start banning AI for somethings. Or else it's just one job after another.
@Rosscifer
@Rosscifer 11 ай бұрын
@@Test-ny6uh No. That's not how it works. There are very strict laws concerning exactly what constitutes fair use and despite this copyrighted material gets uploaded to youtube a million times a day. It needs to constantly be taken down. Just because the character from Spongebob was visible in some fair use clips and some illegally uploaded episodes doesn't mean the character can be plagiarized at will.
@HighDoping
@HighDoping 11 ай бұрын
@@Test-ny6uh But if you re-upload a KZbin video, you'll be hit with a Content ID claim.
@stuckonearth4967
@stuckonearth4967 11 ай бұрын
@@Rosscifer Oh, c'mon you copyright people are annoying. AI learns just like how humans learn. Its training is like watching movies for us. We also know how certain characters look and so does AI. It can copy from other videos just like how humans copy from other creators. I think good examples of copying are elves, gnomes, orcs etc.
@unmonged
@unmonged 10 ай бұрын
this is a product i made that i am very proud of. i'm the spokesperson of the product and was sent here to speak specificity about the product only. "how did i make my product", you ask? i don't know.
@EggEggEggg1
@EggEggEggg1 11 ай бұрын
The fact that she dodged that question about the training data isn't very "Open"AI of her.
@anirudhnarla4711
@anirudhnarla4711 11 ай бұрын
Honestly thats every ai chatbot,image generator,llm from every company including google,facebook,microsoft,etc. A training model requires huge amounts of data which isnt just available on hand so they scoop the videos and text from the public internet
@BenGrimm977
@BenGrimm977 11 ай бұрын
It's understandable that they wouldn't want to specify beyond publicly available data. It was a bad question.
@SublimeMind
@SublimeMind 11 ай бұрын
@@BenGrimm977No, it was a terrible answer. "Do you use KZbin to train Sora" the CTO of Open AI: "Durrrr I unno maybe".
@slavloli8755
@slavloli8755 11 ай бұрын
she was clearly scared to answer it lol Surprised that she wasn't vetted for the interview, this question was bound to be asked and she fumbled Maybe OpenAI doesn't care about public's perception, but that's something that can be detrimental later on. Given the rise of luddites and how it may lead to policies to impede AI's growth
@andifadeaway11
@andifadeaway11 11 ай бұрын
@@slavloli8755 People who are wary of AI aren't just luddites. They just have a sense of consequence and what letting the whole world be run by companies who own image data rather than individual creatives might do. Sorry you can't think more than one second into the future to understand why AI restrictions are incredibly important to the future well being of our society.
@punk3900
@punk3900 11 ай бұрын
It's like asking a farmer: - Where do you take your apples from? Do you pick them from trees? - I dont know. If they grow on trees, perhaps yes.
@gokuldastvm
@gokuldastvm 11 ай бұрын
Don't forget - they were 'publicly visible' apples.
@jwilder2251
@jwilder2251 11 ай бұрын
This is one of the funniest comments I’ve ever read 😂
@throwawaydude3470
@throwawaydude3470 10 ай бұрын
She picked them from her neighbors trees. Not the same.
@kamu747
@kamu747 10 ай бұрын
This is very funny. You got me cracking me up. So the question would have been phrased more like whose trees do you pick them from asking a farmer who owns an Apple far that spans thousands of acres neighbouring another farmer who also grows apples. So her answer is: only trees I'm permitted from picking, from our farms yes, which other trees, I can not say, but if I was permitted then Yes .
@dlebensgefahr
@dlebensgefahr 10 ай бұрын
No, it is not like that.
@JohnCillian
@JohnCillian 11 ай бұрын
4:50 She's smart enough to know where the models came from and she's hesitant to say because of potential lawsuits.
@mack-uv6gn
@mack-uv6gn 11 ай бұрын
Copyright?
@anirudhnarla4711
@anirudhnarla4711 11 ай бұрын
They scoop the data from literally everything public on the internet which is a lot. And not just sora but every llm,chatbot and image generator too. Its the same for everything
@SublimeMind
@SublimeMind 11 ай бұрын
@@anirudhnarla4711Oh so the "everyone does it so it's legal" defense.
@blackspetnaz2
@blackspetnaz2 11 ай бұрын
She knows
@mistressfreezepeach
@mistressfreezepeach 11 ай бұрын
is she smart? it's not obvious to me
@mrtang18
@mrtang18 11 ай бұрын
Joanna Stern is the best! Love how she grilled her about the training data - super awkward response 😂
@jolness1
@jolness1 11 ай бұрын
The vagueness of her response about what was used from the CTO shows she definitely knows they’re using copyrighted content imo.
@mistressfreezepeach
@mistressfreezepeach 11 ай бұрын
or she really doesn't and is just there as a pretty head
@Korodarn
@Korodarn 11 ай бұрын
Copyright is automatic. Unless they were training on videos from the early 20th century they couldn't be training on videos with no copyright excepting the very few people who bother to put on a CC 0 or similar license
@billf1748
@billf1748 11 ай бұрын
If you watch a video and learn something from it, does that mean you copied it, and thus, violated copyright? Of course not. What people don't seem to understand is that AIs are utilizing digital neural nets; they do not need to copy to learn--they simply adjust their weights and biases. Nothing has been copied. Learning has happened. This is the way it will be argued in courts.
@neon9165
@neon9165 11 ай бұрын
@@KorodarnThey have a Partnership with shutterstock (32 Million videos if shutterstock is making their whole catalogue available) And even then, some countries already established that machine learning isnt copyright infringing
@SahilP2648
@SahilP2648 11 ай бұрын
@@billf1748 well then laws need to change, and fast
@official_youtube
@official_youtube 11 ай бұрын
"My prompt for this crab said nothing about a mustache" is the most 2024 sentence ever.
@KDashCR
@KDashCR 10 ай бұрын
I don’t expect regular people to be impressed by this. For us who work on Computer Science this is like a miracle, can’t wait to dig into this.
@VivekChandra007
@VivekChandra007 10 ай бұрын
AI models can’t cook out of thin air, they always are based on data trained on. If one wants to know the real data it was trained on, look at subtle things it put in not mentioned in prompt
@VejmR
@VejmR 10 ай бұрын
​@@KDashCRby what? Sora or mustache in specific ?
@matheusdemoura5529
@matheusdemoura5529 10 ай бұрын
But it’s clear that Sora AI copied the sponge bob character
@quandovoceleroscomentarios9622
@quandovoceleroscomentarios9622 10 ай бұрын
@@VejmRTHe mustache, obviously
@ryanwilliams989
@ryanwilliams989 4 ай бұрын
AI Stocks are pretty unstable at the moment, but if you do the right math, you should be just fine. Bloomberg and other finance media have been recording cases of folks gaining over 250k just in a matter of weeks/couple months, so I think there are a lot of wealth transfer in this downtime if you know where to look.
@BiancaSherly-qt6sb
@BiancaSherly-qt6sb 4 ай бұрын
you’re right! The current market might give opportunities to maximize profit within a short term, but in order to execute such strategy , you must be a skilled practitioner
@hunter-bourke21
@hunter-bourke21 4 ай бұрын
I've been in touch with a financial advisor ever since I started my business. Knowing today's culture The challenge is knowing when to purchase or sell when investing in trending stocks, which is pretty simple. On my portfolio, which has grown over $900k in a little over a year, my adviser chooses entry and exit orders.
@maggysterling33254
@maggysterling33254 4 ай бұрын
@@hunter-bourke21I'd be glad to get the help of one, but just how can one spot a reputable one? How did you spot this adviser
@hunter-bourke21
@hunter-bourke21 4 ай бұрын
I'm cautious about giving specific recommendations since everyone's situation varies, but I've worked with *Izella Annette Anderson* for years and highly recommend her. See if she meets your criteria.
@TheresaAnderson-kf5xw
@TheresaAnderson-kf5xw 4 ай бұрын
Thanks for sharing, I just looked her up on the web and I would say she really has an impressive background in investing. I will write her an e-mail shortly.
@sdf8085
@sdf8085 11 ай бұрын
I cannot express how wild it seems to me that the *CTO* of one of the biggest genAI companies, promoting their latest genAI product couldn't answer what data the model was trained on.
@tarcus6074
@tarcus6074 11 ай бұрын
It doesn't matter.
@blackspetnaz2
@blackspetnaz2 11 ай бұрын
She got caught, she knows there are lawsuits left and right. What was incredible is she was not prepared for the question. But since they know they did pretty bad praxis she got nervous.
@ayushnews5735
@ayushnews5735 11 ай бұрын
​@@tarcus6074 it does matter.
@Nainara32
@Nainara32 11 ай бұрын
I wouldn't expect the CTO to know the current status of any given contract with a vendor with enough confidence to be quoted on WSJ. These things change all the time and they probably have hundreds of sources of training data just for SORA AI, let alone all the other products that she oversees like ChatGPT and DALI.
@i20010
@i20010 11 ай бұрын
Its all stolen from us.
@acer-star
@acer-star 11 ай бұрын
Remember the Super Phrase: *Publicly available data*
@MrQWUK
@MrQWUK 11 ай бұрын
Exactly my thoughts after watching this. Barely scratching the surface in respect of rigorous query into attribution. Copyright and IP law is being left in the dust, with creative works being flagrantly stolen for marketing hype, stock inflation and profit. It's a sad path "we're" headed down. Another race to the bottom.
@sirdiealot53
@sirdiealot53 11 ай бұрын
She’s right. It’s publicly available. If you don’t want everyone to see your videos don’t post them on the internet. Durr
@JorisGriffioen
@JorisGriffioen 11 ай бұрын
@@sirdiealot53 yes, "see", that's not the contention here.
@carcolevan7102
@carcolevan7102 11 ай бұрын
@@JorisGriffioenIf one can 'see' something, one might also 'learn' from it. So that is the contention. Imagine if you had to get permission before studying anyone else's art. It would be a red-tape paperwork nightmare and will essentially destroy the social function of art. You say copyright law is being violated with "creative works being flagrantly stolen" but copyright law does not require anyone to get permission to view or study anyone else's publicly-available art. Your moral outrage is palpable, but there is no substance to it because copyright law does not require the permission you seem to think it does.
@vrclckd-zz3pv
@vrclckd-zz3pv 11 ай бұрын
Piracy is based. Unauthorized use of personal data is not.
@Manwith6secondmemory
@Manwith6secondmemory 10 ай бұрын
Really good reporting the interviewer asked the correct questions and Mira definitely knows the data they used she does not want to reveal anything because they’ll get crucified
@Manwith6secondmemory
@Manwith6secondmemory 6 ай бұрын
SIKE
@Manwith6secondmemory
@Manwith6secondmemory 6 ай бұрын
SIKE
@Manwith6secondmemory
@Manwith6secondmemory 6 ай бұрын
SIKE
@Outcaster88
@Outcaster88 11 ай бұрын
The way she dodged over and over the question about the training data, clearly their are not using only publicly available data.
@madshader
@madshader 11 ай бұрын
Or, for legal reasons, she is not allowed to make statements on it yet. Not everything is so black and white. One could argue that humans train on publicly available data when they make their own creations, pulling inspiration from everywhere. AI is just much more efficient at this task than a human is.
@shredd5705
@shredd5705 11 ай бұрын
Publicly available doesn't mean you freely use it. Copyrighted work can be publicly available. Just because artwork or contet is online doesn't mean you have rights to use it
@madshader
@madshader 11 ай бұрын
@@shredd5705are you serious? Of course a human can use it as inspiration, the same as an AI. The problem arises when you try and claim the exact copyrighted work is your own. But artists use all kinds of art to draw from and create something new. This has been the practice of every artist through history.
@diyamond
@diyamond 11 ай бұрын
@@madshaderas an artist, i agree that they shouldn’t and it’s disgusting. but through a business perspective… it’s all free to them because technically it is on the internet for anyone to see. it’s not morally correct whatsoever, but it happens.
@Mobri
@Mobri 10 ай бұрын
​@@madshader One could argue that, but completely miss the point or understand why this argument is patently idiotic on the surface of it. Please, go back to speculating about the social importance of dogecoin and leave the rest of us alone.
@ishaankelkar
@ishaankelkar 11 ай бұрын
the questions asked in this interview were really good -- concise and hitting all of the important points. thank you for this informative video
@i20010
@i20010 11 ай бұрын
Yeas, it's rare these days to have intelligent press asking correct questions.
@buttofthejoke
@buttofthejoke 11 ай бұрын
I was just thinking that. All questions that I wanted answered. precise and terse
@BlackParade01
@BlackParade01 11 ай бұрын
Joanna is always a hit
@unawakeful6931
@unawakeful6931 11 ай бұрын
She didn't push back on the jobs subject when the interviewee stated that it will just be a tool. Yes, a tool that anyone can use which will make making movies trivial to where it will undoubtedly have a significant affect on jobs.
@brandnaqua
@brandnaqua 11 ай бұрын
Joanna Stern is the best of the best in the business! 🙌 i love how she's honest without ever being rude. she's top tier! 🎉
@xiphoid2011
@xiphoid2011 11 ай бұрын
In my 40s now, looking back, I am amazed at the acceleration of technological innovation. It's now it's almost impossible to imagine what computing will be like in just 2-3 years.
@Jay-eb7ik
@Jay-eb7ik 11 ай бұрын
1 million x more powerful by 2030
@Vikasslytherine
@Vikasslytherine 11 ай бұрын
We still don't have flying cars
@_ShaDynasty
@_ShaDynasty 11 ай бұрын
wheres my flying car and television watch?@@Vikasslytherine
@ibrahimalharbi3358
@ibrahimalharbi3358 11 ай бұрын
Democracy is a big joke! Taxation is a theft! Laws is only for citizens not owners, for example Copyright. God is real God is not a dead man
@mistressfreezepeach
@mistressfreezepeach 11 ай бұрын
@@Vikasslytherine we do have many things out of 1984, though
@discoverlight
@discoverlight 11 ай бұрын
You should’ve asked Sora to make 10 people playing rock paper scissors in fast pace. 😂
@geaca3222
@geaca3222 11 ай бұрын
😅🤣
@WildVoltorb
@WildVoltorb 11 ай бұрын
😂
@pinch254
@pinch254 11 ай бұрын
😂
@edism
@edism 10 ай бұрын
This is why it's not free, silly people wasting resources. Why can't you just increase the speed of a normal video for this rubbish? Lol
@a.nobodys.nobody
@a.nobodys.nobody 12 күн бұрын
That's funny
@zivzulander
@zivzulander 11 ай бұрын
I'm not going to lie, I spent half the video with my eyes darting to Mira's hands just to make sure she's real 😅
@crazyprayingmantis5596
@crazyprayingmantis5596 10 ай бұрын
She's not
@vectoralphaSec
@vectoralphaSec 10 ай бұрын
​@crazyshe is. prayingmantis5596
@sweetysweety7805
@sweetysweety7805 10 ай бұрын
😂😂
@marttiekstrand4879
@marttiekstrand4879 11 ай бұрын
As Sora can create "cartoon animation" OpenAI should show their list of animation production companies who have licensed their films to be used in the machine learning model. There's not much animation is available in public domain, especially 3D animation.
@shredd5705
@shredd5705 11 ай бұрын
It's obviously stolen stuff. They steal. Most AI companies do. Instead of OpenAI, PirateAI would be more accurate
@bruno2010087
@bruno2010087 10 ай бұрын
although maybe it's possible for it to learn particular styles from static images also, no? then i would say there are many more sources of training data
@marttiekstrand4879
@marttiekstrand4879 10 ай бұрын
Styles of movement in animation can only be modelled from animation. Cartoon characters doesn't move anywhere close to real humans.
@Vincent-lg2jh
@Vincent-lg2jh 10 ай бұрын
Good note, I think they also specifically made a contract with set studios to output "basic" animated movements for the model to learn easily.
@TurnRacing
@TurnRacing 11 ай бұрын
ouch those data training questions hit HARD!
@Jay-rr6me
@Jay-rr6me 11 ай бұрын
They always say AI will change us in good ways and we will be much better but as you know it’s actually gonna go the other way
@fuu812
@fuu812 10 ай бұрын
4:34 GOLDEN reaction
@AkshaySinghJamwal
@AkshaySinghJamwal 11 ай бұрын
Let me get this straight: they've created a model that generates videos indistinguishable from reality, and their genius idea to make that distinction is: a watermark! Wow!
@xlrbossshorts
@xlrbossshorts 10 ай бұрын
There's really nothing they could do as a safety procedure, unless they want to put a huge watermark stamp on the video or limit the things you can create.
@itsmebk6820
@itsmebk6820 10 ай бұрын
Yikes that was one bomb of an interview… they are definitely training it on KZbin/x/fb/ig everything 🤣
@_Tennz
@_Tennz 8 ай бұрын
Publicly shared data and media... Let's all go check the fine print of those terms and conditions we all agree to every time we use social media, shall we?
@alvarortega2
@alvarortega2 11 ай бұрын
That interview took a quick turn! Hehehe
@hemasunchu
@hemasunchu 4 ай бұрын
That sounds like a fascinating journey! Seeing the CTO engage with the community adds so much value. Ever heard of SmythOS? It's like witnessing AI collaborate seamlessly, simplifying complex workflows. Always exciting to stay updated on these tech marvels!
@Jay-rr6me
@Jay-rr6me 11 ай бұрын
What’s the point of replacing creativity we don’t want to live in a world where nothing is creative
@winniethepoohxi1896
@winniethepoohxi1896 11 ай бұрын
If an AI can be creative why does that stop you from being creative separately? Having an AI be creative doesn’t mean you can’t be creative as a hobby all you want. Most artists, authors, and musicians just do it as a hobby already. No AI is going to say you are no longer allowed to be creative. You might not be as creative as an AI but you can still do your own thing.
@midnightjayy
@midnightjayy 10 ай бұрын
⁠@@winniethepoohxi1896AI would not exist with the thousands of artist’s names, works, etc. being inputted into its database. All without anyone’s consent, and the fact that the corporations are profiting off of other’s work, not them. That’s where the problem is. Nothing is stopping anybody from actually learning how to draw, there’s thousands of videos on the internet to tell people how… I just see it as people being extremely lazy and jumping to the end result. It’s cheap.
@stardust6870
@stardust6870 10 ай бұрын
Unfortunately, no one cares. Most people are lazy and happy to say they created something even though it was AI making it for them. They don't care about the creation process. Instead, people care about the output and profit. That's what capitalism did to us. And I'm saying this as a writer who lost work to ChatGPT a year ago.
@SerenityReceiver
@SerenityReceiver 11 ай бұрын
The CTO isn't sure if youtube training data was used???
@dibbidydoo4318
@dibbidydoo4318 11 ай бұрын
well she could've purposely chosen to not look at the dataset to avoid lying in a lawsuit.
@RyanMichero
@RyanMichero 11 ай бұрын
She knows, and can't say.
@krakenj5237
@krakenj5237 11 ай бұрын
Really good interview. Grilled her with sensible questions
@christianf7740
@christianf7740 26 күн бұрын
Of topic but that is the most beautiful CTO I have ever seen.
@kakaeriko
@kakaeriko 11 ай бұрын
not sure of sources??
@jolness1
@jolness1 11 ай бұрын
They’re definitely just feeding every freely usable or copyrighted video around.
@bikedawg
@bikedawg 11 ай бұрын
she's absolutely knows what the sources are. she doesn't want to open openAI for another lawsuit.
@shredd5705
@shredd5705 11 ай бұрын
They are sure, but they don't want to tell because it's stealing copyrighted work. Just like every other AI company training their AIs
@vborovikov
@vborovikov 11 ай бұрын
torrents
@andrewhooper7603
@andrewhooper7603 10 ай бұрын
​@benny-schmidt glad you found a way to shoehorn your personal grievances into this, but she's pretty clearly just lying.
@stealcase
@stealcase 11 ай бұрын
Thank you for asking about the data! This is what people care about: whether their data was ingested to train profitable AI without our consent.
@Fiscotte
@Fiscotte 11 ай бұрын
ClosedAI
@colabear4343
@colabear4343 10 ай бұрын
Q: "Videos on youtube?" CTO: "I'm actually not sure about that."
@therealsimdan
@therealsimdan 11 ай бұрын
“It’s very difficult to simulate the motion of hands.” Every VFX studio from the last 25 years: “Do I mean nothing to you?”
@unknown-fd1yz
@unknown-fd1yz 10 ай бұрын
Doing it with ai and doing it manually is a really different thing. And ai do it quick unlike vfx studios so there is that too.
@MissesWitch
@MissesWitch 10 ай бұрын
When she can't answer where the training data came from , you realise she's just another soulless corporate doll..
@ethanvance3834
@ethanvance3834 11 ай бұрын
I stole a car because i saw it on the street and i thought that was public property so by this logic isn't your car publically available for me to use for my own benefit.
@BrianHill
@BrianHill 11 ай бұрын
Two smart women having a hard-headed conversation about an extraordinarily important topic. Nice vid.
@peterjohansson739
@peterjohansson739 11 ай бұрын
I can confirm that the models were trained on all data, publicly available, licensed and non-licensed etc.
@Brrrrrrrrrtbrrrrttt
@Brrrrrrrrrtbrrrrttt 10 ай бұрын
yeah maybe meta is privately selling them all our data.
@TanakaTsikira
@TanakaTsikira 11 ай бұрын
Hmmmn. The training data issue is going to haunt OpenAI for a while. The law has not caught up with the technology. US Law makers are going to have a tough time with this one.
@dibbidydoo4318
@dibbidydoo4318 11 ай бұрын
It's not going to haunt it, a vital part of every copyright law in the world is that any infringing material has to be substantially similar to the original.
@NoobNoob339
@NoobNoob339 11 ай бұрын
mmm yes cope@@dibbidydoo4318
@salifyanjisimwanza9679
@salifyanjisimwanza9679 10 ай бұрын
​​​@@dibbidydoo4318I agree with you on that part. But I think there's still a significant legal challenge here. The videos/images produced may be substantially different from the copyrighted material but copyrighted material was used to train the models all the same. The consequence of that act of training is what's haunting the law courts at the moment. As a human being, I can observe or read and reproduce what I saw/read in a somewhat different way. AI models have not yet earned that status. Moreover, there are potential data protection issues here. Whatever the case, IP laws are about to undergo possibly the biggest change ever in history.
@dibbidydoo4318
@dibbidydoo4318 10 ай бұрын
​@@salifyanjisimwanza9679 I don't think this will impact IP laws much unless someone believed that the "property" in "intellectual property" was meant to be taken literally.
@Mobri
@Mobri 10 ай бұрын
​@@dibbidydoo4318 In the case of humans making art, yes. In the case of a program creating art, maybe no. But if it is legal to make and sell, then you still have no copyright and I can sell your AI art with no consequence. It can't be both, though.
@atlanta2076
@atlanta2076 11 ай бұрын
CTO: «Sora is based on text prompts». NO! It is NOT. It is based on stolen art. The biggest art robbery in history! And it gets even worse. She pretends (!) that she doesn't know whether they stole from KZbin creators (which they def did). She talks about "licensed data", as if OpenAI had any official license from the Sponge Bob right holders to feed their wicked machine with their property. I'm so disgusted! She says "publicly available" as if it meant "public domain". Any Disney DVD is pubicly available. Don't mean I can use that data as I please. A CTO that is "not sure" where they stole the data from. Give me a break. Furthermore, she keeps saying their in the very early stages, but keeps emphasizing that it'll be ready in a few months with mosts kinks ironed out. Which one is it now? WSJ: why did you not call this women out?
@jacksonfolly
@jacksonfolly 6 ай бұрын
04:33 The grimace says it all
@anirudhnarla4711
@anirudhnarla4711 11 ай бұрын
People are really trying to find faults in it to cope but honestly the flaws are so minor that you have to notice it extensively (except the hands part) and the model is still in its beta phase so this is literally a game changer
@abdulrazack1683
@abdulrazack1683 11 ай бұрын
So true
@jolness1
@jolness1 11 ай бұрын
There are a lot of goofy things that pop up, not just hands. It is super neat but also unbelievably power and compute hungry.
@huckleberryfinn6578
@huckleberryfinn6578 11 ай бұрын
@@jolness1 Just look static image generation from early 2022 and recent images.
@ayushnews5735
@ayushnews5735 11 ай бұрын
Flaw is not in the videos generated. It can be solved. The real problem is her not telling the source of training data despite being the CTO of "Open" AI.
@dibbidydoo4318
@dibbidydoo4318 11 ай бұрын
@@ayushnews5735 Well it's obvious that stuff isn't supposed to spoken about until lawsuits conclude.
@huymaivan8671
@huymaivan8671 10 ай бұрын
The same vibe as "You smuggler, where did you hide those drug" -> "I dont know"
@ropro9817
@ropro9817 11 ай бұрын
Lol, wow, that interview turned really sketchy @4:24... 😆
@ropro9817
@ropro9817 11 ай бұрын
How does the CTO--and former CEO... for 2 days--not know details of what content was used for model training when that is such a controversial hot topic today? 🤔 I call bull💩...
@vetboy627
@vetboy627 11 ай бұрын
@@ropro9817Because that's the magic behind how the models work and she's not going to reveal that to competitors. Also it's not really the CTO's concern what exact data is used to create a model, as long as it's legal and gets results
@IcyyDicy
@IcyyDicy 11 ай бұрын
Obviously. Why shoot yourself in the foot by being correct? Credit is due for WSJ for asking the tough questions and letting viewers judge the non-answers for themselves. I'd argue it tells a lot more.
@teachusmore
@teachusmore 11 ай бұрын
Future lawsuit here…she intentionally concealed the data source. A discovery of their internal communications will likely reveal that they know they are taking copyright material without permission
@lifemarketing9876
@lifemarketing9876 11 ай бұрын
@@vetboy627 Finally someone in the comments using critical thinking instead of dogpiling Mira, when they have absolutely no clue what's going on.
@imadeyoureadthis1
@imadeyoureadthis1 11 ай бұрын
4:32 It makes sense the way she answered it. It's not really a question you can answer honestly. How do you train it? You need everything you can get your hands on.
@EbenezerNimh
@EbenezerNimh 11 ай бұрын
Al + Sora = Job Killer
@trappedcat3615
@trappedcat3615 11 ай бұрын
Job Killer = Job Creator
@aj0zzzz
@aj0zzzz 10 ай бұрын
This was some incredible reporting. Well done to her;
@sushienjoyer
@sushienjoyer 11 ай бұрын
4:34 People are gonna make fun of that answer. However, clearly, the purpose is to dodge, and that answer achieved that.
@MnMEminem
@MnMEminem 11 ай бұрын
She is just a pretty head they put there to reduce the hate they get from the public and reduce the concerns! This girl doesn't have the technology luminance real CTOs have!
@erikouwehand
@erikouwehand 11 ай бұрын
Must mean they are stealing other people's content, otherwise you would not hide it.
@Tharros95
@Tharros95 11 ай бұрын
Thanks captain obvious, what would we have done without you?
@joelface
@joelface 11 ай бұрын
@@erikouwehand I disagree. There are lawsuits and a lot of contention around the training data, and saying a single wrong word could be used against her and the company and end up costing billions of dollars. Of course the interviewer wants details, but that doesn't mean it's smart for her to answer it even if everything seems above board and legal to her.
@the_nows
@the_nows 11 ай бұрын
I think she legit didn't know what the training data specifically is, because she's a bad CTO. Also because that's being kept secret for many
@ariwilsun
@ariwilsun 11 ай бұрын
@3:32 Joanna's smug smile here has me cracking up. 😆
@christiancrow
@christiancrow 11 ай бұрын
Uh oh where that data coming from 😂
@christiancrow
@christiancrow 11 ай бұрын
If it's public , clarify that please ?
@ktktktktktktkt
@ktktktktktktkt 11 ай бұрын
I feel like a CTO should know where the data came from lol
@Hannan_1325
@Hannan_1325 11 ай бұрын
She knows, she is not supposed to make announcements about their secrets on News channel.
@ktktktktktktkt
@ktktktktktktkt 11 ай бұрын
@@Hannan_1325 it's hardly a trade secret if it's publicly available/licensed data and she later confirmed the data includes shutterstock. If anything, she didn't want to say the answer on air because it would sound really bad. AI art has caught a ton of flak for using artists art without permission. I would bet they did actually use youtube videos without permission.
@Korodarn
@Korodarn 11 ай бұрын
@@ktktktktktktkt It's a certainty they used things without permission. There is absolutely no way they could get the permission for all of this, and that's fine, because permission is not required for you to view it, it shouldn't be required for them to view it either.
@ibrahimalharbi3358
@ibrahimalharbi3358 11 ай бұрын
Democracy is a big joke! Taxation is a theft! Laws is only for citizens not owners, for example Copyright.
@carcolevan7102
@carcolevan7102 11 ай бұрын
@@ktktktktktktkt Right. She knows where the data come from and she knows that there is a widespread belief that allowing an AI to study publicly-available images and videos without permission is illegal. It's not, but that doesn't diminish the moral outrage of those who wrongly believe that it is. So the question "where did the training data come from?" is a loaded one premised on the idea that using images and videos scraped from the internet is illegal, even though it isn't. There was no way to educate the public enough during this short interview that an honest answer wouldn't just pour fuel on the misinformation fire that's already blazing around this topic.
@japanmood
@japanmood 7 ай бұрын
How about copyrighting issue. If it's a public available content, this doesn't mean you have the rights to use it for Ai training. While OpenAI makes billions, artists will lose jobs, projects, etc.. Another questions is, does Shutterstock pays the artists if OpenAI is using the huge database from Shutterstock!? The way it should be, is to create a platform/Hub, where all the artists, and writers etc... can upload all their content and get paid by OpenAi for using their content, but not browsing your Facebook or KZbin and basically stealing this info for their benefit.
@rubes8065
@rubes8065 11 ай бұрын
She knows that videos on KZbin were used as training data. She is not a good liar lol it’s her job to know. OpenAI doesn’t want to get sued, that’s why. Sam Altman likes to move fast. They have the LLM models but they don’t have access to enough data. So they take it without paying for it.
@aussiepawsborne9056
@aussiepawsborne9056 11 ай бұрын
I don’t think they legally have to pay for data that is publicly available. The laws haven’t really been established around neural networks yet
@GalaKrond-b7k
@GalaKrond-b7k 11 ай бұрын
Lol, youtube is not entitled to payment.
@choptop81
@choptop81 11 ай бұрын
Sam Altman liked to move fast into his four year old sister’s bed too (read her SA allegations against him)
@coreyjblakey
@coreyjblakey 10 ай бұрын
@@GalaKrond-b7kNo one here gives 2 fs about YT getting a cent dude, we want the video creators to get either paid, or the option to not be in the data, It should be opt in, its currently not even opt out
@collinmartin3589
@collinmartin3589 10 ай бұрын
she certainly did not expect the question bout the source of the training data..... I wish we had journalists like this in South Africa
@shmookins
@shmookins 11 ай бұрын
Why was she avoiding answering the source data? They absolutely know since there is only two major parts: the code, and the learning source. She avoided answering about the Shutter deal as well, even though they had a deal with them and later in the interview confirms that. This doesn't look good.
@FinnC-w3o
@FinnC-w3o 11 ай бұрын
It looks fine. It is obvious she isn't answering because of lawsuits.
@shmookins
@shmookins 11 ай бұрын
@@FinnC-w3o She could have simply said as much. I heard other businesses reply the same, something like: "we can't comment on on going cases" or some such. But even the Shutter deal- which she confirmed later- was also given a vague response at first. It's just odd, that's all. Maybe this person was throne in to fill the simply seat and they don't have experience? Oh, well. Humans gonna human.
@FinnC-w3o
@FinnC-w3o 11 ай бұрын
she obviously has experience, she has been there since 2018. she also basically explicitly said that she cannot comment on it, so idk where the confusion is coming from@@shmookins
@koumorichinpo4326
@koumorichinpo4326 11 ай бұрын
@@FinnC-w3o if they need to be hush hush to not be sued, maybe your little brain could entertain for a moment that its because they are doing the wrong thing
@NoobNoob339
@NoobNoob339 11 ай бұрын
​@@FinnC-w3o "It looks fine. It is obvious she isn't answering because of lawsuits." I see a massive contradiction there
@goodtechdoor
@goodtechdoor 10 ай бұрын
Nice, reliable humans making reliable decisions with far reaching consequences. This is how we got here .. and we shall further!
@billf1748
@billf1748 11 ай бұрын
Unpopular opinion: It doesn't matter that they are using copyrighted material to train their models. Neural nets, whether they are in your brain or in Open AIs models, do not copy information. They use the information (the stimuli) to adjust their weights and biases using their amazing algorithms (backprop, etc.). This is analogous to how we learn. We have not copied the video into our brains, nonetheless, we have learned from it. This is how these cases will be argued in court.
@federicoaschieri
@federicoaschieri 11 ай бұрын
That's just false. Several scientific papers, like "Speak, memory", have proved that AI models store thousands of copyrighted works. For example, all Harry Potter and Game of Thrones saga word by word. Indeed, our brain does have photographic memory, that's why you can recognized your mum 😆
@carcolevan7102
@carcolevan7102 11 ай бұрын
@@federicoaschieri Can you provide a link to this paper? A quick internet search on "Speak, memory AI" turns up a lengthy magazine article about creating an AI version of a recently-deceased friend, but it is not a scientific paper and doesn't say anything about memorizing texts.
@IconoclastX
@IconoclastX 11 ай бұрын
​@@GalaKrond-b7kNah, you're definently in the minority. Youre right on the scientific point but most people think that human societies should be for human beings and that we should regulate things if they are harmful to us(shocking). Irrespective of what the technology does, its bad for society and therefore it needs regulation. Same with how we do nuclear weapons.
@billf1748
@billf1748 11 ай бұрын
@benny-schmidt So if I download Stable Diffusion onto my desktop computer and get it working, I've also downloaded every image it was ever trained on? My computer cannot hold petabytes of data. The reason you see a watermark is because the AI was trained on many images with watermarks. It has learned, incorrectly, that all images, or a certain category of images, should have watermarks. Humans make similar mistakes, like thinking a watermarked image from GenAi means it was copied.
@carcolevan7102
@carcolevan7102 11 ай бұрын
@benny-schmidt Copyright doesn't have anything to say about training on copyrighted works. There is no such thing in copyright law as "unauthorized studying."
@satyapanigrahi6606
@satyapanigrahi6606 11 ай бұрын
Setting aside the concerns and obvious questions around training data, it's so refreshing to see young execs like Mira and Sam taking charge of paradigm shifting technologies
@crypticvisionary
@crypticvisionary 11 ай бұрын
"Im not going to go into the details of the data that was used" = We stole most of the data and wont admit it
@UnsaltedCashew38
@UnsaltedCashew38 10 ай бұрын
*all of it
@Adam-fq5lu
@Adam-fq5lu 10 ай бұрын
I can’t believe you are stealing this video right now :(
@Sma-v4y
@Sma-v4y 10 ай бұрын
4:30 - Lying without blinking - new achievement unlocked!
@freeabt9916
@freeabt9916 11 ай бұрын
will they be using this video as a training to generate a video when the prompt is: did you pay for the data you used?
@gbxgbxgbx
@gbxgbxgbx 10 ай бұрын
Question: "People in the film industry will loose jobs - how do you address that?" Answer: "This is a tool for extending creativity - we want to ask these people to help us develop it to make it even better so they are completely out of jobs".
@Peeps7468
@Peeps7468 11 ай бұрын
Good for the interviewer pushing for the source of the data. That was disappointing that the interviewee didn’t seem to know where the data comes from.
@throwawaydude3470
@throwawaydude3470 10 ай бұрын
she knows exactly where it comes from
@ShaiLysk
@ShaiLysk 10 ай бұрын
Everything will have to be “taken with a grain of salt” we will have to question everything and trust nothing in the future.
@prilep5
@prilep5 11 ай бұрын
Perfect timing release of the best fake video AI maker just in time for the biggest election year in the World
@AnthatiKhasim-i1e
@AnthatiKhasim-i1e 6 ай бұрын
The SmythSO can also provide Email Sending. Not only this, it can do a lot of the things we call an agent
@godmisfortunatechild
@godmisfortunatechild 11 ай бұрын
I think it's definitely worth trying" ie the profit motive is too great not to.
@teebu
@teebu 11 ай бұрын
thats really what keeps her up at night
@carlosdart3118
@carlosdart3118 11 ай бұрын
I not gonna lie the internet is the BEST pool of human knowledge PERIOD!!!
@erikouwehand
@erikouwehand 11 ай бұрын
Part of the internet yes, not all.
@raj5669
@raj5669 11 ай бұрын
Interesting conversation start at 4:42
@looseygoosey23
@looseygoosey23 10 ай бұрын
It is definitely worth it, she is driven, she doesnt care for anything but innovation, seen this on a black mirror episode
@xxxxx409
@xxxxx409 10 ай бұрын
I can literally take watermarked screenshots and get rid of them within 5 minutes because of AI (Which they probably do when they scrape the web for training data).
@thebicycleman8062
@thebicycleman8062 10 ай бұрын
I bet you 100% from this interview onwards OPENAI will have a SPECIALIZED TEAM dedicated ONLY TO TRAIN OPENAI on HOW TO RESPOND TO THAT VERY QUESTION regarding TRAINING DATA - They will be like a pro at it lol
@AndreaDoesYoga
@AndreaDoesYoga 11 ай бұрын
Wow, Sora's capabilities are mind-blowing! 😮
@Keenan686
@Keenan686 11 ай бұрын
what a time to be alive
@NoobNoob339
@NoobNoob339 11 ай бұрын
All the hard work people did that they stole from is amazing, yes.
@stardust6870
@stardust6870 10 ай бұрын
Why does this CTO sound like they found her on the street and asked to participate in a video? Her vocabulary and responses are not something you'd expect from a CTO. She needs to know every detail and understand how to respond to questions professionally. "Mmm, I don't know," and "Actually, I'm not sure" are not valid responses, as no one should know the product better than you. That's why you're the effin' chief technology officer.
@tijujohn83
@tijujohn83 10 ай бұрын
I don't like her
@TheBlackClockOfTime
@TheBlackClockOfTime 10 ай бұрын
Thank you for bringing us the greatest meme in the past 4 days.
@Willibeolder
@Willibeolder 11 ай бұрын
in a distant future I can imagine cops having to shout "hands! show me your hands! Hands in the air!" for more than one reason
@geaca3222
@geaca3222 10 ай бұрын
😂
@sidbrun_
@sidbrun_ 7 ай бұрын
4:30 firstly well done to Joanna for looking closely at the videos and critiquing the flaws, but even more well done to her for asking good questions that challenge the technology which Mira can't seem to answer. Mira's not sure what data they're using to train the models?? How is that not a huge red flag from the Chief Technology Officer? Mira will go from fear mongering by saying that many artists will be out of work, to not being able to even explain where the data comes from. I mean, how can you even train the models without data FROM ARTISTS? These OpenAI people are going to get themselves into some serious trouble in the coming years with where the data's coming from, when some of the large film companies discover their films are being used in the datasets. Can you imagine the Disney lawyers if they ever found one of their files in the training sets?
@davedsilva
@davedsilva 11 ай бұрын
The interviewer forgot to ask if Sora can replace her 😂
@MrRandomPlays_1987
@MrRandomPlays_1987 11 ай бұрын
How can I tell this whole interview wasn't created also by SORA ? LOL, it's that real
@sherriffs2554
@sherriffs2554 11 ай бұрын
Not going into the details of what trained sora because NYT is currently suing...among others.
@joelface
@joelface 11 ай бұрын
exactly.
@ByrdNick
@ByrdNick 11 ай бұрын
SORA exhibits/solicits a human-like inattentional blindness: it produces fluent motion that seems normal at a glance. But upon closer inspection it does stuff that’s as weird as a gorilla walking across the frame (from Chabris and Simon’s famous experiments.)
@diegohernandezp
@diegohernandezp 11 ай бұрын
She absolutely knows what data was used/collected to train the model. Content creators will get zero profits while open AI will become enormously rich. Change the name to pirate AI and stop lying to the public
@carcolevan7102
@carcolevan7102 11 ай бұрын
Are you a pirate for watching this interview video without permission? Your moral outrage is well-meaning, but totally misinformed. Imagine if the law worked the way you seem to want it to and we all had to get every artists permission before we could look at their art. What an absolute nightmare. It would be the total death of all art. Be careful what you wish for.
@NoobNoob339
@NoobNoob339 11 ай бұрын
@@carcolevan7102 nah you just want that to be the case.
@diegohernandezp
@diegohernandezp 10 ай бұрын
@@carcolevan7102 If I watch this interview, meticulously dissect its stylistic elements, and then replicate its script, tone, scene composition, and lighting to produce a super similar content on a mass scale for profit, I would be infringing on the original creators' rights, effectively becoming a pirate. Now, if I were to train a machine to replicate this process, absorbing countless works to generate similar content under a subscription model, yet providing no compensation or credit to the original authors, then my actions would be no different in principle. Don’t get me wrong I love this tools, but they need guardrails.
@PixelsVerwisselaar
@PixelsVerwisselaar 10 ай бұрын
Now its just waiting for the wistleblower 😂🤭
@AntonioAponte00
@AntonioAponte00 11 ай бұрын
We are basically taking everythin we can get out hand
@PHlophe
@PHlophe 11 ай бұрын
Toninho, Meu Deus ! Que situação deprimente
@TotalDec
@TotalDec 10 ай бұрын
They break down human brains, do a seance, then summon Legion. That's the biggest AI secret. Training data, means ppl's memories (life experience).
@DigitalDistortion
@DigitalDistortion 11 ай бұрын
She’s extremely well spoken and it’s a fascinating conversation.
@ropro9817
@ropro9817 11 ай бұрын
Except for the dodge on the training data...
@shredd5705
@shredd5705 11 ай бұрын
Not well informed... or hiding something. They use stolen data, copyrighted data... Also this "It's not a replacement, it's a tool" nonsense is marketing hype they want you to believe. It's a replacement. But they say it's a tool
@xlrbossshorts
@xlrbossshorts 10 ай бұрын
@@ChrisGrump Shes being professional, lawsuit is right around the corner. Its not like everyone is celebrating the creation of this new technology.
@RichardMatey
@RichardMatey 10 ай бұрын
I like how someone who has never been in the creative industry feels like they know the best tools for them after stealing their work just to crush them into poverty. Thanks OpenAi
@pikaso6586
@pikaso6586 10 ай бұрын
The data fell from one of KZbin's trucks
@BuddyBearCreator
@BuddyBearCreator 10 ай бұрын
how are you "concerned" about their "impact"?
@JoshuaFinancialPL
@JoshuaFinancialPL 11 ай бұрын
WSJ you should explore the copyright infringement they're doing
@P4INKiller
@P4INKiller 10 ай бұрын
What copyright infringement?
@matt.stevick
@matt.stevick 8 ай бұрын
Open AI’s Mira is gorgeous 📈🇺🇸🥰
@Bronte014
@Bronte014 11 ай бұрын
This is iPad 12.9 or 11?
@Bronte014
@Bronte014 10 ай бұрын
Now I know, it’s 12.9
@kevinvarghese9919
@kevinvarghese9919 10 ай бұрын
Its incredible that such a cutting edge technology is developed and run by 25-30year old folks.
@punk3900
@punk3900 11 ай бұрын
The data was shady :D
@elinglee8878
@elinglee8878 10 ай бұрын
I cant come up with any way why this will help with humanity
@sapphyrus
@sapphyrus 11 ай бұрын
Looking forward to Stable Diffusion eventually doing this. I'd rather not be bound by censorship and would prefer to create locally without subscription and with open models.
Animals Pick The Super Bowl Winner | Elon Musk Vs. The Weather | Google’s Killer AI
12:24
The Late Show with Stephen Colbert
Рет қаралды 1,3 МЛН
This Video is AI Generated! SORA Review
16:41
Marques Brownlee
Рет қаралды 3,9 МЛН
Sigma Kid Mistake #funny #sigma
00:17
CRAZY GREAPA
Рет қаралды 30 МЛН
My scorpion was taken away from me 😢
00:55
TyphoonFast 5
Рет қаралды 2,7 МЛН
Что-что Мурсдей говорит? 💭 #симбочка #симба #мурсдей
00:19
What Does the AI Boom Really Mean for Humanity? | The Future With Hannah Fry
24:02
How Google Maps, Spotify, Shazam and More Work | WSJ Tech Behind
48:03
The Wall Street Journal
Рет қаралды 2 МЛН
Why It Was Almost Impossible to Make the Blue LED
33:45
Veritasium
Рет қаралды 30 МЛН
What We Get Wrong About AI (feat. former Google CEO)
11:42
Cleo Abram
Рет қаралды 1,1 МЛН
NVIDIA CEO Jensen Huang's Vision for the Future
1:03:03
Cleo Abram
Рет қаралды 1,1 МЛН
The only Sora video you need to see
16:27
Futurepedia
Рет қаралды 25 М.
How the U.S. and China Compete in Planes, EVs, Chips and More | WSJ U.S. vs. China
1:08:28
Sigma Kid Mistake #funny #sigma
00:17
CRAZY GREAPA
Рет қаралды 30 МЛН