big data is anything that is too large to be opened in excel
@senkottuvelan5 жыл бұрын
True.😂
@Rakkoonn5 жыл бұрын
You say that, but many companies who say they use 'big data' really mean a huge spreadsheet.
@ubummer5 жыл бұрын
big data is anything too large to fit in pandas
@minihjalte5 жыл бұрын
For thats there is microsoft access. Its just large excel :^)
@michaelsommers23565 жыл бұрын
_"big data is anything that is too large to be opened in excel"_ So big data is any table with more than 100 rows?
@BaronSamedi19595 жыл бұрын
According to management all big data can be reduced to one nice coloured 3D-pie chart!
@Walleggwp5 жыл бұрын
And if you don't have a nice upsloping line graph, well... I'm sorry but I'd like to speak to you in my office when are you finished.
@napillnik5 жыл бұрын
@@Walleggwp hockeystick!
@napillnik5 жыл бұрын
mmm, pie... I keep suggesting it but my team starts ignoring me after that.
@Manabender5 жыл бұрын
More V's of data! *Volatility*: How likely is it that this data is received intact? How often do the bits get flipped? *Velociraptors*: How much would this data scare xkcd? *Vaingloriousness*: How hard is the creator of this data trying to shove it in your face despite repeated attempts to get them to shut up? *Vanity*: How likely would the data be to win a beauty pageant? *Vampiricism*: When mirrored, does this data delete itself? *Vaccination*: Has the data been protected from viruses? *Vuvuzela*: Honestly, this one should describe itself.
@jackik14105 жыл бұрын
this is glorious! nearly died here XD
@triton626745 жыл бұрын
Superb haha
@letMeSayThatInIrish5 жыл бұрын
Vastness: Does 'huge volume' not even begin to describe the sheer size of the data? Verse: Is the data in verse form? Viscosity: Does the data flow effortlessly, or does it lump up like blood clots? Vikings: Does the data contain false information about vikings, such as them wearing horned helmets? Vendetta: Is the data vengeful? Viciously vindictive? Vincent van Gogh: Is it art? Vortex: Does the data rotate in ever more violent circular motions around the data center? Vulgarity: Must the data be censored for people in the US? Violas: Would a symphony orchestra make fun of the data?
@jackik14105 жыл бұрын
@@letMeSayThatInIrish Holy data, this is even more ridiculus. The beauty is that each of the makes so much sense by itself and represents an actual (kinda) valid query!
@SniperSpy105 жыл бұрын
Virginity: is it new and pure Violence Level: how likely is it to destroy other data Vocal: how easy is it to be heard Viagraity: can it give the reader a hard on
@macchicken985 жыл бұрын
Hands down best explanation of big data I have seen. I‘m coming from a business degree where we often learn about the 5Vs but don’t really touch on what infrastructure is actually used or needed for using/handling big data. Now I definitely have a better perspective on this!
@koz8575 жыл бұрын
I picture a computer scientist somewhere thinking "Hmm gravity of the data is an important aspect that should define big data." and his friends are like "It doesn't start with a 'V' it won't work"
@galgrunfeld99543 жыл бұрын
Value of importance - how important the data is Based on the value you can manage its position in a data pipeline - e.g what dataset you process first, how much computation power going into processing it, what data is sent to nodes in a network first, etc.
@Epic-so3ek10 ай бұрын
I'm pretty sure them using all v's is to appeal to people who don't have a computer science background (aka managers and execs), or maybe people taking a first course in data science. I don't know that for sure but just the fact they used "velocity" instead of throughput makes me think that. If it was for people with a cs/IT background, that would just confuse them.
@allluckyseven5 жыл бұрын
TIL a little bit about Big Data, but also learned that in England a truck is called a lorry.
@randallanderson49995 жыл бұрын
And a highway is called a motorway.
@a.yashwanth5 жыл бұрын
In India too.
@Jamie-st6of5 жыл бұрын
Ande Yashwanth well yeah, cause england invaded india
@lsmeteor46525 жыл бұрын
And in the us, you park on driveways and drive on parkways
@NoseyNick5 жыл бұрын
That's nothing, they come in different colours (with a u) too! Try saying "red lorry yellow lorry red lorry yellow lorry red lorry yellow lorry" really fast.
@senkottuvelan5 жыл бұрын
8:32 Sean Ridley is an awesome editor. Used the word Process to add Pre Process in the video.💯🔥
@MILCHMONSTER3D5 жыл бұрын
my modded skyrim is big data too much for one computer to handle
@hattrickster335 жыл бұрын
I know what you mean. I literally have to run the game at my local rendering farm to get anything over 10 fps.
@manualvarado22125 жыл бұрын
@@hattrickster33 At least you have a local rendering farm.
@edgekane9585 жыл бұрын
Every Computerphile video deserves a like. Change my mind.
@jasetran21864 жыл бұрын
nope. you're right sir!
@code-dredd5 жыл бұрын
"Big Data" is the confusion that follows after marketing people end up describing technical stuff.
@WilliamAncich5 жыл бұрын
Could not agree more.
@cmonkey635 жыл бұрын
Did you know? The term "Machine Learning" was an invention of the marketing team at IBM in 1959. Machines don't learn, silly. Well, neither do people, much of the time.
@MrCmon1135 жыл бұрын
@@cmonkey63 Machine learning describes precisely what it's about. Really, I cannot think of any better term for it. Computer aided reverse deduction? Knowledge discovery in databases? Automated stochastical analysis? Practical function fitting? Those are all obscurantist, *learning* is what it's about. And who learns? A machine.
@alkis24075 жыл бұрын
@@MrCmon113 Statistical model estimation/fitting would be more accurate IMO. Optimization has been around for ages, why call it learning all of a sudden? (hint: money)
@napillnik5 жыл бұрын
@@alkis2407 algorithms learn. They adapt without code being rewritten, and produce outcomes that haven't been preprogrammed, and get better with experience. That's learning.
@Shadow819895 жыл бұрын
Great to see more of Rebecca! This one was much better presented, seems like she's getting some practice (and confidence). :-)
@AndyH2O5 жыл бұрын
...and is being patronised slightly less.
@sumitrana81147 ай бұрын
Let's take a moment and say that computerphile never disappoints.
@fcs_965 жыл бұрын
This channel is super informative. I'm super pleased that I was able to stumble upon it. Broadens my knowledge of Computer Science.
@RichardT21125 жыл бұрын
It’s not the size of your data that matters, rather how well you process it ...
@MrCmon1135 жыл бұрын
No, it's both. We knew about lots of the best machine learning algorithms more than thirty years ago, but we didn't have the datasets to train them sufficiently. Deep neural networks are comparatively simple, but they perform miracles if you throw tons and tons of data at them.
@RichardT21125 жыл бұрын
Taxtro I see humour isn’t lost on you ... thanks for playing along!
@Monk-E5 жыл бұрын
@@MrCmon113 wow you're cool
@Bnelen3 жыл бұрын
She does a good job of covering many of the important basic concepts.
@MrFloris5 жыл бұрын
Thank you for making these and sharing these lovely videos. They're a fantastic resource.
@stefanjooste35982 жыл бұрын
Love the use of old dot matrix printer paper to try and explain the basics of big data.
@ShankarSivarajan5 жыл бұрын
A quote I heard last week about big data: "We are drowning in data but starved for information." (Paraphrasing John Naisbitt, 1982).
@MrCmon1135 жыл бұрын
Information is just the complexity of the data. What you are looking for is knowledge.
@ecelon5 жыл бұрын
Big data for me is when any text editor I try crashes while opening it...
@DantalionNl5 жыл бұрын
Can we also get videos on big data using none Spark based technologies?
@noredine5 жыл бұрын
It's the opposite of ˢᵐᵃˡˡ data
@adityasanthosh7025 жыл бұрын
rofl
@PaulaJBean5 жыл бұрын
How do you make KZbin render small text?
@noredine5 жыл бұрын
@@PaulaJBean Google tiny text
@PaulaJBean5 жыл бұрын
@@noredine ᵀʰᵃⁿᵏˢ ᶠᵒʳ ᵗʰᵉ ᵗᶦᵖᵎ
@Bordsteinpflaster5 жыл бұрын
I started to research to that topic today and was even on this yt channel to search for stuff ... and tadaaah I see this upload in my subbox, perfect timing :)
@DavidLindes5 жыл бұрын
Good stuff. While I knew each of the concepts, I'd not heard of the "5 Vs" (let alone the 10/whatever)... cool! And wait, is this map/reduce video out already? Must find it. I've been wanting a refresher, because I haven't used it in a while, but it could be useful for me soon.
@fruitfcker53515 жыл бұрын
01:28 I haven't seen that wide of a continuous paper in decades
@uristmcdani2 жыл бұрын
Thanks a lot for this explanation, very clear!
@quratulain8396 Жыл бұрын
Productive video
@raffriff425 жыл бұрын
KZbin views and likes are tracked by traditional databases. KZbin recommendation algorithms use "big data" (although they use views and likes as raw input) "Big Data" systems are mainly interested in the _patterns_ in the data (data = whatever information is fed into the system), and the integrity, or confidence in, the individual atom of data is not very important. OTOH, in traditional databases (bookkeeping, inventory, payroll) the integrity of each atom of data is (with some exceptions) very important indeed.
@vedi0boy5 жыл бұрын
Looking forward to the next video, thanks!
@RAZREXE2 жыл бұрын
Big data is the study material folder in the d drive
@moni72354 жыл бұрын
Thank you Rebecca!
@johndripper5 жыл бұрын
i can listen to u all day :)
@kevind8145 жыл бұрын
Big Data: The lifeblood of Big Brother
@robertboran62345 жыл бұрын
Long time ago i was thinking that we can in theory use Big Data to create new electrical energy that can feed other machines or even the Big Data system itself. When we have huge amount of data, some of it is relevant information (this is used for processing) a second type of data is a second relevant data (this is used to train the Big Data system to improve itself) and the last type is total garbage data (this is still data that has 0 and 1). Now we know that when digital information is deleted from the machine the actual bits of information are not lost but transformed via thermodynamic effects into heat (this heat is raising the temperature of the machine) so when digital data is deleted the machine will heat up a little bit. Now we channel all the heat from all the machines and instead of disposing it we reuse it to produce electricity. So we recycle the "heat" from the machine.
@TheSam19025 жыл бұрын
But you forgot something, it's not the heat that is valuable, it's the heat **differential** . Some datacentres in northern countries uses the temperature difference between the inside of the server room and the outside air to power Sterling engines and produce electricity, but it's still not very efficient. Also iirc the swedish military won a wargame against the US because their submarine were (partially) powered by these Sterling engines making them stealthy than nuclear/diesel powered submarines.
@robertboran62345 жыл бұрын
@@TheSam1902 I agree with the inefficiency. Another way to improved this is by increasing the information density. But i still believe that this will be possible if the system is large enough. I am thinking about interplanetary internet where you need to process all the data of an entire planet. Also we know that information at a quantum level is stored in the surface not in volume. so i am thinking of using black holes as memory.
@lmaoukiddin6802 жыл бұрын
3 inches is pretty big right?
@sooskca5 жыл бұрын
How many Apache projects are there?
@AndyVanee5 жыл бұрын
At the moment... exactly 367
@TheSam19025 жыл бұрын
As much as the number of feathers on a peacock.
@gorgolyt Жыл бұрын
There's only three Vs, the last two were clearly added on because somebody wanted five "Vs" but they really have nothing to do with whether something is big data.
I think it's "rotate / move the rocket" but I hope we learn more about Rebecca's Rockets in a future computerphile video!
@dancingCamels5 жыл бұрын
@@NoseyNick oh yes, on looking again you're right. Hopefully we will find out what it's about!
@nathangek5 жыл бұрын
That's data but, like, really big.
@dirkdigglerswonderlandempo51704 жыл бұрын
How times have changed in my day it was the 4F's now its the 5V's
@hillwin105 жыл бұрын
Does size really matter? It is how the data is used. edit: or "data are"
@michaelsommers23565 жыл бұрын
It depends on whether you are referring to the data individually or collectively.
@thomaspearson87825 жыл бұрын
@@michaelsommers2356 wouldn't you use datum if it was singular, and data otherwise, using "is" for both?
@michaelsommers23565 жыл бұрын
@@thomaspearson8782 Sure, but I was mostly joking.
@MrCmon1135 жыл бұрын
Ok so you have to tell me what distribution produced the following input-output pair: A -> 0 Do you think your chances of guessing the right function improve if I give you more examples? If not, why do you think learning is even possible?
@farqueueman Жыл бұрын
"how big is big" giggles
@jvne_5 жыл бұрын
"How big is big data?" Me: big
@strydomobile Жыл бұрын
Lorries are awesome.
@rock3tcatU2335 жыл бұрын
It's not the size of the data that matters, but how you use it.
@MrCmon1135 жыл бұрын
The size of the data matters a lot. Some things you can only learn from incredibly huge sets of data.
@rednull83155 жыл бұрын
640 kB
@satyris4105 жыл бұрын
That's a big excel file
@maulanaibnusabil52804 жыл бұрын
Can someone explain me the difference between Big Data, ETL (Datawarehouse), and Data Engineer. I'm really confused
@Treviath5 жыл бұрын
Would it be possible for you to do a video on the piece of art that is called Wireguard?
@gabetower5 жыл бұрын
I won't be content until you have more V's than the speech from V for Vendetta. Voila!
@MoonMarshmallow5 жыл бұрын
Rebecca is so cute!! ❤
@polygondwanaland83905 жыл бұрын
@MichaelKingsfordGray What's your address and credit card number? Wouldn't want to be anonymous and cowardly, big man.
@inzanozulu5 жыл бұрын
Use your inside voice. It's not a problem to find somebody attractive, but did that really need to be in a comment on this video?
@olik1365 жыл бұрын
I think data has to be at least this >| |< big... maybe even this >| |< big...
@edge46945 жыл бұрын
I hate how the sound of the pen lags behind the actual pen
@rendogsbiggestfan5 жыл бұрын
I didn't realize but now I can't not realize, you monster
@StrangeIndeed4 жыл бұрын
I've realized that 5 V makes for a very nice mnemonic. V is 5 in roman numerals, so you can pretty easily remember that there are 5 Vs. It's probably just an accident, but makes it things a little easier to remember c:
@laser-sj5 жыл бұрын
Mind tickled :p
@LathosZan5 жыл бұрын
Always like for gals in tech!
@axcelleria5 жыл бұрын
Well she's sweet
@snake1625b5 жыл бұрын
Generally, more than 10 terabytes is big Data usually
@Alex18915 жыл бұрын
When I was a kid, I used to say things priced at $30 or greater were expensive, regardless of context. ;)
@snake1625b5 жыл бұрын
@@Alex1891 most things in life are subjective and don't have a definitive answer. But it's definitely possible to give a generalized average answer. In this case you can say the AVERAGE server can only process less than 1 terabyte of typical data and thus you'll need multiple computers to process the data. The most unhelpful and pedantic answers you can give is something annoying like " it depends. It's subjective. It varies from problem to problem".
@BlackHermit5 жыл бұрын
big data is anything that is too large to be opened in excel
@realityveil61515 жыл бұрын
Hey, she's back! The cute nerdy chick!
@mannycalavera1215 жыл бұрын
I like to pretend i'm smart enough to understand what's going on in this video :)
@kennb335 жыл бұрын
Isilon
@JanB16055 жыл бұрын
How I love me some pretty, intelligent women in STEM. Great Video, was always wondering what big data really is.
@xakkep90005 жыл бұрын
coool
@grainfrizz5 жыл бұрын
Vig data
@Ubeogesh5 жыл бұрын
so where's that map reduce video?
@misterhat58235 жыл бұрын
She's adorable.
@willhendrix865 жыл бұрын
In before your entire life and your rights are represented in a 5 star rating system; And yes I have seen that black mirror episode ( ' ', )
@gqh0075 жыл бұрын
In before killer robot bees
@BrokebackBob5 жыл бұрын
Data storage is now totally separate physically from the computers that access it. The idea of defining big data as the max that a single computer can process is laughable.
@szebohalasz77935 жыл бұрын
I dont thik so, given the fact that you mostly need the "computer" to process the data. Also its just metaphorical not absolute definition, as the BigData itself.
@senkottuvelan5 жыл бұрын
KZbin IS BIG.
@mtranchi5 жыл бұрын
gah, too cute
@SephirothDL5 жыл бұрын
Splunk
@MarcelRobitaille5 жыл бұрын
A buzzword?
@rogerc7960 Жыл бұрын
NSA/GCHQ giggles at your ignorance
@alittlebyte5 жыл бұрын
00:01 "How big is big?" LOL
@mochrDK5 жыл бұрын
Do you pronounce it Data or Data?
@sensibleb5 жыл бұрын
Star Trek TNG changed the way I pronounce 'data'.
@iLiokardo5 жыл бұрын
big data is management and measurement of private data
@nosuchthing85 жыл бұрын
Lorries are trucks in pondese
@TheGreenTaco9995 жыл бұрын
Big data energy
@pajeetsingh3 жыл бұрын
I want to **** *** *****
@catklyst5 жыл бұрын
BIG DATA ENERGY
@vjndr324 жыл бұрын
Can you please stop writing on paper with markers or atleast suppress the sound of it? The sound gives me goosebumps while watching videos. Doesn't it happen to anyone else?
@mohammedmohammed5194 жыл бұрын
Vijender Singh No.
@maggieselbstschopfer19563 жыл бұрын
0:02 How Big is Big!!! My ex BFs mmmm..
@AnimilesYT5 жыл бұрын
Big Data? You mean that folder on my computer called "internet explorer"? xD
@billykotsos46425 жыл бұрын
Yeah BABY
@BrandonJohnson2k75 жыл бұрын
the lack of technical words being used to describe computers and the database hurts. relational database, datacenter, server
@RatnaMaharaj5 жыл бұрын
The nerds really need to learn to communicate well, otherwise the likes of Zuckerberg will keep winning.
@jonasgrnbek71135 жыл бұрын
wtf haha
@MrCmon1135 жыл бұрын
What's wrong with Zuckerberg? What nerds are you talking about anyways?
@AudioPervert15 жыл бұрын
No one seems to Talk about the huge amounts of energy and electricity needed to churn all such big data (all fossil fuel based) ** the big data can stop the sixth mass extinction ?? Carry on technology based omnicide.
@jmonsted5 жыл бұрын
Our datacenters were pretty much entirely wind powered. Many of the big datacenters try to make deals to buy 100% green power.
@polygondwanaland83905 жыл бұрын
Where do you get the idea that datacenters are "all fossil fuel based" when there's entire states, provinces and countries that don't run any fossil fuel on their grid?
@hoola_amigos5 жыл бұрын
This is a relatively poor computerphile video.. all theory and bookish explanations.. I didn't take home satisfactory amount of knowledge.. Wouldn't be able to explain anything (except for 5 Vs) :/ Maybe it is the topic itself which makes it very difficult to practically explain it to someone who isn't familiar with all those terms..
@mini_frank5 жыл бұрын
when he said how big is big, my first thot was the unit inches...
@mountedczarina92055 жыл бұрын
She’s very cute.
@bbuggediffy5 жыл бұрын
No that's not the definition. It's the CAP theorem that defines it.
@Ashkenya5 жыл бұрын
Sorry but there is no such thing as Big Data. There might be big database, or big chunks of data, large amount of data, heaps of data. But data in computer term is just a collection of what we humans classify/distinguish/label as "1s & 0s"
@ivahardy48854 жыл бұрын
but is it pronouced data? or data?
@roshsiddiqui87455 жыл бұрын
you are so cute and adorable, i am in love cuz along with all of that u are into tech and have a techy brain
@iiiiii-w8h5 жыл бұрын
That's a funny looking kitchen
@omri93255 жыл бұрын
The definition is really lame
@joonasfi5 жыл бұрын
You can make any data big data by exporting it in XML
@Lodinn5 жыл бұрын
It's hoooman-readable format though amirite? :>
@satyris4105 жыл бұрын
More than 16,384 columns = Big Data.
@derpimusmaximus88155 жыл бұрын
"This data is small, but the data over there is far away."
@recklessroges5 жыл бұрын
Thanks Ted.
@bencrossley6475 жыл бұрын
Best / most unexpected comment I’ve ever laughed at. I can see him looking so confused.