Foundations of Data Visualisation - Computerphile

  Рет қаралды 72,735

Computerphile

Computerphile

Жыл бұрын

Following a look at 'Sensemaking' Associate Professor Dr Kai Xu delves into some more tricks of the visualisation trade.
Kai's presentation:
docs.google.com/presentation/...
/ computerphile
/ computer_phile
This video was filmed and edited by Sean Riley.
Computer Science at the University of Nottingham: bit.ly/nottscomputer
Computerphile is a sister project to Brady Haran's Numberphile. More at www.bradyharan.com

Пікірлер: 83
@patton72010
@patton72010 Жыл бұрын
"Mr. Anderson, you are the expert in all matters related to drawing red lines. We need you to draw seven red lines. All of them strictly perpendicular, some with green ink and some with transparent. Can you do that?"
@3zrv
@3zrv Жыл бұрын
* nervously sweats with 2 perpendicular sweat drops on his forehead and the 3rd drop doesn't know where to go *
@JinKee
@JinKee Жыл бұрын
When you inflate the balloon, can you do it in the shape of a kitten?
@patton72010
@patton72010 Жыл бұрын
​@@phandao5404You are talking about different Mr Anderson lol
@arnoldbr8418
@arnoldbr8418 11 ай бұрын
yo thats a swastika?
@Aziqfajar
@Aziqfajar Жыл бұрын
Can't wait to have my data visualized with electric shock intensity! ❤
@alphgeek
@alphgeek Жыл бұрын
Let me run that past the ethics committee for a sec.
@moralboundaries1
@moralboundaries1 Жыл бұрын
If we're talking about, visualizations, that must mean the electrodes will be hooked up to our eyeballs. Extra FUN!
@TheAgentOfDeath
@TheAgentOfDeath Жыл бұрын
Thanks for the free, high-quality education.
@F_L_U_X
@F_L_U_X Жыл бұрын
Welcome.
@Tiwo1991
@Tiwo1991 Жыл бұрын
When he mentioned marks and channels with the specific examples, I immediately think of the analogy of how these are used in cartography and the choices made there.
@moralboundaries1
@moralboundaries1 Жыл бұрын
This is the other kind of graph theory.
@mattgenovese
@mattgenovese Жыл бұрын
I would consider data visualization as closer to the discipline of User Experience design.
@MaeLSTRoM1997
@MaeLSTRoM1997 Жыл бұрын
To anyone who is interested in data visualization: I highly recommend the five books on data visualization by Edward Tufte, particularly the first one, "The visual display of quantitative information." He is the founding figure of the field of data visualization and his books are very interesting and pleasant to read.
@leogama3422
@leogama3422 2 ай бұрын
thanks!
@Nafrodite
@Nafrodite Жыл бұрын
dr xu has impeccable drip
@willemrood
@willemrood Жыл бұрын
Really interesting stuff. I'm very surprised to see that using color lum/sat for highlighting magnitude is considered worse than markersize. I mean, it makes sense that without a colorbar it is very difficult to say what saturation/transparency corresponds to anything in absolute or relative sense. So I feel like when utilizing those, a colorbar is mandatory. I'd be interested to see Dr. Xu's take on how to visualize high density datasets. Because when you have a set of n=1e6, a scatter plot that uses area to denote magnitude will not really be usable due to the markers overlapping or being to small to be visible. I'm expecting that at some point you have to shift from using scatterplots to porkchop plots and so on. Would be nice to see something of an overview between data set size and plotting formats!
@xbzq
@xbzq Жыл бұрын
RGB on a computer screen is a super poor way to refer to saturation. HSV and HLS are super poor derivatives. If you do not understand color spaces and vision in depth you shouldn't be talking about levels of saturation because that is a very complex topic and knowing only RGB values and your run of the mill Photoshop color pickers will lead you only to talking a lot of pseudoscientific nonsense that misleads people. To be more accurate it is necessary to talk about the Lch, Luv and L*a*b color spaces. Then you can use words like "twice as saturated" and have it actually mean something. Luminance is also fraught with peril. Something twice as bright doesn't look twice as bright. When printing on paper or viewing on a monitor there's a maximum brightness but this isn't true in general. Twice the number of lightbulbs will give it twice the brightness but it doesn't look it. It's all a bit too complicated for this comment but suffice it to say that there're not many people that understand it yet there's a large percentage that really think they know when in fact they do not.
@willemrood
@willemrood Жыл бұрын
@@xbzq Yeah exactly. I completely agree. To obtain something absolute from "a color" is incredibly sensitive and depends on way more factors than you'd expect. It becomes even worse when you use colormaps that vary multiple channels.
@lens07
@lens07 Жыл бұрын
I can do another video if there is enough interest about visualising large dataset (I assume this is what you mean by 'high density'). A bit of spoiler: there is no silver bullet for large dataset unfortunately.
@xbzq
@xbzq Жыл бұрын
@@lens07 How do you define "saturation" that allows you to say that one area has twice the saturation of another?
@willemrood
@willemrood Жыл бұрын
@@lens07 Yeah, although what I mean with high density is quite specific. So I'm not too sure about the wording. When doing optimization problems, the results should converge to the optimum. So what happens is that your distribution of markers is quite dense around the optimum. Which doesn't leave a lot of space for varying marker size (and thus the possibility to distinguish between the individual markers). Which makes me always opt for varying color (however a porkchop is superior of course). Hopefully that clears the "high density" part up a bit. Thanks for the reply!
@squishmastah4682
@squishmastah4682 Жыл бұрын
I knew I liked Dr. Xu early on, but that sensation only magnified as time went on; much like electroshock.
@Hatamoto95
@Hatamoto95 Жыл бұрын
Computerphile invites data visualisation expert, films his presentation from a distance on a tiny screen
@jamesusespivot
@jamesusespivot Жыл бұрын
Idk if you were just making a joke or if you actually want to see the presentation. But if so, it's in the description.
@Pedritox0953
@Pedritox0953 Жыл бұрын
Great video!
@sanketdutta4981
@sanketdutta4981 Жыл бұрын
Really informative video! Can someone please name the books on screen between 0:25 - 0:47. I can recognize Tufte’s classic from a mile away but I can’t see the other two. One of them is surely a springer handbook not sure which one.
@AF-lt2fr
@AF-lt2fr Жыл бұрын
The visual display of quantitative information - second edition The grammar of graphics - second edition Visualisation analysis and design
@Computerphile
@Computerphile Жыл бұрын
Apologies: photos.app.goo.gl/brmCVQYgFked85kx8
@sanketdutta4981
@sanketdutta4981 Жыл бұрын
@@Computerphile Thanks a lot
@sanketdutta4981
@sanketdutta4981 Жыл бұрын
@@AF-lt2fr Thank you
@AF-lt2fr
@AF-lt2fr Жыл бұрын
@@sanketdutta4981 no problem - I ended up getting it by changing the video quality under advanced to 4k and zooming in.
@andrewnemov
@andrewnemov Жыл бұрын
Can you, please, provide names of the books in beginning of the video.
@pengain4
@pengain4 Жыл бұрын
Is it possible to get original presentation somewhere?
@31b41a59l26u53
@31b41a59l26u53 Жыл бұрын
I also would like to have the slides.
@jamesusespivot
@jamesusespivot Жыл бұрын
It's in the description.
@truthprogrammer7594
@truthprogrammer7594 Жыл бұрын
good video and explanation
@nervous711
@nervous711 Жыл бұрын
So it's about categories and magnitude, and how you should represent them depends on how accurate you want them to be. But what about relation, trend, and connection intensity?
@kenakins
@kenakins Жыл бұрын
Can you guys do a video on the new SLP bug CVE-2023-29552? I think it would be really interesting and would love to hear your professional takes on it!
@jmasterX
@jmasterX Жыл бұрын
great video thanks so much!!!!!!!!!!!!
@HighMansx
@HighMansx Жыл бұрын
I was quite shocked that they were all 2x darker, longer, and larger! I had guessed, 2.5, 2.5 and 2!
@MrKrock164
@MrKrock164 Жыл бұрын
Wouldn't a pie chart (area, 0.7, underestimated) be more reliable than a straight line (length, 1, normalized)? I think there's more tricks like that to improve visualization for better precision of the estimated value.
@idaho777
@idaho777 Жыл бұрын
I think pie charts utilize a mix of area and length. The example in the video with squares compares 2 geometrically similar squares with different areas. The pie chart's pie's are not geometrically similar with changes in area but instead the arclengths change (since radius is constant to the bounds of the entire pie). I'm assuming this study comparing areas used uniform scale. We could change the visualization to compare a square and scaled square along one dimension (rectangle) because now you can compare side lengths (linear term), or display the numerical values inside the marker.
@rugbybeef
@rugbybeef Жыл бұрын
No, people's perceptions of triangular area or sectors (pie slices) are unreliable especially those undergoing rotation rely on their ability to estimate angular displacement. If instead of percentage of totals they display raw count data, say 9,720 votes, 9,000 votes, 8,280 votes, and 7,200 votes, and 1,800 votes, you would have trouble recognizing that these are 27%, 25%, 23%, 20%, and 5% respectively from a total of 36,000 votes. A linear graph with 4 closely but separated marks at each raw vote count and another very near 0 would show the differences of 720 votes between the first three and 1,080 votes to the fourth and the very wide gap to the last 5%. You may even notice that the gap between the 3rd & 4th values is wider than those between 1st & 2nd and 2nd & 3rd which are of equal width.
@chinobambino5252
@chinobambino5252 Жыл бұрын
Yes as someone in a field where visualization of data is very important (biology) i have been told to always steer clear of pie charts. Honestly i’ve never seen someone use one in a talk, and i think there would be snickers from the crowd if they did.
@tronster
@tronster Жыл бұрын
Overall good talk. Disappointed in tinting most everything green when showing the chart for the Magnitude Channel in the rows for "Color luminance" and "Color saturation" as well as for the Identity Channel for the row "Color hue"; these should not have been tinted to all be green.
@thomash4810
@thomash4810 Жыл бұрын
Cool video
@matbronk1
@matbronk1 Жыл бұрын
The experiment was a bit biased towards giving a different answer for each one, I'd say. Having three things to judge and three judgements to use may cause you think you have tot use all three judgements once
@hanswoast7
@hanswoast7 Жыл бұрын
Yep, but can you visualize it?
@gabrote42
@gabrote42 Жыл бұрын
Cool video. I wonder if you are preparibg one about the GPT-4 pause
@davidmorrison7742
@davidmorrison7742 Жыл бұрын
... but my boss wants 3D pie charts and 3D stacked bar charts. Basically, add 3D to everything.
@bscutajar
@bscutajar Жыл бұрын
I wonder why they used voltage with electric shock and not power, since it would make sense pain is proportional to power.
@goopytoobers9397
@goopytoobers9397 Жыл бұрын
Isn’t this a re-upload?
@cmuller1441
@cmuller1441 Жыл бұрын
Can someone add caption (not automatic) ?
@me0101001000
@me0101001000 Жыл бұрын
I don't have a CS background. I'm more of a traditional engineer in ChemE/MatSci. For people like me, you really can't separate engineering and design. In fact, I'd argue that Engineering is just a small circle inside of the larger circle that is design. Is it similar for CS, where all kinds of CS work has to involve some kind of design?
@carl8703
@carl8703 Жыл бұрын
So this suggests to me that any visualization whatsoever should only ever use distance to represent numeric data, since anything else would potentially deceive the audience. Any other channel like RGB, hue, etc. should be used strictly to distinguish nonnumeric data.
@thirdcoffee
@thirdcoffee Жыл бұрын
Thanks for the great video. One question... this laptop looks amazing. Is it a macbook or a windows machine? Which one? Does anyone know?
@HebaruSan
@HebaruSan Жыл бұрын
So ironic how there's nothing to look at through so much of this video!
@lashoes2207
@lashoes2207 Жыл бұрын
Electric shocks? Suffer to get your data puny human
@harriehausenman8623
@harriehausenman8623 Жыл бұрын
That's obviously how he got his funding 😆
@griggiorouge
@griggiorouge Жыл бұрын
genius stuff.
@trikers471
@trikers471 Жыл бұрын
Whatever you did to the video made the length example wrong, that line is not twice the first line, it is clearly more, as measured with a ruler on my screen
@SebastianSchleussner
@SebastianSchleussner Жыл бұрын
At 10:00? It's your screen doing something funny. Here it is precisely 4.5 vs 9.0 cm.
@misium
@misium Жыл бұрын
10:15 Hmm the infamous electric shock visualiser.
@alexxx4434
@alexxx4434 Жыл бұрын
Strangely saturation was guessed right, and length wrong.
@technicalcked
@technicalcked Жыл бұрын
❤❤❤❤
@jaydeep-p
@jaydeep-p Жыл бұрын
Wow
@marklonergan3898
@marklonergan3898 Жыл бұрын
I think visualization is great, but should always be accompanied by the raw data itself, unless the presenter is deliberately trying to mislead. So many charts are presented without labels on the scales - it might not be 0-based, the scale might be logarithmic, etc. The raw data at least can't be "misinterpreted". The main reason i mention this is because of the statement "a small increase in the voltage is perceived as a large increase by the subject". Are we talking a small increase in units or a small increase in percentage? Human perception has been shown to be logarithmic naturally (you can very quickly differentiate 4 lions from 5 lions at a glance, but not 100 lions from 101 lions). I'm not accusing your example of being misleading, but moreso backing-up my point that raw data should always be included so there's no room for misinterpretation.
@gloverelaxis
@gloverelaxis Жыл бұрын
and how is the raw data formatted?
@olgierd245
@olgierd245 Жыл бұрын
Why the dislikes tho?
@arcdam7041
@arcdam7041 11 ай бұрын
The options in the test were confusing the user and were manipulating his mind, so i think if wouldn't interven and let the user to give anwer without any assitance the result would be more accurate
@TracesOfNuts
@TracesOfNuts Жыл бұрын
9:32 me most of the time
@guilherme5094
@guilherme5094 Жыл бұрын
👍
@MrFrondoso
@MrFrondoso Жыл бұрын
Didn't mention Jacques Bertin in the important books about Data Visualization. Sorry but that's a red flag for me. He, and no one else wrote the first and widest intent to provide a theoretical foundation to Information Visualization, and his works are still valid. Is it because an Anglo centrism? I'm deeply sorry, much because I deeply like your work and everything you brought to me.
@glamourread9392
@glamourread9392 Жыл бұрын
Its a QR code 😂
@johnsenchak1428
@johnsenchak1428 Жыл бұрын
BORING THIS CHANNEL IS GOING DOWN THE TUBES
@SebastianSchleussner
@SebastianSchleussner Жыл бұрын
Nah. It's only you.
Data Analysis 2: Data Visualisation - Computerphile
17:42
Computerphile
Рет қаралды 125 М.
Emulation - Computerphile
22:36
Computerphile
Рет қаралды 200 М.
Super gymnastics 😍🫣
00:15
Lexa_Merin
Рет қаралды 106 МЛН
Каха ограбил банк
01:00
К-Media
Рет қаралды 9 МЛН
Backstage 🤫 tutorial #elsarca #tiktok
00:13
Elsa Arca
Рет қаралды 47 МЛН
Wait for the last one! 👀
00:28
Josh Horton
Рет қаралды 96 МЛН
The beauty of data visualization - David McCandless
18:18
TED-Ed
Рет қаралды 1,4 МЛН
Glitch Tokens - Computerphile
19:29
Computerphile
Рет қаралды 313 М.
Bing Chat Behaving Badly - Computerphile
25:07
Computerphile
Рет қаралды 323 М.
Moving Beyond Data Visualization | Frank Evans | TEDxOU
12:51
TEDx Talks
Рет қаралды 27 М.
Model Driven Software Engineering - Computerphile
14:12
Computerphile
Рет қаралды 93 М.
Garbage Collection (Mark & Sweep) - Computerphile
16:22
Computerphile
Рет қаралды 235 М.
Data Visualization Crash Course | Consulting Best Practices
25:57
Analyst Academy
Рет қаралды 141 М.
Vectoring Words (Word Embeddings) - Computerphile
16:56
Computerphile
Рет қаралды 282 М.
🚨 YOU'RE VISUALIZING YOUR DATA WRONG. And Here's Why...
17:11
Adam Finer - Learn BI Online
Рет қаралды 23 М.
Super gymnastics 😍🫣
00:15
Lexa_Merin
Рет қаралды 106 МЛН