What is a bit? (Information Theory)

  Рет қаралды 49,238

Art of the Problem

Art of the Problem

Күн бұрын

How can we quantify/measure an information source? We introduce the ideas of Nyquist & Hartley using a simple game involving yes/no questions. It's important to realize all of this happened before Claude Shannon arrived on the scene. However, this measure applies only when communication involves random sequences...
References:
Hartley - Transmission of Information
www3.alcatel-lucent.com/bstj/v...
Nyquist - Certain Factors Affecting Telegraph Speed
www3.alcatel-lucent.com/bstj/v...

Пікірлер: 53
@ArtOfTheProblem
@ArtOfTheProblem 3 жыл бұрын
Link to series playlist: kzbin.info/aero/PLbg3ZX2pWlgKDVFNwn9B63UhYJVIerzHL
@specialkender
@specialkender 4 жыл бұрын
This series is GOLD. GOLD I'M SAYING!
@ugestacoolie5998
@ugestacoolie5998 6 ай бұрын
it really is, I wished I known more interesting playlists like these
@alonsorobots
@alonsorobots 2 жыл бұрын
What an amazing series. All things we learn through education (ideas) are just tools in our tool belt to know how to influence the world around us. Grounding information theory literally in TOOLS that shaped history is a good way to get us to care about them. Following a semi chronological order of how things were invented is such a natural way to slowly follow along. Thank you for allowing more to compress the world around them.
@davidmuller7274
@davidmuller7274 8 жыл бұрын
Btw. For anyone wondering, why it is H: I believe it's not "H" but originally a capital greek "Eta" η for "Entropy",... which is written as H but is still a "capital Eta". :D
@IdleGod
@IdleGod 11 жыл бұрын
I love how the W and M are switched in the tiles. Also, great video; these go a long way to explaining these complex topics.
@MrFischvogel
@MrFischvogel 6 ай бұрын
This is so good content. Unbelievable. I am very, very grateful to you for this traceable explanation of complex concepts.
@ArtOfTheProblem
@ArtOfTheProblem 6 ай бұрын
thanks! was just thinking about this series as the video i'm releasing next week will mention it (it's on LLM's) stay tuned
@ThanadejR
@ThanadejR 3 жыл бұрын
This channel is real gem. Thank you so much.
@ustpCZ
@ustpCZ 9 жыл бұрын
5:56 "at most 29 binary digits" is incorrect. If you are extremely unlucky you will have to send 30 digits (message "fffff") if you are lucky, you will have to send only 24 digits ("ddddd"). 29 is average number of digits.
@cocothetimeless8382
@cocothetimeless8382 2 жыл бұрын
dude you deserve more views
@ArtOfTheProblem
@ArtOfTheProblem 2 жыл бұрын
glad you enjoyed
@Boxcow45
@Boxcow45 8 жыл бұрын
Hey, that's a binary search!
@ArtOfTheProblem
@ArtOfTheProblem 11 жыл бұрын
Very, very good catch. I should have mentioned that the cards are chosen with replacement...which means this isn't a "true" poker hand - just 5 selections from a set of 52.
@CoreyOgburn
@CoreyOgburn 11 жыл бұрын
This sounds like it's warming up to a lesson on entropy.
@henriquedeoliveira7835
@henriquedeoliveira7835 10 жыл бұрын
Also, the calculation you presented is the required one to transmit both the cards in the hand AND the order that the cards appear in the hand. If you don't care about the order, as you normally wouldn't in a true poker hand, the number of possibilities is smaller.
@ericX97
@ericX97 11 жыл бұрын
In the poker hand example you did not account for the fact that when you pick a card from a deck, it is no longer there, thus reducing the message space and consequently the average number of questions.
@DamesPoker
@DamesPoker 11 жыл бұрын
Never mind, I just noticed the date of the upload, I will be eagerly anticipating the last 2 episodes :). I have a feeling they have something to do with entropy...
@rotorickle
@rotorickle 11 жыл бұрын
At 9:32 the 'M' tile and the 'W' tile are switched. That 'Z' tile also looks suspiciously like another 'N' tile. However, at 1:21, the tiles all look normal. What happened? Also, I love this series and am looking forward to the next installment.
@DanCrystalis
@DanCrystalis 11 жыл бұрын
I would assume, when sending words, that for the larger ones you would get to a point at which the combination of bits could only represent that word and that specific word alone. You could then save on info by 'shortening' the amount of bits that the word would otherwise consists of, so you could save quite some money and information if your message is long enough.
@naboogom2733
@naboogom2733 7 жыл бұрын
this was just great
@SOSTacoJohnson
@SOSTacoJohnson 11 жыл бұрын
At the end you posed a question. I'm guessing the answer was to compile all the words in the english language and make an algorhythm that would know the entire set of possible subsequent letters based upon the beginning pattern. Is there another way to limit the letters to less than 4-5? Also, I was thinking the number of signals is important, u can use a pause to end the letter, but isnt the speed also important? & if so, the wouldn't you need a number to declare the end of the letter/unit?
@korth66
@korth66 11 жыл бұрын
Will you upload the next 2 videos? Very interesting topic.
@StephenRoseDuo
@StephenRoseDuo 7 жыл бұрын
CANADA MAKING US PROUD!
@ArtOfTheProblem
@ArtOfTheProblem 7 жыл бұрын
go team!
@brianjblairdotcom
@brianjblairdotcom 11 жыл бұрын
Whoops got cut of. Video. The answer is sending words and maybe phrases instead of letters.
@vazixLT
@vazixLT 11 жыл бұрын
There should be a base fee included, because you need to ask a few questions first to know how long will be the message that comes (if you don't have an agreement of how long the pauses have to be) and also what type of questions you need to ask. At first you would have needed to ask 2 questions do know if the information sent will represent H/T, letters or cards. These points are a bit advanced, if most people would learn and understand at least the concept presented here, it would be very cool
@NiftyFingers
@NiftyFingers 11 жыл бұрын
Everything in information boils down to the bit though. You can have nats and hartleys as well. Nats are when you use euler's number instead of 2, and hartleys are when you use 10 instead of 2. If you give me a quantity of information, say 5 nats, that's (1 over ln2) bits, or about 1.44 bits. Words and phrases are multiple bits, so there's really no difference. How do you send a phrase over a wire by the way? 1V = "to", 1.1V = "the", 1.2V = "an"... engineers would say what a pain that would be.
@malchar2
@malchar2 11 жыл бұрын
If the message is known to be an English word, and you receive the letter "T" then it's more likely that the next letter will be "H", which might spell the common word "THE". At each node in the decision tree, the letters should be arranged between the two branches so that you have a 50% chance of ending up on either branch. Having an equal quantity of letters on each branch doesn't really make sense, it should be based on how likely they are to follow the previous letter or even previous words.
@ericX97
@ericX97 11 жыл бұрын
Thank you. I'm surprised I caught it myself.
@StefanTheFink
@StefanTheFink 5 жыл бұрын
My english is not perfect, what exactly does "message space" mean?
@ericX97
@ericX97 11 жыл бұрын
It's the principle behind it that counts.
@nubSawace
@nubSawace 10 жыл бұрын
wait... isn't the code with which you decipher the message considered "information"? shouldn't she be charging for that?
@16D
@16D 11 жыл бұрын
I'm guessing transfer protocols?
@87knox
@87knox 11 жыл бұрын
Is 4.7 the minimum number of bits per letter, or the average number of bits per letter? You seem to change your mind at least twice every sentence, on average.
@ZonkoKongo
@ZonkoKongo 7 жыл бұрын
87knox average
@joshuaronisjr
@joshuaronisjr 5 жыл бұрын
The average calculation isn't correct though. Let's say we have three letters 'A' 'B' 'C' And each message is only one letter long. So let's say we represent them like this: A:0 B:00 C:01 We need 2 bits for B, 2 bits for C, and only 1 bit for A. Therefore the average # of bits is (1 + 2 + 2) / 3choices, or 1.666666 bits per letter on average. Not log2(3), which gives 1.585.... ...?
@meribold
@meribold 5 жыл бұрын
I think 1.585 bits is the average we can achieve as the message length approaches infinity. If we send a message of length 10000 composed of the letters A, B, and C, there are 3^10000 possible messages. The base 2 logarithm of 3^10000 is log2(3^10000) ~= 15849.6 We can round this up to 15850 so we definitely have enough bits for all possible messages (2^15850 > 3^10000). On average, this means 15850/10000 = 1.5850 bits per letter. Maybe it's more intuitive to think about a message of length 2. We can already get a better per-character average than ~1.667 here by optimally encoding all 9 two-letter combinations.
@Ryanwazhere4490
@Ryanwazhere4490 11 жыл бұрын
Khhhhhhhaaaaaaann!!
@ssfox8860
@ssfox8860 3 жыл бұрын
WHO THE HELL DISLIKED THIS VIDEO. Its a great video
@ArtOfTheProblem
@ArtOfTheProblem 3 жыл бұрын
glad you enjoyed, stay tuned
@plekkchand
@plekkchand 6 жыл бұрын
ET- cetera, not ECK-cetera.
@DamesPoker
@DamesPoker 11 жыл бұрын
Where is 11 and 12????????????????????
@AnnanFay
@AnnanFay 8 жыл бұрын
I'm pretty sure you only need 22 bits to send a poker hand? You cannot get duplicate cards which reduces the amount of data you need to send!
@anuarandom
@anuarandom 8 жыл бұрын
Can you show us how did you get the 22 bits for a poker hand, please?
@meribold
@meribold 5 жыл бұрын
That's a good point, but how do you arrive at 22 bits? Since log2(52)+log2(51)+log2(50)+log2(49)+log2(48) ~= 28.2, we only reduce the number of bits from 28.5 to 28.2.
@distantuncertain630
@distantuncertain630 7 жыл бұрын
I'm sorry, three things in this video don't make any sense to me. In the example with the six-letter word, Brit shows the most efficient way to determine the letter (eliminating about half of the letters with every question). With this method you will never need to ask more than five questions, and sometimes four will be enough. There is no better way of doing it. Based on that he then goes on to calculate the average amount of questions you'd need to ask for one letter - and here's my first point: If you use the described method and you go threw every possbile letter, you will find that for exactly 6 of them you'll only need to ask four questions ( for example a, d, g, n, q, t, if you always choose the smaller one of a pair of three). The 20 remaining letters require five questions. Now if you calculate the average you get (6*4 + 20*5)/26 = 4.769... questions per letter. But Brit does the following and gets a different answer: He uses the formula to calculate the message space based on the differences per symbol and the symbol rate. Message space = (differences per symbol) to the power of (symbol rate). Basically what he calculates is "How many times do I have to sent a bit (two differences per symbol) to be able to construct 26 different messages (i.e. 26 different combinations of those bits) ?" And his answer is log base 2 of 26 = 7.004... . But then he says: "So, on average, approximately 4.7 questions will be needed per letter at minimum [...]" But besides the first two digits his number is different than what I just calculated. Now, did I make a mistake in calculating the average number of questions per letter, or did he simply calculate a different thing? Also, does saying "on average at minimum" make any sense? It's either the average or the minimum, isn't it? And he then goes on to calculate 6*4.7 = 28.2 and states that "Bob can expect to ask, at minimum, 28.2 questions, which means Alice will need to send, at most, 29 binary digits." But clearly, the minimum number of questions would be 24 if you happen to only need four questions for every letter. And Alice will need to sent, at most, 30 bits, if all the letters require five questions. So this is the second thing I don't understand. And finally, does using the formula for the message space in this case make any sense at all? Because you can't sent a bit 4.7 times. Either 4 times or 5 times. And if it's not the average amount of bits, then what is the point of this calculation? I really hope some of you can understand my confusion and maybe help me out ':) (Btw it's also possible that I just didnn't understand what he says correctly because english is not my native language, but I don't think that's the case)
@brianjblairdotcom
@brianjblairdotcom 11 жыл бұрын
First view.Great
@mijnmuziek454
@mijnmuziek454 6 жыл бұрын
Stolen from Khan academy?
@dibbidydoo4318
@dibbidydoo4318 6 жыл бұрын
mijnmuziek454 nah this video is older
@IrregularPineapples
@IrregularPineapples 6 жыл бұрын
This is the original.
@glenrigby5675
@glenrigby5675 11 жыл бұрын
fees in canadian!!!!!
@tsunamio7750
@tsunamio7750 Жыл бұрын
5:47 Error detected: Worst case, every letter is 5 questions: 6letters x 5questions = 30symbols Best case, every letter is 4 questions: 6letters x 4 questions = 24symbols Average case, 4.7 letters per question: 6letters x 4.7 questions = 28.2 symbols
What is a Markov chain?
7:15
Art of the Problem
Рет қаралды 147 М.
Why Information Theory is Important - Computerphile
12:33
Computerphile
Рет қаралды 146 М.
Они убрались очень быстро!
00:40
Аришнев
Рет қаралды 1,6 МЛН
🍕Пиццерия FNAF в реальной жизни #shorts
00:41
Who’s more flexible:💖 or 💚? @milanaroller
00:14
Diana Belitskay
Рет қаралды 19 МЛН
Huffman Codes: An Information Theory Perspective
29:11
Reducible
Рет қаралды 221 М.
Information Theory Basics
16:22
Intelligent Systems Lab
Рет қаралды 60 М.
What is a computer? (the history covering Leibniz, Babbage & Lovelace)
10:18
How Much Information?
5:47
Veritasium
Рет қаралды 2,5 МЛН
Discovery & History of Static Electricity
9:02
Art of the Problem
Рет қаралды 69 М.
Aging reversed? Testing the information theory of aging.
13:22
The Sheekey Science Show
Рет қаралды 19 М.
Channel capacity (baud rate)
10:37
Art of the Problem
Рет қаралды 38 М.
Uses of Information Theory - Computerphile
14:48
Computerphile
Рет қаралды 73 М.
The Mathematics of Consciousness (Integrated Information Theory)
18:36
Astonishing Hypothesis
Рет қаралды 85 М.