Measuring information | Journey into information theory | Computer Science | Khan Academy

แชร์
ฝัง
  • เผยแพร่เมื่อ 28 พ.ค. 2024
  • How can we quantify/measure an information source?
    Watch the next lesson: www.khanacademy.org/computing...
    Missed the previous lesson? www.khanacademy.org/computing...
    Computer Science on Khan Academy: Learn select topics from computer science - algorithms (how we solve common problems in computer science and measure the efficiency of our solutions), cryptography (how we protect secret information), and information theory (how we encode and compress information).
    About Khan Academy: Khan Academy is a nonprofit with a mission to provide a free, world-class education for anyone, anywhere. We believe learners of all ages should have unlimited access to free educational content they can master at their own pace. We use intelligent software, deep data analytics and intuitive user interfaces to help students and teachers around the world. Our resources cover preschool through early college education, including math, biology, chemistry, physics, economics, finance, history, grammar and more. We offer free personalized SAT test prep in partnership with the test developer, the College Board. Khan Academy has been translated into dozens of languages, and 100 million people use our platform worldwide every year. For more information, visit www.khanacademy.org, join us on Facebook or follow us on Twitter at @khanacademy. And remember, you can learn anything.
    For free. For everyone. Forever. #YouCanLearnAnything
    Subscribe to Khan Academy’s Computer Science channel: / channel
    Subscribe to Khan Academy: th-cam.com/users/subscription_...

ความคิดเห็น • 60

  • @mrmr4737
    @mrmr4737 3 ปีที่แล้ว +12

    This is simply amazing.

    • @shilaakter5387
      @shilaakter5387 ปีที่แล้ว +1

      ৮এীা😊😂😂😅

  • @redwatch.
    @redwatch. ปีที่แล้ว +1

    Love this video. Demystifying, makes me feel a little less aptidudinally challenged.

  • @Matt-xb5iq
    @Matt-xb5iq 5 ปีที่แล้ว +1

    Loved it so much! Thank you!!!

  • @archismandas7760
    @archismandas7760 2 ปีที่แล้ว

    This is a very joyous moment for me as I finally understood the significance of something I always considered as unnecessary

  • @nanba009
    @nanba009 3 ปีที่แล้ว

    what a gem of youtube this channel is.

  • @hrivera4201
    @hrivera4201 2 ปีที่แล้ว +5

    previous lesson: th-cam.com/video/W-OskAegz5o/w-d-xo.html
    next lesson: th-cam.com/video/Ws63I3F7Moc/w-d-xo.html

  • @buddhanag4987
    @buddhanag4987 5 ปีที่แล้ว +3

    Very informative

  • @whygoogle5051
    @whygoogle5051 4 ปีที่แล้ว +7

    This idea of communication being a mix of predictability and surprise reminds me of the double-slit experiment; while you can't predict where exactly a particle would land, the overall pattern could be -- the message space was definable.
    It's like: so the infinite and the void were having a conversation and...

    • @yodae7621
      @yodae7621 2 ปีที่แล้ว +1

      What if the perspective was wrong and forever decided that if you can define everything exactly then...

    • @SachinYadav94
      @SachinYadav94 ปีที่แล้ว

      @@yodae7621 lmfaooo

  • @skreutzer
    @skreutzer 5 ปีที่แล้ว +4

    3:03 No. You need a way to tell the receiver in advance if you're talking about cards or letters in advance (if no prior knowledge/arrangement can be assumed), and as your medium is limited to binary encoding and no other channels, you need transmit more bits for the "protocol" to determine what the following bits are all about, rendering "bits" useless as a measure for expressing meaning. You need additional bits to declare if the message/transmission is about coinflips, characters of a word or a poker hand, and it would be limited to these three, and if you would bill by the amount of bits, you either have to increase the charge for everybody by the bits that serve as indicator of the domain of the message, or come up with a scheme that favors some domains over others in terms of prices. By limiting the transmission medium to digital bit encoding, simple coin flips are always cheaper, but bits are in no way a useful measure for information, as transmitting coinflips over a medium that supports 26 states would make it just as expensive as transmitting letters, if no additional encoding scheme would be used, which then again needs to indicate it's presence somehow. A unit of base 26 could just as well "measure information" if that's the nature of the used medium despite being lossy/inaccurate for transmitting information that just needs to indicate yes/no states. Binary could be assumed as the most atomic instance of state difference indication/signalling of a hypothetical transmission medium, causing that you can only talk about yes/no changes on both sides of the wire, not related to coinflips or anything else, just yes/no state changes.
    In essence, for charging, it all depends on your s at 8:03, so why would you assume s = 2, where there could be transmission media that are much more favorable to the customer sending letters or poker cards, which is just as well information as somebody transmitting coinflips, not to speak of the problem of interpreting the meaning of the bits without out of band communication or prior knowledge?

    • @vedantjhawar7553
      @vedantjhawar7553 3 ปีที่แล้ว

      Regarding the first part of the statement, couldn't it be assumed that the friends between treehouse A and treehouse B know what type of information they are giving? In that case, wouldn't you be able to solve the issue by asking two questions of whether or not the following data is from a certain person. You'd only need to ask the question 1 more time for the next person in line, assuming that there are only 3 people. Or am I wrong? Also, what did you mean by 26 states? Do you mean what would happen if we were transmitting 26 coinflips?

  • @jongseokchoi4038
    @jongseokchoi4038 3 ปีที่แล้ว +1

    Think about it. This has something to do with entropy, microscopic states and Boltzmann’s constant, which he thought of it around 1800s

  • @mallika1156
    @mallika1156 ปีที่แล้ว

    Very nice explanation 👍

  • @zacharywong1994
    @zacharywong1994 9 ปีที่แล้ว

    In short, information is based on # of possibilities. The more what we called "accurate" and "precise" information meaning eliminating other possible odds. The more loosely information, the higher chance it comes with more alternatives. Such theory known as Information theory by Claude Shannon and also many other researchers, especially Ralph Hartley. For every 1 bit of information, it equals to 2 possibilities.

  • @funnyXP26
    @funnyXP26 7 ปีที่แล้ว +6

    When it comes to the pokerhand, you can only draw a card once, therefore leaving one less card each time you draw one. Making the amount of bits needed less, or am I totally wrong?

    • @jmmahony
      @jmmahony 6 ปีที่แล้ว +1

      Right, also in poker the order of the cards doesn't matter.

    • @_nemo
      @_nemo 5 ปีที่แล้ว +6

      Yes. The messagespace would be all possible subsets of 5 cards from 52, which has binomial(52,5) possibilities. We then get log_2( binomial(52,5) ) = 21.309... as the average number of questions.

  • @TimothyKirkby
    @TimothyKirkby 2 ปีที่แล้ว +1

    Well, that went way over my head!😅🤪

  • @anjumaddi5595
    @anjumaddi5595 4 ปีที่แล้ว

    If we know the the probability of the symbols then we can ask the questions in a different way (unlike is it less than m and dividing the alphabets into equal halves) like is it a letter "aei.." etc..

  • @AlphA12647
    @AlphA12647 3 ปีที่แล้ว +24

    “*Naturally* use a log of base 2” lol

    • @Darieee
      @Darieee 3 ปีที่แล้ว

      yeah also carved me a bit that one - cool way of putting it either way

    • @cariboux2
      @cariboux2 3 ปีที่แล้ว

      😅

    • @Ikbeneengeit
      @Ikbeneengeit ปีที่แล้ว

      2.72ly use a log of base 2

    • @mridullatest1344
      @mridullatest1344 ปีที่แล้ว

    • @mridullatest1344
      @mridullatest1344 ปีที่แล้ว

  • @luamfernandez6031
    @luamfernandez6031 3 ปีที่แล้ว +1

    Can somebody explain to me why the formula "2^number of questions = message space" is correct?

    • @agentedofbidisfarcado461
      @agentedofbidisfarcado461 3 ปีที่แล้ว

      To find the correct information most efficiently, you need to ask questions that cut the set of possible answers in half (the set of answers has to be ordered), and thats a binary search, because it has two possible answers: yes or no, so the base is 2 since there's 2 possible outcomes, so in other words, you could think about 2^n°questions as doing an inverse binary search, that is, since the choosen number counts as 1 element in the set of all possible numbers, you need to do this operation times the average amount of questions)

  • @patmull1
    @patmull1 4 ปีที่แล้ว

    Does Hartley's formula H = n log (s) works also for non-binary cases, if we would use different base of logarithm than 2?

    • @vedantjhawar7553
      @vedantjhawar7553 3 ปีที่แล้ว

      Yes, I'm sure we can. Though I really think it depends on what type of information being presented. Every type of information can be represented in binary, however, it would probably be more difficult to do with a higher number of bases. For example, how would you represent whether a coin has been flipped or not with 3+ possible signals? I'm sure that there would be ways, but none as efficient as binary. I think the reason they use binary here is because it's the most general and versatile base for measuring information that's relatively consistent in efficiency.

  • @tinshusasi8246
    @tinshusasi8246 2 ปีที่แล้ว

    Can anyone explain hartley formula H = n log (s) with deck of cards example.
    ie what is the value of s , n and base of the log and how we got the values of s and n

  • @MrMarkgyuro
    @MrMarkgyuro 3 ปีที่แล้ว

    this music is so fuckin information theory feel :D

  • @saththiyambharathiyan8175
    @saththiyambharathiyan8175 2 ปีที่แล้ว

    what if the middle of the alphabet is sent........ how could we decide that......?

  • @indramaniarora7901
    @indramaniarora7901 4 ปีที่แล้ว

    4:46. The Number of Questions will be at least 3. In the case of alphabets, Suppose we need to find G, then we can ask:
    1. Is it less than N?
    2. Is it less than G?
    3. Is it greater than G?

    • @vasudevsharma7723
      @vasudevsharma7723 3 ปีที่แล้ว

      No its beacuse we do not know if it is really g...cause everyte we rule out other possibilities we divide them again in half and we dont know when we have to stop doing it like in case on G

    • @hdraresro4399
      @hdraresro4399 3 ปีที่แล้ว

      @@vasudevsharma7723 he's not talking about a general posibility, he's talking about a minimum number of questions. Suppose we need to find N. The following will happen:
      1.Is it less than N? We recieve 0.
      2.Is it greater than N? We recieve 0.
      Then we deduce it's N. So, the minimum number of questions would be 2, not even 3. Even if we need to find a letter that requires a greater amount of questions, we would still need at least 2 questions to find it.

    • @robertasova304
      @robertasova304 3 ปีที่แล้ว +1

      @@hdraresro4399 Yes but if it's not N you've just wasted 1 question, thus increasing the maximum amount of questions which we ask.

    • @aidanschofield2743
      @aidanschofield2743 3 ปีที่แล้ว

      @@hdraresro4399 What you're describing is the case of a singular letter that happened to match the questions. You're talking about minimum number of questions for one option, they've created a system which optimizes all options to the fewest questions on average, which will be the far better communication system in the end.

  • @vedantjhawar7553
    @vedantjhawar7553 3 ปีที่แล้ว

    Shouldn't it be at least 3 and at most 5 instead of at least 4 and at most 5?

  • @therealrictuar
    @therealrictuar 4 ปีที่แล้ว +5

    whats with this super creepy music?
    eitherway, it might be nice to explicitly state the idea that for any piece of information that is a part of a finite set, there exists a a set of questions, each with a finite set of answers, that can represent that piece of information

    • @cariboux2
      @cariboux2 3 ปีที่แล้ว

      😅😅😅

  • @dutdut11
    @dutdut11 ปีที่แล้ว

    Shout out kay sir japz

  • @M_K171
    @M_K171 2 ปีที่แล้ว

    I don’t see what information adds to anything. Before, we called these things “properties.” If you were to strip a table of its properties: hardness, squareness (or its shape), brownness, etc., do you still have a table? It seems that the properties are as much a part of the table, as the table itself is. Information seems to be talking about the same thing. The properties of the table “convey” information to the environment. The only thing I see differently is that information theory is looking to add a bit unit to the world and give it measurements. I think it does nothing for science. Information is a platonic sort of concept, it doesn’t belong in physics. The bit unit was used for computers. This is a prevalent technology and so we project it onto the world. I do believe objects and properties and the world emanates information, but the information is a metaphysical entity, just as say universals are.

  • @Thelunamiah
    @Thelunamiah 4 ปีที่แล้ว

    • @valor36az
      @valor36az 3 ปีที่แล้ว

      Lydia Riley I believe he means the rank of the letter starting from A

  • @MA-jg6tw
    @MA-jg6tw 2 ปีที่แล้ว +1

    why at most 29? If the word is "FFFFFF" (f 6 times) wouldn't that take 30

    • @arc6fire
      @arc6fire 2 ปีที่แล้ว

      i agree, i thought this too at the time

  • @enderboy6491
    @enderboy6491 2 ปีที่แล้ว

    🕸️

  • @user-bj8xw1nx4d
    @user-bj8xw1nx4d 9 หลายเดือนก่อน

    কেমন হয়েছে

  • @Thelunamiah
    @Thelunamiah 4 ปีที่แล้ว

    Did anyone else notice at 4:01 He asks, "Is it less than G?" Then he answers "Yes." As in yes it is less than G. But he then proceeds to use A-F opposed to G-M. He did not pay attention.

    • @robertasova304
      @robertasova304 3 ปีที่แล้ว

      Ammm...the letters from A to F are less than G
      G=7
      F=6
      6

  • @Peaceful_Heaven.
    @Peaceful_Heaven. 5 ปีที่แล้ว

    Upload it in hindi

  • @NishanthSalahudeen
    @NishanthSalahudeen 6 ปีที่แล้ว

    Fail