Huffman encoding: the cat has ten million hairs, four legs, two ears, two eyes. Here's a tree with all the parts necessary and this string is the instruction to reassemble it.
HI I CANT GET THE VIDEO TO LOAD IT SAYS I AM NOT CONNECTED TO THE INTERNET ...WHY ARE COMPUTERS SO COMPLICATED...WTF IS THE INTERNET ANYWAY, I JUST WANT TO LEARN HOW TO COMPRESS MY FILES I HAVE THOUSANDS OF PAPERS EVERYWHERE AND NEED TO COMPRESS
HI I CANT GET THE VIDEO TO LOAD IT SAYS I AM NOT CONNECTED TO THE INTERNET ...WHY ARE COMPUTERS SO COMPLICATED...WTF IS THE INTERNET ANYWAY, I JUST WANT TO LEARN HOW TO COMPRESS MY FILES I HAVE THOUSANDS OF PAPERS EVERYWHERE AND NEED TO COMPRESS
Compression is really quite incredible. I had a high resolution heic image and it was only around 70MB. I exported it as a tiff and it was almost 500MB
heic is new kind of compression algorithm which is also being used for videos but it is always lossy unless specified as lossless. Another kind of such lossy compression is the djvu format which is capable of downsizing entire scanned thick books into multipage image file with several hundred pages. Converting that djvu to pdf will show that effect of 70mb djvu file to 500mb pdf.
another thing with compression: images; when the area has a slight deviation in color but still enough to look weird if it is all the same color; instead of storing all the 24 bits to get 255:255:255 color, you can just take a pixel and relate all other similar colors to it. so if there is barely any difference, you can express that in far fewer bits. and about text you can overlap selection. so the phrase Harry Potter and the philosopher's stone contains several frequent words, that you don't need to repeat
JPEG XL is an upcoming compression method that will solve most of our problems with image files. It has a lossy mode that is way smaller than JPEG. It has a lossless mode that is smaller than PNG. And has support for all the kinds of metadata that you could want, including high color depth.
But the same is true of JPEG2000, and WebP, and HEIF, and FLIF. It's not enough to be better: Any replacement has to be so much better it would overcome the convenience factor using a universally-supported format like JPEG.
Situation: there are 5 competing image formats That's ridiculous, let's develop one that covers all use cases! Result: there are 6 competing image formats
Another interesting form of compression is huffman coding. The jist of how that works is that it orders the different characters by order of probability of occurrence of a file on a binary tree, then it uses its placement on the tree as a stand in for its value. The effect is reduced number of bits for common characters at the cost of extra number of bits for less-used ones. A very cool technique and is very complimentary with other compression algorithms, and is also used commonly over channels.
There are 2 alternatives: arithmetic coding (e.g. CABAC) which compresses way better, but is way more complex, and there's Asymmetric Numeral Systems (ANS) which compresses just a bit worse than CABAC while being only slightly more complex than Huffman, making it a perfect deal and being used by lots of tech giants, and also in next-generation JPEG XL image format.
Yo I just want u to know I appreciate u so much man. I started studying computer engineering at a uni and the lectures just made me loose all most all interest I had in CE. But ur the guy who bought back my passion. Thanks G.
raw to jpeg actually gets rid of a lot of info, including the dynamic range and editing capabilities, because it has all of the info from sensor, so you can edit the white balance perfectly
@@sanjacobs6261 That's not really what lossy compression means though. You lose the same info when converting to PNG which is a lossless format. The difference is that with a PNG you only lose this kind of metadata information whereas with jpeg you lose actual image information. Although how much and whether it's even noticeable very much depends on the settings since you can configure how strong jpeg should compress an image. But with high compression, you will very much notice artifacts on jpeg compressed images and for images with plain backgrounds/colors and text, it's much worse and much more noticeable even on high-quality settings. But removing metadata has nothing to do with compression. There is no compression whatsoever going on there. You are just deleting information. It's like saying "I'm compressing an image" when you delete the lower half of it. The amazing thing about lossy compression is that it removes information but is still able to reconstruct an image that ideally looks exactly the same to the human eye from the remaining data.
Thank you for this video! i was really confused whenever my friends said "compress the file", i tried to compress it by throwing my Hard drive out the window but my performance was a little worse after that so i decided to look on the internet for a new SSD and I saw this video! i will remember this for about a month, bcs i still need to test if throwing a SSD out the window saves storage capacity, Thx!
Compression aka that conversation between the hardware and software for finding the data in ram and hoping we don't end up fusing all our memory cats together. Anyhow I am a sucker for these types of niche videos and their surprisingly relaxing to watch.
Zooming in the Bliss image shows evidence of 8x8 JPEG compression artefacts, one problem with lossy compression is that not only does it discard some things, but it can add things that weren't there before, such as those JPEG artefacts or some quite nasty effects if MP3 is pushed too far. Recompression is particularly bad, if you compress to JPEG, and then recompress the result again to a smaller file, the quality will be worse than if you started again with an uncompressed or at least best quality original, as the second compression will be counting the artefacts of the first as part of the image.
You need to put a magnet on your hard drive to draw all the data into one place and nicely compact it to make more space. Make sure to move it around a bit in case some data is stuck
going to computer information college and kept reading compression and didnt understand it my ADHD was frustrated from not knowing XD thank you for the informative video and your time
its incredible how this works! program repacks are especially interesting to me, its just amazing how we can take a game thats ~100+ gb and repack it, losslessly, into a 40gb package!
Compression is definitely crazy stuff. A couple examples I've seen: a steam game with a download of around 7gb, and an uncompressed size of around 12gb. Another example: I exported one of my songs in the WAV format and it was over 50mb in size, while the MP3 version was less than 5mb. Also, about "some rural places not being able to stream 4k," before I got starlink, which gives me a download speed of 100+Mbps, my old internet had a download speed of under 12mbps on rare occasion.
So, compression is basically just refactoring code but for content. That's actually really interesting and I'm kinda surprised I never thought of it like that. In a way, a .zip file is more like a blueprint for how to build the file than just packing the file really tightly.
Wow it's amazing how we can compress information and send it through electricity and microwave. imagine how quantum computing will compress information?
I've seen an example of this where people will put a lot of a single letter in a file, zip it, the file looks tiny but when unzipped it crashes your computer or something because it's many terabytes in size and the computer doesn't have enough storage space
One correction I'd make is that the difference between a raw photo and a jpg is actually significant. Very significant. Much more than just Metadata. Raw photo files have much more data than can be seen by just viewing a side by side preview on a 8bit monitor. If you've ever edited raw files from a professional camera and also jpgs side by side you'd emmidiately understand how much data is within those raw files. You can take a photo that is visually pitch black and lift exposure to find detail. You can pull highlights down from white. Color banding is pretty much nonexistent if you're editing in 16bit as well. Now once it's edited it will look more or less the exact same to anyone not in the know. Jpgs are great for presenting or even printing nowadays. But not for editing and not for storing all a photos potential
Compression mostly takes advantage of similar colors being side by side. It can encode the variation from the initial shade in fewer bits. Repetition only appears in artificial images like screenshots that contain exactly the same shapes.
You mentioned at the beginning of the video that reading a lossy compressed image like a jpeg does not involve decompression. I would argue that decoding any jpeg image involves a sort of decompression. In order to display a jpeg image on a screen, it needs to be decoded which involves computing every single pixel value in the image. Lossy compression just means that the decoded image result will not exactly match the original, but decoding a jpeg still seems like a sort of decompression.
I love how he says "In fact some rural places still can't stream 4k" I'm from Germany, have live in a small city, have one of the best internet connections in a 20km radius and I sometimes can't even stream 1080p... I love Germany...
They should have included this technical part in the sitcom it would have been great and exclusive content for nerds shows amount of hard work that goes into building these things
on TH-cam, i often have to put the video in 1440p to view it in real 1080p... also something funny i noticed is that in the past the limiting factor to compression was processing power! since they could compress it further but decoding it in real time was harder on some machines. tho i do not miss the day where i waited a week for one 90 MB blurry anime episode to finish downloading...
The worst types of compression actually both happened in the 90s: hard drive compression only achieved ratios similar to ZIP and would grind any CPU at the time to a halt. The other was "SoftRam" that was said to compress data in memory to free up space when in actuality it faked the memory space without doing anything. Compression used for ill gotten gains in both cases really...
Dude how do you output so many videos on a weekly basis, im geniunly curious, they are super high quality and informative, by all means, please keep it up, but hoooow??
For kitty compression (comPURRession?), you need a container the size of your desired result. All you need to do is place them in the same directory and the cat will actually compress into the container all on its own.
heres an even better one, what if you just called all the characters from a tree and didnt write coordinates since thats bloat just write them in order
Audio compression knowledge is a very professional nature of knowledge and video compression is at least 4-16x more technical. Knowing this information is only useful to music producers/distributers, and commercial industrial movie makers; for reasons like managing how much truth you can fit on a Blu-ray disk so your customers get more bang for their buck, instead of paying full price for a product soo thick that the dvd player struggles to play it without being over worked. Knowledge of picture compression is much more relevant to poeple because of how many people actually do graphic design nowadays, and have to actually manage quality control on their finished product. For video compressions, it can be a nightmare considering how far commercial practices have gone to allow only exact settings to be applied to work on a universal scale. For example, for your dvd players, you have to encode in mkv aac, otherwise it absolutely will not be recognized by any common dvd movie player. That being said, you can't just burn mp4 files to a dvd disk and assume that because the files read on your computer, that your Blu-ray player will look at it the same way, because odds are that your br player won't even know what it's looking at. I think this problem low-key comes from 2 things. 1. Combatting bootleggers (glhf). 2. Making it harder for the smaller entrapaneurs to succeed. [Same as it ever was.) Companies like Nero, Adobe, and Sony have their software suites that are made to do the heavy lifting of ensuring you are compressing in exact commercial standards that are decided and agreed upon by lawyers, manufactures, and filming companies together. BUT, you may notice how expensive it is buy these programs/tools. This is a definite example of how it pays to be smart, if you are a DIY hero.
From 2:32 to 2:45... I would say it is totally wrong to see lossless compression like you did. IMHO it is not a simplification, it is just not true. However, I will think about that in the next days, I could be wrong, but it seems to contradict most I know about information theory. Anyway, thank you so much for your contents, good job!
The reason why they're are so many formats for each nature of data presentation, is because different types handle different natures of data differently. Gif handles black and white pictures best, png holds data very accurately and is this very heavy normally, and jpg is like the middle ground that is like the minivan for pictures. It's kind ugly but it can carry a respectable amount of accuracy in relation to the original source, without requiring a ton of gas. Same goes for videos like mkv, mp4, avi, wmv. When it comes to music or audio, I like my absolute favorite songs to be in hefty flac, instead of mp3 so I can squeeze all of the quality, that the producer meant me to hear, out of it. If it's a normal song 192-320kbps mp3 is plenty smooth /clean enough. If it's an audiobook or old-school game music, 96-128kbps is functionally plenty. Funny enough, audiobooks are commonly marketed out into 96kbps, BUT at 48000khz range instead of the common 44100 in order to retain the depth of audio that can easily be lost when compressing into the fuzzier/blurrier lower density bit rates. This is usefull to know if you like to share files online and want to shrink enormous files to cut down on a file size, transfer times, or if you feel chained by a mysterious 8mb file size limit. Rule of thumb for simple folk would be that full quality (lossless) is only completely utilized if you have the equipment for it. If your device is low quality, you will have a harder time noticing the fine difference in lossless vs lossy compressions, and you will be wasting time and energy on something out of reach. Last note, all compressions have xy or z amount of options to perfect quality vs file size. I recognize wav as having minimal options for audio compression and being rudely hefty in data, compared to flac with more flexibility. I like mkv, png, jpg, gif, mp3 and flac. I hate avi, bmp, and wav the most.
I think I have an idea on how you could compress _any_ file down to less than 1% of it's original size. It'd take a lot of computational power (I'm talking entire days of computation), but it could work. I'll keep it a secret for now though. I might learn python to see if this could work
@@redstonerti9918 All I'll say is that it involves compressing the raw binary data I don't know if it's even possible to access this data using python or any regular programming language, so it'd probably require a specialized computer to compress and decompress. All I can do with my little big brain for now is test the idea and lay the foundation in python. Going down to the basics of how computers work to compute this sounds like a pain, but if it is necessary, so be it
Every video this channel shits out of nowhere is so interesting. you cant do that. i hate this. i dont have time to go thru them all. such an underrated channel. why lossless when im downloading illegal repacks like right now. how do you know so much about the compression method of cats. WHO are you?
don't need to code the line number, just char number in whole text 135,50 - 6 chars or 2 numbers 33540 - 5 char or one number that makes reinsertion a lot easier - string are arrays of chars so that should not be a problem. what's the point of doing additional split by " "?
@@iskamag games you download once and you know it will take a long time, meanwhile when you load a website the amount of users that click off is correlated to how fast it loads. A megabyte is a lot when you think of the web
RAW > JPG definitely loses a LOT of data, but the difference is how much that data matters. If you're just viewing it, doesn't really matter at all. But when it comes to people that need to edit photos, having RAW or higher bit-depth data drastically increases how well you can apply certain effects to an image or do color grading. For example, I took some pictures today with a blue sky. However, my camera by default saves to JPG and is inherently 8 bits per pixel. The result is that when I tried to do some color grading on it, I started to reveal some banding on the blue sky. I can use certain algorithms to deband it, but it would be nicer if I just saved it in RAW and was able to manipulate the data that way, despite the files being significantly larger and slower to save into the camera.
Can someone explain how is it that sometimes when a file is compressed from a more complex format to a less complex, the outcoming file is bigger instead of smaller? Usually in image editing format this occurs when setting quality 1:1
Wouldn't it be even more efficient to replace every incidence of a repeated phrase with a unique char that doesn't appear in the original file at all? Like "Harry Potter and the Philosophers Stone" = "x" and replace all incidents of it with x? Or does having the x in a location take up more data than having the cords?
You have a limited set of unique characters unless you want to have the computer create new ones for each thing and then it just depends how many places things will be repeated to determine whether creating new characters uses more data than just listing the coordinates
Language is the lossy compression-decompression cycle of the human mind. Your thoughts are much larger than anything you can convey in speech, but when you tell someone your idea, they decompress it in their own minds and fill in the gaps with their own ideas. We should incorporate this concept into AI to accelerate our progress towards AGI.
Huffman encoding: the cat has ten million hairs, four legs, two ears, two eyes. Here's a tree with all the parts necessary and this string is the instruction to reassemble it.
@Anôn Imo pissed off much?
@Anôn Imo nobody asked
HI I CANT GET THE VIDEO TO LOAD IT SAYS I AM NOT CONNECTED TO THE INTERNET ...WHY ARE COMPUTERS SO COMPLICATED...WTF IS THE INTERNET ANYWAY, I JUST WANT TO LEARN HOW TO COMPRESS MY FILES I HAVE THOUSANDS OF PAPERS EVERYWHERE AND NEED TO COMPRESS
@@geist453 you can thank Steve jobs and the UX for normalizing computing for normies 😌 otherwise IT professionals would be out of a job.
@@gickygackers sure.....
"squish that cat"
HI I CANT GET THE VIDEO TO LOAD IT SAYS I AM NOT CONNECTED TO THE INTERNET ...WHY ARE COMPUTERS SO COMPLICATED...WTF IS THE INTERNET ANYWAY, I JUST WANT TO LEARN HOW TO COMPRESS MY FILES I HAVE THOUSANDS OF PAPERS EVERYWHERE AND NEED TO COMPRESS
@@geist453 squish 'em papers
Aha!!!
@@geist453 you sound like you are not experienced with computers
Xd
just put your hard drive in a hydraulic press
yes
yes
I alredy did that with some floppy disks
Meanwhile on another channel :
"This futa hentai is really dangerous and could attack at any moment, so we have to deal with it"
@@egg5474
w h a t
Compression is really quite incredible. I had a high resolution heic image and it was only around 70MB. I exported it as a tiff and it was almost 500MB
heic is new kind of compression algorithm which is also being used for videos but it is always lossy unless specified as lossless. Another kind of such lossy compression is the djvu format which is capable of downsizing entire scanned thick books into multipage image file with several hundred pages. Converting that djvu to pdf will show that effect of 70mb djvu file to 500mb pdf.
Sadly heic and djvu are not widely adopted and used yet
heic sucks
avif better
Are you lot even speaking words?
dude that picture is bigger than half life blue shift
loving the thumbnail buddy. keep it up!
hahaha me too
all his thumbnails are cool, not like ur typical teacher trying to use a 2011 meme unironically.
@@yekh why is this so damn true
@Zonno5 Nice haircut
Yeah but...... I wanted a meme.....
"In some rural areas you still can't stream 4k video" In entire countries man, entire countries
I find it really funny that you can do it in a Brasilian fabella but can't in an Australian city
@@cahallo5964 Cause all the kangaroos stomping on the internet cables, and the damage causes the cables to be less efficient, and in return, slower.
My country be like
@@TtEL bruh the kangaroos hate technology
In my country the highest quality is 1k on mobile, and HD on Pc, it's very normal
another thing with compression:
images;
when the area has a slight deviation in color but still enough to look weird if it is all the same color; instead of storing all the 24 bits to get 255:255:255 color, you can just take a pixel and relate all other similar colors to it. so if there is barely any difference, you can express that in far fewer bits.
and about text
you can overlap selection.
so the phrase
Harry Potter and the philosopher's stone
contains several frequent words, that you don't need to repeat
letter combinations*
JPEG XL is an upcoming compression method that will solve most of our problems with image files. It has a lossy mode that is way smaller than JPEG. It has a lossless mode that is smaller than PNG. And has support for all the kinds of metadata that you could want, including high color depth.
But the same is true of JPEG2000, and WebP, and HEIF, and FLIF. It's not enough to be better: Any replacement has to be so much better it would overcome the convenience factor using a universally-supported format like JPEG.
Situation: there are 5 competing image formats
That's ridiculous, let's develop one that covers all use cases!
Result: there are 6 competing image formats
@@fatweeb1545 Linux distros in summary.
@@dIancaster no because u dont need one distro only
@@snowwsquire you don't? My mint isn't enough?
1:27 “whereas the lossless one clearly there is some information lost” just wanted to point it out
Just making sure
Did I lose this part of the video?
Apparently
@@alexismandelias *compressed
This is the first time I see the XP background in wide aspect ratio. Didn't even know there are other hills in the background.
Another interesting form of compression is huffman coding. The jist of how that works is that it orders the different characters by order of probability of occurrence of a file on a binary tree, then it uses its placement on the tree as a stand in for its value. The effect is reduced number of bits for common characters at the cost of extra number of bits for less-used ones. A very cool technique and is very complimentary with other compression algorithms, and is also used commonly over channels.
There are 2 alternatives: arithmetic coding (e.g. CABAC) which compresses way better, but is way more complex, and there's Asymmetric Numeral Systems (ANS) which compresses just a bit worse than CABAC while being only slightly more complex than Huffman, making it a perfect deal and being used by lots of tech giants, and also in next-generation JPEG XL image format.
Jpeg's use the discrete cosine Fourier transform for all you math nerds
Thanx ᕙ( ͡° ͜ʖ ͡°)ᕗ
I appreciate that
talk about the wavelets too...
Thanks
i prefer french cosine oui
i don’t understand anything about your content but i like your personality and you’re funny so im going to sub
Do you know why you don't understand it?
For me its like half on half. Sometimes I get it but once he rambles on I just lose it but I love the video
@@yniekac8851 a rambling teacher is the best teacher. And that's real btw
same. Some I get Some I don't get
Story of my life haha 😂😅
Yo I just want u to know I appreciate u so much man. I started studying computer engineering at a uni and the lectures just made me loose all most all interest I had in CE. But ur the guy who bought back my passion. Thanks G.
Nobody:
Torrents: **Puts a .zip in a .zip in a .zip... in a .zip**
At that point you are just adding meta data to the file and making the file larger
iso to zip to rar to tar
Those kinds usually hide malware
@@someonestolemyname the metadata isnt too big of an isue. but you cant get any more compression anyways from doing that
Why are you trying to download 42.zip?
Lossless
“As you can see some of the information has been lost”
Wait w h a t ?
The visible pixels.
do I *look* like I know what a jay-peg is? I just want a picture of a god-dang hot dog
Jpeg*
Whips out mini wd40
@Tech Help Portal Extras yeah thanks for that. He definitely wasn’t misspelling for comedic effect or anything
I'm looking for a tap & die and some wd40. Do you know what an hammer is?
@Nem Gam Boi stf redditor
"Whoops, didn't mean to delete everything there."
its almost impossible to tell the difference between the side-by-side images on my crappy rural connection
@@weakspirit_ bro starlink will change everything, my ISP wont know what hit them
1mb/s gang
@@afallingtree9114 elomegaluln musk
@@xfied you bet
There was a time I download torrent over ppp connection, 12~64 kbps
And this was 2008.
Really taught me patience.
KGB: Throws cat into the garbage compressor.
Unpacking the poor thing would make a big mess though...
I think that would be a lossy compression. End result: minus one cat.
@@robdom91 end result: 27 cat parts
ok that went really dark
*Tom scream*
Bro that hit me way too hard, it took me four days to compress a 5gb folder
Based and catpilled thumbnail
forsenCD ✌️
THE TWO TIME BACK TO BACK 💿
Imagine actually having to concatenate stuff.
that profile pic didnt age well
raw to jpeg actually gets rid of a lot of info, including the dynamic range and editing capabilities, because it has all of the info from sensor, so you can edit the white balance perfectly
Hence, lossy compression
Edit: Yes, I know.
@@sanjacobs6261 That's not really what lossy compression means though. You lose the same info when converting to PNG which is a lossless format. The difference is that with a PNG you only lose this kind of metadata information whereas with jpeg you lose actual image information. Although how much and whether it's even noticeable very much depends on the settings since you can configure how strong jpeg should compress an image. But with high compression, you will very much notice artifacts on jpeg compressed images and for images with plain backgrounds/colors and text, it's much worse and much more noticeable even on high-quality settings.
But removing metadata has nothing to do with compression. There is no compression whatsoever going on there. You are just deleting information. It's like saying "I'm compressing an image" when you delete the lower half of it. The amazing thing about lossy compression is that it removes information but is still able to reconstruct an image that ideally looks exactly the same to the human eye from the remaining data.
@@sanjacobs6261 You know what ?
@@kiyu3229 1vader pretty much summed it up
@@sanjacobs6261 wow you replied very fast
Thank you for this video! i was really confused whenever my friends said "compress the file", i tried to compress it by throwing my Hard drive out the window but my performance was a little worse after that so i decided to look on the internet for a new SSD and I saw this video! i will remember this for about a month, bcs i still need to test if throwing a SSD out the window saves storage capacity, Thx!
You did it the wrong way, you need to smash it to flatten more of the surface before you throw it.
Thank me later 👍
@@averagehololiveenjoyer8496 thanks! that really helped, although, idk if it will work yet bcs i've got to find the missing pieces to reattach
@Technophilius Pulcherax i tried that but i just ended up with a low-profile ssd
See what you need to do is launch it into space at a black hole, perfect compression to just a few bits of info
@@Music-nn9mi i'll met it around the other side of the wormhole, thx!👍
Compression aka that conversation between the hardware and software for finding the data in ram and hoping we don't end up fusing all our memory cats together.
Anyhow I am a sucker for these types of niche videos and their surprisingly relaxing to watch.
Zooming in the Bliss image shows evidence of 8x8 JPEG compression artefacts, one problem with lossy compression is that not only does it discard some things, but it can add things that weren't there before, such as those JPEG artefacts or some quite nasty effects if MP3 is pushed too far.
Recompression is particularly bad, if you compress to JPEG, and then recompress the result again to a smaller file, the quality will be worse than if you started again with an uncompressed or at least best quality original, as the second compression will be counting the artefacts of the first as part of the image.
Kenny keeping the thumbnail metagame high, as always ! Great vid sir
The thumbnails on this channel are amazing
1:27 I think you mean the lossy compression type. You said lossless when explaining both examples.
You need to put a magnet on your hard drive to draw all the data into one place and nicely compact it to make more space. Make sure to move it around a bit in case some data is stuck
This actually sounds convincing lmao
going to computer information college and kept reading compression and didnt understand it my ADHD was frustrated from not knowing XD thank you for the informative video and your time
its incredible how this works! program repacks are especially interesting to me, its just amazing how we can take a game thats ~100+ gb and repack it, losslessly, into a 40gb package!
i only compress isos onto a flash drive to distribute to my school mates
>whether you're loading the data off a CD
_laughs in uncompressed Red Book audio_
*Laughs in FLAC*
"like a boomer"
*looks at my CD collection* "its okay i still lvoe you"
My vinyl collection..
Compression is definitely crazy stuff. A couple examples I've seen: a steam game with a download of around 7gb, and an uncompressed size of around 12gb.
Another example: I exported one of my songs in the WAV format and it was over 50mb in size, while the MP3 version was less than 5mb.
Also, about "some rural places not being able to stream 4k," before I got starlink, which gives me a download speed of 100+Mbps, my old internet had a download speed of under 12mbps on rare occasion.
Just causal awareness of packed steam games vs raw.
Converting mp3 to wav is a sin.
1:17 dam lossless and lossless have so many differences thank you for this information!
So, compression is basically just refactoring code but for content. That's actually really interesting and I'm kinda surprised I never thought of it like that. In a way, a .zip file is more like a blueprint for how to build the file than just packing the file really tightly.
I already know all this but for some reason I still want to watch this and your other similar videos
The irony of showing a before and after compression picture and then run it through compression before presenting it to his viewers.
Thanks, you explained compression to me and I understood you.. That's impressive 👍
Wow it's amazing how we can compress information and send it through electricity and microwave. imagine how quantum computing will compress information?
I can't. A quantum computer is not just a much faster desktop computer
@@celticwinter is it a brain of an ia?
The thumbnail is just great
Great videos I think you touched most subjects!!!
I 100% clicked the video due to to the thumbnail ngl, I still liked the information given by the content of it
Saw a screenshot of the thumbnail on tumblr and I just had to come see this video because of it
Legit a better explanation than my 60,000 dollar cs course...
I've seen an example of this where people will put a lot of a single letter in a file, zip it, the file looks tiny but when unzipped it crashes your computer or something because it's many terabytes in size and the computer doesn't have enough storage space
I don't know why I watched this video but thank you for making it
One correction I'd make is that the difference between a raw photo and a jpg is actually significant. Very significant. Much more than just Metadata. Raw photo files have much more data than can be seen by just viewing a side by side preview on a 8bit monitor. If you've ever edited raw files from a professional camera and also jpgs side by side you'd emmidiately understand how much data is within those raw files. You can take a photo that is visually pitch black and lift exposure to find detail. You can pull highlights down from white. Color banding is pretty much nonexistent if you're editing in 16bit as well. Now once it's edited it will look more or less the exact same to anyone not in the know. Jpgs are great for presenting or even printing nowadays. But not for editing and not for storing all a photos potential
Compression mostly takes advantage of similar colors being side by side. It can encode the variation from the initial shade in fewer bits. Repetition only appears in artificial images like screenshots that contain exactly the same shapes.
top tier thumbnail game as always...DNA would be a good comparison for cat compression
This guy literally uses the same Vim theme I do and explains JPGs via the Bliss wallpaper. How did I stumble onto this near perfect channel?
I don't watch cat videos. some how the algorithm decide to give me cat videos recommendation after watching how file compression works! Thanks u 2
I got clickbaited by those kitty files...😂
Same
Talk about single GPU passtrough on Gentoo, there's a reddit post on r/Gentoo with the title: "gentoo single gpu vfio passthrough scripts"
Yes please do
You mentioned at the beginning of the video that reading a lossy compressed image like a jpeg does not involve decompression. I would argue that decoding any jpeg image involves a sort of decompression. In order to display a jpeg image on a screen, it needs to be decoded which involves computing every single pixel value in the image. Lossy compression just means that the decoded image result will not exactly match the original, but decoding a jpeg still seems like a sort of decompression.
I love how he says "In fact some rural places still can't stream 4k"
I'm from Germany, have live in a small city, have one of the best internet connections in a 20km radius and I sometimes can't even stream 1080p...
I love Germany...
They should have included this technical part in the sitcom it would have been great and exclusive content for nerds shows amount of hard work that goes into building these things
They could have taken wav to mp3 as a use case
Showing that compression in not literal sense but how excess bandwidth is reduced to give the same experience which isn’t detected by the ear
Way better explanation than what my teacher did lmao
Absolutely loving these videos
on TH-cam, i often have to put the video in 1440p to view it in real 1080p... also something funny i noticed is that in the past the limiting factor to compression was processing power! since they could compress it further but decoding it in real time was harder on some machines. tho i do not miss the day where i waited a week for one 90 MB blurry anime episode to finish downloading...
I have been using tiling wm for so long that now I feel uncomfortable when I see that strange thing above each window.
The worst types of compression actually both happened in the 90s: hard drive compression only achieved ratios similar to ZIP and would grind any CPU at the time to a halt. The other was "SoftRam" that was said to compress data in memory to free up space when in actuality it faked the memory space without doing anything. Compression used for ill gotten gains in both cases really...
"pishure" i like the way you pronounce the word "picture"
As we all know, every time a file is compressed a cat is mercilessly squished into a box.
Dude how do you output so many videos on a weekly basis, im geniunly curious, they are super high quality and informative, by all means, please keep it up, but hoooow??
Theres many videos to make
I love how he uses bliss as an example at 6:23
I am trying to use 7zip in my cat and it doesn't compress, any way to fix it?
For kitty compression (comPURRession?), you need a container the size of your desired result.
All you need to do is place them in the same directory and the cat will actually compress into the container all on its own.
@@HokeyBugle Thanks
@@HokeyBugle But it may not want to decompress itself back :-)
Please note the difference between a picture and a pitcher...
Genuinely interesting, thanks for the video.
Man thanks for the video. Probably I have crappy speakers but the sssss and zzzz are way worse than other videos on line.
Awe, I thought you were going to compare the algorithms (zip vs rar) and how they work. :D
Still a great video
Love the Videos 👍🏻❤️
Ah yes, my favourite tech video thumbnail in youtube
This guy knows how to make a good thumbnail. I clicked because cat
1:27 Lossy*
o
i
i
v
thank you for such a wonderful thumbnail
Basically, compression is like making a pancake. Flattening all the good stuff up and remains delicious!
The best way to compress Harry Potter is to take every single vowel and consonant and give each one a coordinate.
heres an even better one, what if you just called all the characters from a tree and didnt write coordinates since thats bloat just write them in order
@@U6kCtBuN Just write the characters based on their HEX values.
Just take .jpg scans of every page 5head then you have no text data to store
Nice vid! You should do a part 2 and include audio compression
Audio compression knowledge is a very professional nature of knowledge and video compression is at least 4-16x more technical.
Knowing this information is only useful to music producers/distributers, and commercial industrial movie makers; for reasons like managing how much truth you can fit on a Blu-ray disk so your customers get more bang for their buck, instead of paying full price for a product soo thick that the dvd player struggles to play it without being over worked.
Knowledge of picture compression is much more relevant to poeple because of how many people actually do graphic design nowadays, and have to actually manage quality control on their finished product.
For video compressions, it can be a nightmare considering how far commercial practices have gone to allow only exact settings to be applied to work on a universal scale. For example, for your dvd players, you have to encode in mkv aac, otherwise it absolutely will not be recognized by any common dvd movie player. That being said, you can't just burn mp4 files to a dvd disk and assume that because the files read on your computer, that your Blu-ray player will look at it the same way, because odds are that your br player won't even know what it's looking at. I think this problem low-key comes from 2 things. 1. Combatting bootleggers (glhf). 2. Making it harder for the smaller entrapaneurs to succeed. [Same as it ever was.)
Companies like Nero, Adobe, and Sony have their software suites that are made to do the heavy lifting of ensuring you are compressing in exact commercial standards that are decided and agreed upon by lawyers, manufactures, and filming companies together. BUT, you may notice how expensive it is buy these programs/tools.
This is a definite example of how it pays to be smart, if you are a DIY hero.
that thumbnail is incredible
Very informative, I learned some stuff!
From 2:32 to 2:45... I would say it is totally wrong to see lossless compression like you did. IMHO it is not a simplification, it is just not true. However, I will think about that in the next days, I could be wrong, but it seems to contradict most I know about information theory. Anyway, thank you so much for your contents, good job!
I tap this video to see the cats.. where are they?
The reason why they're are so many formats for each nature of data presentation, is because different types handle different natures of data differently. Gif handles black and white pictures best, png holds data very accurately and is this very heavy normally, and jpg is like the middle ground that is like the minivan for pictures. It's kind ugly but it can carry a respectable amount of accuracy in relation to the original source, without requiring a ton of gas.
Same goes for videos like mkv, mp4, avi, wmv. When it comes to music or audio, I like my absolute favorite songs to be in hefty flac, instead of mp3 so I can squeeze all of the quality, that the producer meant me to hear, out of it. If it's a normal song 192-320kbps mp3 is plenty smooth /clean enough. If it's an audiobook or old-school game music, 96-128kbps is functionally plenty. Funny enough, audiobooks are commonly marketed out into 96kbps, BUT at 48000khz range instead of the common 44100 in order to retain the depth of audio that can easily be lost when compressing into the fuzzier/blurrier lower density bit rates.
This is usefull to know if you like to share files online and want to shrink enormous files to cut down on a file size, transfer times, or if you feel chained by a mysterious 8mb file size limit.
Rule of thumb for simple folk would be that full quality (lossless) is only completely utilized if you have the equipment for it. If your device is low quality, you will have a harder time noticing the fine difference in lossless vs lossy compressions, and you will be wasting time and energy on something out of reach.
Last note, all compressions have xy or z amount of options to perfect quality vs file size. I recognize wav as having minimal options for audio compression and being rudely hefty in data, compared to flac with more flexibility.
I like mkv, png, jpg, gif, mp3 and flac.
I hate avi, bmp, and wav the most.
I think I have an idea on how you could compress _any_ file down to less than 1% of it's original size. It'd take a lot of computational power (I'm talking entire days of computation), but it could work. I'll keep it a secret for now though. I might learn python to see if this could work
They had us in the first half not gonna lie
@@redstonerti9918 All I'll say is that it involves compressing the raw binary data
I don't know if it's even possible to access this data using python or any regular programming language, so it'd probably require a specialized computer to compress and decompress. All I can do with my little big brain for now is test the idea and lay the foundation in python.
Going down to the basics of how computers work to compute this sounds like a pain, but if it is necessary, so be it
Every video this channel shits out of nowhere is so interesting. you cant do that. i hate this. i dont have time to go thru them all. such an underrated channel. why lossless when im downloading illegal repacks like right now. how do you know so much about the compression method of cats. WHO are you?
1:27 He never talks about the Lossy one. It's always the Lossless one!
Sometimes he says A but actually means B. Overall, great interesting video.
don't need to code the line number, just char number in whole text
135,50 - 6 chars or 2 numbers
33540 - 5 char or one number
that makes reinsertion a lot easier - string are arrays of chars so that should not be a problem. what's the point of doing additional split by "
"?
It's 2021, games can easily take hundreds of gigabytes and we still use JPEGs to save 10 kilobytes
not all games are bloat tho
@@igorswies5913 i know, but most of them are still heavy as fuck
@@iskamag games you download once and you know it will take a long time, meanwhile when you load a website the amount of users that click off is correlated to how fast it loads. A megabyte is a lot when you think of the web
@@igorswies5913 it's not that much heavier if you compress it good enough _looks at apple_
@@iskamag Not gonna lie tho, there are many games compressed from 30gb to 4gb because they are just pure chonk
I recommend you check out Floyd Steinberg dithering, it's literally wizardry
I demand a link to that meme, compressed or not.
P.S. nicely explained, very comprehensive.
The thumbnail is gold
RAW > JPG definitely loses a LOT of data, but the difference is how much that data matters. If you're just viewing it, doesn't really matter at all. But when it comes to people that need to edit photos, having RAW or higher bit-depth data drastically increases how well you can apply certain effects to an image or do color grading.
For example, I took some pictures today with a blue sky. However, my camera by default saves to JPG and is inherently 8 bits per pixel. The result is that when I tried to do some color grading on it, I started to reveal some banding on the blue sky. I can use certain algorithms to deband it, but it would be nicer if I just saved it in RAW and was able to manipulate the data that way, despite the files being significantly larger and slower to save into the camera.
love your vids
Thanks for the video!
Can someone explain how is it that sometimes when a file is compressed from a more complex format to a less complex, the outcoming file is bigger instead of smaller? Usually in image editing format this occurs when setting quality 1:1
Wouldn't it be even more efficient to replace every incidence of a repeated phrase with a unique char that doesn't appear in the original file at all? Like "Harry Potter and the Philosophers Stone" = "x" and replace all incidents of it with x? Or does having the x in a location take up more data than having the cords?
You have a limited set of unique characters unless you want to have the computer create new ones for each thing and then it just depends how many places things will be repeated to determine whether creating new characters uses more data than just listing the coordinates
Language is the lossy compression-decompression cycle of the human mind. Your thoughts are much larger than anything you can convey in speech, but when you tell someone your idea, they decompress it in their own minds and fill in the gaps with their own ideas. We should incorporate this concept into AI to accelerate our progress towards AGI.