How Does Optical Character Recognition (OCR) Work?

แชร์
ฝัง
  • เผยแพร่เมื่อ 26 ธ.ค. 2024

ความคิดเห็น • 420

  • @freedomofmotion
    @freedomofmotion 7 ปีที่แล้ว +134

    Irish travelers will be deeply hurt that OCR and even you don't accept that dag is a word.
    Has no one ever tried to sell you a dag?
    Or admired your dag?

    • @chantafreak
      @chantafreak 7 ปีที่แล้ว +15

      Ya like dags?

    • @ataksnajpera
      @ataksnajpera 7 ปีที่แล้ว

      Knackers do not even speak english ;)

    • @GewelReal
      @GewelReal 7 ปีที่แล้ว +2

      hey kid, you wanna buy some dags?

    • @EvadingFate
      @EvadingFate 7 ปีที่แล้ว +18

      Oh, dogs. Sure, I like dags. I like caravans more.

    • @chantafreak
      @chantafreak 7 ปีที่แล้ว +5

      This is the post I was waiting for.

  • @DustinRodriguez1_0
    @DustinRodriguez1_0 7 ปีที่แล้ว +7

    OCR was one of the first practical uses of neural networks back in the 70s or 80s. Maybe even earlier? When I took an AI class in college, we wrote a simple OCR neural net and it was pretty easy.

  • @TheOriginalFayari
    @TheOriginalFayari 7 ปีที่แล้ว +30

    That was the smoothest transition to a sponsor spot I've ever seen.

  • @jamesklein4399
    @jamesklein4399 7 ปีที่แล้ว +411

    FILE FORMATS AS FAST AS POSSIBLE!
    png vs jpg
    mp4 vs mkv
    mp3 vs ...?

    • @laser5317
      @laser5317 7 ปีที่แล้ว +44

      James Klein MP3 vs WAV

    • @RobertHildebrandt
      @RobertHildebrandt 7 ปีที่แล้ว +36

      mp3 vs flac

    • @coffeen8128
      @coffeen8128 7 ปีที่แล้ว +2

      James Klein png keep the quility

    • @smarthd7749
      @smarthd7749 7 ปีที่แล้ว +5

      MP4 and .mkv Is not a file format, IT is a container. And ITS not many difference between mkv and MP4 the only difference is that mkv can hold some more codecs.

    • @cldream
      @cldream 7 ปีที่แล้ว +2

      SmartFyrHD Also Matroska can also embed multiple subtitle formats (SRT, SSA/Advanced SSA)

  • @jandresshade
    @jandresshade 7 ปีที่แล้ว +2

    the OCR can use different techniques to recognize character, one is creating a model based on data of different characters and training the sofware to recognize them( Artificial neural networks is an example of this)

  • @ShreyPandya150
    @ShreyPandya150 7 ปีที่แล้ว +7

    When Luke said it wouldn't look as crisp and the video resolution went down I instantly checked if I was at 1080p

  • @ziyitan8996
    @ziyitan8996 7 ปีที่แล้ว +3

    I love how Luke explains stuff :D

  • @dav2mai
    @dav2mai 7 ปีที่แล้ว +70

    Will it also recognize language?
    because "dag" translates to "day" in Danish

    • @Meg_A_Byte
      @Meg_A_Byte 7 ปีที่แล้ว +31

      Is there anything on this world that recognizes danish?

    • @22RH544
      @22RH544 7 ปีที่แล้ว +10

      Nope, as a Dutch guy i can read it just fine, but when it is spoken.................I quit.

    • @TheDyingFox
      @TheDyingFox 7 ปีที่แล้ว +5

      Same result when translated to Swedish xD

    • @Mr.FastZombie
      @Mr.FastZombie 7 ปีที่แล้ว +3

      I would assume it sticks to one language, but some can probably change their language. Also perhaps some could be able to determine the language based on what it has already recognized.

    • @crewskater06
      @crewskater06 7 ปีที่แล้ว +3

      It's from the movie Snatch

  • @moenbase1
    @moenbase1 3 ปีที่แล้ว

    In my industry, which is electronics. We use OCR in our automated optical machine to detect component marking on components as small as micro BGA's that are like 400microns wide. It's amazing to see how you can push it's limits. Just, sometimes like when there's a sufficient amount of flux on the components it makes it impossible to read.

    • @Ahmed71616
      @Ahmed71616 2 ปีที่แล้ว

      What is the best scanner that does the same job as your devices

  • @cestsibon2468
    @cestsibon2468 3 ปีที่แล้ว

    This is the first time i've watched a tech video and actually not had a headache after. Waiting for the interpretive google dance hehe

  • @Chris.Woodcock
    @Chris.Woodcock 7 ปีที่แล้ว +7

    I gotta say, I really liked this one! Very informative but what really made it for me was the seamless sponsor spot. I'd love to see more in such a way!

  • @HolarMusic
    @HolarMusic 7 ปีที่แล้ว +1

    Is that an 8k green-screen video? Looks super clean

  • @rediculousman
    @rediculousman 7 ปีที่แล้ว

    convolutional and LSTM neural networks are the cutting edge for these applications

  • @littletomatomonkeysmeeeeel8324
    @littletomatomonkeysmeeeeel8324 2 ปีที่แล้ว +2

    Highly recommend PaddleOCR! 80 languages supported! Good performance! Easy to use! It would be great if bloggers could do a comparative evaluation of the popular OCR tools.

  • @sabaamin3179
    @sabaamin3179 3 ปีที่แล้ว

    Just what I was looking for. Good Job!

  • @CatBroiler
    @CatBroiler 7 ปีที่แล้ว +1

    My ScanSnap IX500 usese software to make scans readable. It works pretty well and the IX500 is blisteringly fast.

  • @bradad1111
    @bradad1111 7 ปีที่แล้ว +10

    Saw OCR and immediately thought it had something to do with the Exam Board.

    • @craigmalcom6294
      @craigmalcom6294 7 ปีที่แล้ว

      bradad111 Lool same

    • @StickyBagel
      @StickyBagel 6 ปีที่แล้ว

      So did youtube, i was watching a revision playlist and here i am??

  • @hillppari
    @hillppari 7 ปีที่แล้ว +2

    Google translate app with OCR is pretty nifty when you can translate foreign signs etc.

  • @OMNIA_RH
    @OMNIA_RH 6 ปีที่แล้ว

    Thank so much for you explaining Sir.

  • @MotivationAdonis
    @MotivationAdonis 7 ปีที่แล้ว

    Linus tech tips as fast as possible

  • @TheDyingFox
    @TheDyingFox 7 ปีที่แล้ว

    I was going to ask "How about Voice Recognition next?" but searched your channel, and I'll be damned, 1 year ago, you guys work fast! (Not sure how I've been missing it though, alot of content much?).
    It's a shame neither is "How to create your own Voice Recognition and Optical Character Recognition as fast as possible"

  • @fleksimir
    @fleksimir 4 ปีที่แล้ว +1

    Linus ad (pulseway) on linus video. I love this ahahaha

  • @quenjankosky7348
    @quenjankosky7348 7 ปีที่แล้ว

    Well, with OCR, there is an exception for the lack of accuracy. When basic modern OCR was being developed, they made a series of fonts deigned to be as accurate as possible. These fonts were OCR-A and OCR-B. These fonts are super accurate with OCR, and there is usually never any error with them.

  • @SnypeSin
    @SnypeSin 7 ปีที่แล้ว +1

    that's good and all but I would have thought you'd give us and idea of what kind of devices use OCR for consumer/business.

  • @Mr.FastZombie
    @Mr.FastZombie 7 ปีที่แล้ว

    There are also programs for character recognition on your screen.
    Project Naptha is a Chrome extension that can let you copy and paste words in an image.
    And ShareX has OCR that you can use for any program.

  • @Seag-Gaming
    @Seag-Gaming 7 ปีที่แล้ว

    Who else had nostalgia @ 0:36?

  • @JRDev4All
    @JRDev4All 7 ปีที่แล้ว

    You should do an as fast as possible on assistive technologies such as screen readers

  • @saisagarmrcool4610
    @saisagarmrcool4610 3 ปีที่แล้ว +1

    it was the most simpler way to understand

  • @KX36
    @KX36 7 ปีที่แล้ว +1

    I did some OCR recently. Tesseract on Linux was the best at recognising the text accurately, but it outputs plain text only. There are 3rd party GUIs, but still none really preserve formatting.
    ABBYY FineReader on Windows (the gold standard for home use) was quite good at preserving formatting but worse at recognising text accurately. My scan was 200 pages of black 12pt Times New Roman on white paper scanned at 300dpi which should be one of the easiest things to process, and it regularly made mistakes on 1 vs l vs I , y vs v, H vs II etc. And these were often in places the dictionary should have easily known what it should have been. How often do you get a lower case L in the middle of a long number or a double upper case I at the start of a word or a v at the end of a word. It took 3 hours to go through the document correcting the mistakes it highlighted. Don't know how many mistakes are in there that it didn't highlight.

  • @jamilangon5798
    @jamilangon5798 7 ปีที่แล้ว +1

    well google releases a OCRT (optical character recognition translator). which translate even other character aside from ASCII (chinese, japanese, thai and other non alpha character)... it become useful for those who travel and find themselves trap into a place where no one can speak or understand english.

  • @thornejman6467
    @thornejman6467 7 ปีที่แล้ว +2

    Thumbs up if anyone else checked the videoquality at 0:36 xD

  • @rinoy_43
    @rinoy_43 7 ปีที่แล้ว +1

    I've tried Tesseract. Its free and pretty accurate.

  • @unguidedone
    @unguidedone 5 ปีที่แล้ว +1

    we need a firefox plugin that will log what youtube upload has paid promotions, skip past it and end the video when teh promotion happens.
    this video is an example of native advertisting

  • @Golde2Good
    @Golde2Good 7 ปีที่แล้ว

    You should explain core parking in the near future.

  • @stayprofessional2453
    @stayprofessional2453 7 ปีที่แล้ว

    Make an episode on network topologies

  • @JOELwindows7
    @JOELwindows7 7 ปีที่แล้ว +1

    Wow, I saw this video right near before my National exam days.

  • @narutosasuke30
    @narutosasuke30 5 ปีที่แล้ว

    Which OCR recognizes Handwritten text that you have shown at the end? I couldn't find anything which actually does that within a permissible error rate :/

  • @ulashofficial
    @ulashofficial 4 ปีที่แล้ว

    Sir can you tell me how can i find duplicate numbers with any OCR app or how should i pursue to make an app for that ?

  • @Odinvalknir
    @Odinvalknir 7 ปีที่แล้ว

    Micky: Ya like dags?
    Tommy: Dags?
    Micky: Yea, dags
    Tommy: OH, dogs, sure I like dags

  • @howardt12345
    @howardt12345 7 ปีที่แล้ว +2

    Dennis: "You are dancing?"

  • @leivadaros
    @leivadaros 7 ปีที่แล้ว +1

    Haven't read a single comment regarding the video's topic.... only "First", "Notification Squad where you at" and comments trying to be witty.....
    Great video by the way, i love getting general introductory information on the subject of my studies (computer engineer). Keep at it TechQuickie :D

  • @jean-lucasymptotic5083
    @jean-lucasymptotic5083 7 ปีที่แล้ว

    Speaking of machine learning..... that would make a good techquickie :D

  • @johneygd
    @johneygd 7 ปีที่แล้ว

    But can OCR ever distinguich hand written numbers and letters from eachother? Such as 0's & o's, G's & 6's, 1's & i's ,H's & 4's , j's & i's, 7's & 1's ,0's & 8's etc,,,, because numbers and letters looks similar to eachother.

  • @MiMiOrt
    @MiMiOrt 3 ปีที่แล้ว

    I downloaded but , I thought that it will recognize the different fonts that are someonetimes in just ONE page. Does anyone know an APP/Program that can recognize the font on a scanned document?

  • @angelstrife
    @angelstrife 7 ปีที่แล้ว +15

    Hi! Could you do a FPS 1%low explaination? I have seen so many tech reviewers use this term but i have no idea what it means.

    • @sniperunrepeat752
      @sniperunrepeat752 7 ปีที่แล้ว +18

      Long Nguyen Games tend to have "stutters" (i.e. briefly running out of VRAM on say, a 1060 3gb) which can temporarily bring the minimum fps incredibly low. So 1% lows are used. All they mean is the minimum fps that doesn't factor in the bottom 1% of frames, to give a more realistic minimum

    • @Bayonet1809
      @Bayonet1809 7 ปีที่แล้ว

      Could also be called the 99th percentile.

  • @TheZorch
    @TheZorch 7 ปีที่แล้ว

    I've got a Chrome extension that does OCR within images. Sometimes comes in really handy.

  • @Quack201
    @Quack201 7 ปีที่แล้ว +1

    So I guess the real question here is why is Luke only wearing socks while recording this? Doesn't Linus give sandals to all the employees?

  • @DanRobards
    @DanRobards 7 ปีที่แล้ว +1

    Man, the ACR was great. Hardly any recoil

  • @Jinni_SD
    @Jinni_SD 7 ปีที่แล้ว

    I really like Tesseract withHomebrew on Mac for OCR.

  • @zcuipylo
    @zcuipylo 7 ปีที่แล้ว

    TPS reports!!!!!! What a perfect example. Almost an easter egg.

  • @feni_1553
    @feni_1553 2 ปีที่แล้ว

    Images in video editing?

  • @mickeyhage
    @mickeyhage 7 ปีที่แล้ว

    OCRs font work ive tried them but they dont properly. They dont read encrypted documents they spit out random incorrect letters.

  • @macpclinux1
    @macpclinux1 7 ปีที่แล้ว +1

    luke are you finally using linux? i saw that little ubuntu font box :D good job mate!

  • @bassmickey
    @bassmickey 7 ปีที่แล้ว

    Funny used OCR last night. What a coincidence

  • @Pi7on
    @Pi7on 7 ปีที่แล้ว

    why isn't there an OCR software to scan videos?
    I mean, there are literally one or two, and they can't do much.
    It should be relatively simple since a video is composed by images.
    But I can't find ONE program that does that.
    And why doesn't Google release a standalone app/software to OCR things since it's OCR is the best? I'd pay for that.

    • @barnstormer322
      @barnstormer322 7 ปีที่แล้ว

      I don't think OCR on video is all that practical. Plus you'd have to do things like work out if it's the same text but scrolling between frames, recognise transitions and animations, and also deal with the processor time that analysing at least 24 frames for every second would take.

    • @Pi7on
      @Pi7on 7 ปีที่แล้ว

      barnstormer322 Well, yes but it's not mandatory to analyze every frame in real time, even if i think Google could do it if you have a good upload speed to upload frames to them in real time.
      also I think It would be VERY useful for the anime community, and not only for that.
      to distinguish text from animations should not be that difficult since there are freeware that already do that ,it just need to be improved a bit.

  • @Reign14forever
    @Reign14forever 6 ปีที่แล้ว

    Watching this 10 minutes before class because I have a home and I’m a highly functional college student

  • @arnatsemtappra3822
    @arnatsemtappra3822 6 ปีที่แล้ว

    Very useful knowledge and easy to understand provided to the new faces of this technology.

  • @Lorten369
    @Lorten369 7 ปีที่แล้ว

    YEES More history please. love knowledge.

  • @9421Bro
    @9421Bro 5 ปีที่แล้ว

    Can you please tell me about any OCR software for devanagari language .
    Which can cost me less

  • @Juiceman777
    @Juiceman777 3 ปีที่แล้ว

    I couldn't help but to think of the line from the movie Snatch when Brad Pitt said "ya like dags?" lol

  • @isabellaereshki
    @isabellaereshki 7 ปีที่แล้ว

    I liked your dancing, ignore dennis. great video.

  • @DeppImAll
    @DeppImAll 7 ปีที่แล้ว +1

    I mean tbh ... when I write in OneNote some text and microsoft can figure out what I just wrote and convert it into real characters I'm always astonished since my handwriting is horrible.

  • @rushabmehta
    @rushabmehta 7 ปีที่แล้ว

    Can you do video on Virtualization such as hardware, network and storage Virtualization.

  • @MrTuffarts
    @MrTuffarts 7 ปีที่แล้ว

    Dag is a word OCR software would not pick this up spellcheck does not pickup this also

  • @joerider5063
    @joerider5063 7 ปีที่แล้ว +1

    Do speech recognition as fast as possible please.

  • @SuperManitu1
    @SuperManitu1 7 ปีที่แล้ว

    Tesseract is the best OCR program out there. It is Open Source and runs on all major OS

    • @9421Bro
      @9421Bro 5 ปีที่แล้ว

      How can I run it on Windows

  • @182ndNegociator
    @182ndNegociator 7 ปีที่แล้ว

    What if it's supposed to say dag, that's also a completely legitimate word used in Australian English, plus it could also be used to describe a Directed Acyclic Graph, also known as a tree.

  • @antonjohansson1384
    @antonjohansson1384 7 ปีที่แล้ว +4

    Dag is in swedish day

  • @jehdo144
    @jehdo144 7 ปีที่แล้ว

    great video!

  • @NineToFiveGamer
    @NineToFiveGamer 7 ปีที่แล้ว

    I used to use an augmented translator app for my French tests. Shit just about worked half the time

  • @Exploreyourlife88
    @Exploreyourlife88 4 ปีที่แล้ว

    Thanks

  • @pearls9133
    @pearls9133 7 ปีที่แล้ว

    could you do videos explaining how mastering audio and video works? (if it doesnt already exist)

  • @_Disi
    @_Disi 7 ปีที่แล้ว

    What about if you're trying to copy the line "D'ya like dags?" from Snatch?

  • @7EEVEE
    @7EEVEE 7 ปีที่แล้ว

    most scanners look pretty good to be fair

  • @AndyPhu
    @AndyPhu 7 ปีที่แล้ว

    This isn't in 4k! :(

  • @thepalettewhispererasmr1227
    @thepalettewhispererasmr1227 3 ปีที่แล้ว

    Arizona's audit brought me here 🇺🇸

  • @marcusleung8985
    @marcusleung8985 7 ปีที่แล้ว

    what about Fourier transform?

  • @bas116677
    @bas116677 7 ปีที่แล้ว +2

    Dag actually means Hey or day in Dutch!

    • @kdm_6799
      @kdm_6799 7 ปีที่แล้ว

      Bas Roelofs dag means bye too

  • @James-qd3mw
    @James-qd3mw 7 ปีที่แล้ว

    the reason that this is my favorite tech channel is because of things like the interpretive dance incident

  • @sebon11
    @sebon11 4 ปีที่แล้ว

    Cool! Thx a lot.

  • @Brusanan
    @Brusanan 7 ปีที่แล้ว

    Not one mention of neural networks?

  • @GroovingPict
    @GroovingPict 7 ปีที่แล้ว +3

    do you like dags?

  • @donaldfilbert4832
    @donaldfilbert4832 7 ปีที่แล้ว +1

    OneNote has a pretty good built in OCR for small text articles - and it's free !! ABBYY FineReader does an excellent job converting image PDFs into searchable text based PDFs !!

  • @rry1994
    @rry1994 7 ปีที่แล้ว +1

    I love u guys man

  • @ThePiGuy24
    @ThePiGuy24 7 ปีที่แล้ว +1

    I WANT INTERPRETIVE DANCE TRANSLATOR NOW!!!

  • @Shirojm
    @Shirojm 7 ปีที่แล้ว

    So use a normal "photographic" scanner , then use OCR services such as google drive .

  • @teksight9714
    @teksight9714 7 ปีที่แล้ว

    Good video. Thumbs up!

  • @aamir_xo
    @aamir_xo 7 ปีที่แล้ว

    what did Dennis say?

  • @1OldWriter
    @1OldWriter 7 ปีที่แล้ว +1

    Techquickie you do know most scanning software do this as part of their operation. If your's doesn't perhaps you should get a new one.

  • @levingthedream
    @levingthedream 7 ปีที่แล้ว

    Is there any awesome free software that do this? Linux or PC. Besides Google drive that is

  • @unvergebeneid
    @unvergebeneid 7 ปีที่แล้ว

    4:11 That's not actually writing, is it? Because if it is, it beats _my_ character recognition.

  • @aislius9200
    @aislius9200 7 ปีที่แล้ว

    Printing costs like 150 dollars for new ink if you go to retail, if you manage to go online it costs like 10-20 bucks. What the actual fuck?!!?

  • @metashrew
    @metashrew 7 ปีที่แล้ว +2

    If the software were dutch, the word would be "dag" (which means day in english), and not "dog".

  • @Dantastic
    @Dantastic 7 ปีที่แล้ว

    If you like in Rhode Island, "dag" is more accurate than "dog." e.g. "I'm gonna take my dag for a walk in the paak."

  • @sahotaquack1
    @sahotaquack1 7 ปีที่แล้ว +1

    Oxford Cambridge RSA

  • @supervegito2277
    @supervegito2277 7 ปีที่แล้ว

    3:38 soft g, its day in danish actually.

    • @22RH544
      @22RH544 7 ปีที่แล้ว

      Also in Dutch, Swedish & Norwegian

  • @SUNDOWNER88
    @SUNDOWNER88 7 ปีที่แล้ว

    Background noise at 4:26

  • @blingerang
    @blingerang 6 ปีที่แล้ว

    3:33 dag is actualy morning in dutch

  • @tamasberki3330
    @tamasberki3330 5 ปีที่แล้ว

    Who else checked the resolution at 0:37?

  • @megabithero
    @megabithero 7 ปีที่แล้ว

    My Galaxy S3 could do this. Made lab reports super manageable.