How Does Optical Character Recognition (OCR) Work?

แชร์
ฝัง
  • เผยแพร่เมื่อ 6 เม.ย. 2017
  • How do computers read text on a page, and how has the technology improved?
    Freshbooks message: Head over to freshbooks.com/techquickie and don’t forget to enter Tech Quickie in the “How Did You Hear About Us” section when signing up for your free trial.
    Techquickie Merch Store: www.designbyhumans.com/shop/L...
    Techquickie Movie Poster: shop.crowdmade.com/collection...
    Follow: / linustech
    Join the community: linustechtips.com
  • วิทยาศาสตร์และเทคโนโลยี

ความคิดเห็น • 419

  • @DisbelieverH2o
    @DisbelieverH2o 7 ปีที่แล้ว +7

    I gotta say, I really liked this one! Very informative but what really made it for me was the seamless sponsor spot. I'd love to see more in such a way!

  • @TheOriginalFayari
    @TheOriginalFayari 7 ปีที่แล้ว +30

    That was the smoothest transition to a sponsor spot I've ever seen.

  • @freedomofmotion
    @freedomofmotion 7 ปีที่แล้ว +133

    Irish travelers will be deeply hurt that OCR and even you don't accept that dag is a word.
    Has no one ever tried to sell you a dag?
    Or admired your dag?

    • @chantafreak
      @chantafreak 7 ปีที่แล้ว +15

      Ya like dags?

    • @ataksnajpera
      @ataksnajpera 7 ปีที่แล้ว

      Knackers do not even speak english ;)

    • @GewelReal
      @GewelReal 7 ปีที่แล้ว +2

      hey kid, you wanna buy some dags?

    • @EvadingFate
      @EvadingFate 7 ปีที่แล้ว +18

      Oh, dogs. Sure, I like dags. I like caravans more.

    • @chantafreak
      @chantafreak 7 ปีที่แล้ว +5

      This is the post I was waiting for.

  • @sabaamin3179
    @sabaamin3179 2 ปีที่แล้ว

    Just what I was looking for. Good Job!

  • @jamesklein4399
    @jamesklein4399 7 ปีที่แล้ว +412

    FILE FORMATS AS FAST AS POSSIBLE!
    png vs jpg
    mp4 vs mkv
    mp3 vs ...?

    • @laser5317
      @laser5317 7 ปีที่แล้ว +44

      James Klein MP3 vs WAV

    • @RobertHildebrandt
      @RobertHildebrandt 7 ปีที่แล้ว +36

      mp3 vs flac

    • @coffeen8128
      @coffeen8128 7 ปีที่แล้ว +2

      James Klein png keep the quility

    • @smarthd7749
      @smarthd7749 7 ปีที่แล้ว +5

      MP4 and .mkv Is not a file format, IT is a container. And ITS not many difference between mkv and MP4 the only difference is that mkv can hold some more codecs.

    • @cldream
      @cldream 7 ปีที่แล้ว +2

      SmartFyrHD Also Matroska can also embed multiple subtitle formats (SRT, SSA/Advanced SSA)

  • @OMNIA_RH
    @OMNIA_RH 5 ปีที่แล้ว

    Thank so much for you explaining Sir.

  • @DustinRodriguez1_0
    @DustinRodriguez1_0 7 ปีที่แล้ว +7

    OCR was one of the first practical uses of neural networks back in the 70s or 80s. Maybe even earlier? When I took an AI class in college, we wrote a simple OCR neural net and it was pretty easy.

  • @ziyitan8996
    @ziyitan8996 7 ปีที่แล้ว +3

    I love how Luke explains stuff :D

  • @jandresshade
    @jandresshade 7 ปีที่แล้ว +2

    the OCR can use different techniques to recognize character, one is creating a model based on data of different characters and training the sofware to recognize them( Artificial neural networks is an example of this)

  • @TheDyingFox
    @TheDyingFox 7 ปีที่แล้ว

    I was going to ask "How about Voice Recognition next?" but searched your channel, and I'll be damned, 1 year ago, you guys work fast! (Not sure how I've been missing it though, alot of content much?).
    It's a shame neither is "How to create your own Voice Recognition and Optical Character Recognition as fast as possible"

  • @cestsibon2468
    @cestsibon2468 3 ปีที่แล้ว

    This is the first time i've watched a tech video and actually not had a headache after. Waiting for the interpretive google dance hehe

  • @Mr.FastZombie
    @Mr.FastZombie 7 ปีที่แล้ว

    There are also programs for character recognition on your screen.
    Project Naptha is a Chrome extension that can let you copy and paste words in an image.
    And ShareX has OCR that you can use for any program.

  • @SnypeSin
    @SnypeSin 7 ปีที่แล้ว +1

    that's good and all but I would have thought you'd give us and idea of what kind of devices use OCR for consumer/business.

  • @arnatsemtappra3822
    @arnatsemtappra3822 6 ปีที่แล้ว

    Very useful knowledge and easy to understand provided to the new faces of this technology.

  • @narutosasuke30
    @narutosasuke30 5 ปีที่แล้ว

    Which OCR recognizes Handwritten text that you have shown at the end? I couldn't find anything which actually does that within a permissible error rate :/

  • @sebon11
    @sebon11 4 ปีที่แล้ว

    Cool! Thx a lot.

  • @jehdo144
    @jehdo144 7 ปีที่แล้ว

    great video!

  • @ShreyPandya150
    @ShreyPandya150 7 ปีที่แล้ว +7

    When Luke said it wouldn't look as crisp and the video resolution went down I instantly checked if I was at 1080p

  • @ulashofficial
    @ulashofficial 4 ปีที่แล้ว

    Sir can you tell me how can i find duplicate numbers with any OCR app or how should i pursue to make an app for that ?

  • @rry1994
    @rry1994 7 ปีที่แล้ว +1

    I love u guys man

  • @quenjankosky7348
    @quenjankosky7348 7 ปีที่แล้ว

    Well, with OCR, there is an exception for the lack of accuracy. When basic modern OCR was being developed, they made a series of fonts deigned to be as accurate as possible. These fonts were OCR-A and OCR-B. These fonts are super accurate with OCR, and there is usually never any error with them.

  • @Lorten369
    @Lorten369 7 ปีที่แล้ว

    YEES More history please. love knowledge.

  • @macpclinux1
    @macpclinux1 7 ปีที่แล้ว +1

    luke are you finally using linux? i saw that little ubuntu font box :D good job mate!

  • @pearls9133
    @pearls9133 7 ปีที่แล้ว

    could you do videos explaining how mastering audio and video works? (if it doesnt already exist)

  • @HirooKoslov
    @HirooKoslov 7 ปีที่แล้ว +1

    My ScanSnap IX500 usese software to make scans readable. It works pretty well and the IX500 is blisteringly fast.

  • @MiMiOrt
    @MiMiOrt 3 ปีที่แล้ว

    I downloaded but , I thought that it will recognize the different fonts that are someonetimes in just ONE page. Does anyone know an APP/Program that can recognize the font on a scanned document?

  • @JRDev4All
    @JRDev4All 7 ปีที่แล้ว

    You should do an as fast as possible on assistive technologies such as screen readers

  • @KX36
    @KX36 7 ปีที่แล้ว +1

    I did some OCR recently. Tesseract on Linux was the best at recognising the text accurately, but it outputs plain text only. There are 3rd party GUIs, but still none really preserve formatting.
    ABBYY FineReader on Windows (the gold standard for home use) was quite good at preserving formatting but worse at recognising text accurately. My scan was 200 pages of black 12pt Times New Roman on white paper scanned at 300dpi which should be one of the easiest things to process, and it regularly made mistakes on 1 vs l vs I , y vs v, H vs II etc. And these were often in places the dictionary should have easily known what it should have been. How often do you get a lower case L in the middle of a long number or a double upper case I at the start of a word or a v at the end of a word. It took 3 hours to go through the document correcting the mistakes it highlighted. Don't know how many mistakes are in there that it didn't highlight.

  • @fleksimir
    @fleksimir 4 ปีที่แล้ว +1

    Linus ad (pulseway) on linus video. I love this ahahaha

  • @HolarMusic
    @HolarMusic 7 ปีที่แล้ว +1

    Is that an 8k green-screen video? Looks super clean

  • @leivadaros
    @leivadaros 7 ปีที่แล้ว +1

    Haven't read a single comment regarding the video's topic.... only "First", "Notification Squad where you at" and comments trying to be witty.....
    Great video by the way, i love getting general introductory information on the subject of my studies (computer engineer). Keep at it TechQuickie :D

  • @teksight9714
    @teksight9714 7 ปีที่แล้ว

    Good video. Thumbs up!

  • @94213915
    @94213915 5 ปีที่แล้ว

    Can you please tell me about any OCR software for devanagari language .
    Which can cost me less

  • @moenbase1
    @moenbase1 2 ปีที่แล้ว

    In my industry, which is electronics. We use OCR in our automated optical machine to detect component marking on components as small as micro BGA's that are like 400microns wide. It's amazing to see how you can push it's limits. Just, sometimes like when there's a sufficient amount of flux on the components it makes it impossible to read.

    • @Ahmed71616
      @Ahmed71616 2 ปีที่แล้ว

      What is the best scanner that does the same job as your devices

  • @vapexxx
    @vapexxx 7 ปีที่แล้ว

    Luke - I actually watched the ad because of your fresh moves!

  • @MotivationAdonis
    @MotivationAdonis 7 ปีที่แล้ว

    Linus tech tips as fast as possible

  • @howardt12345
    @howardt12345 7 ปีที่แล้ว +2

    Dennis: "You are dancing?"

  • @rediculousman
    @rediculousman 7 ปีที่แล้ว

    convolutional and LSTM neural networks are the cutting edge for these applications

  • @hillppari
    @hillppari 7 ปีที่แล้ว +2

    Google translate app with OCR is pretty nifty when you can translate foreign signs etc.

  • @rushabmehta
    @rushabmehta 7 ปีที่แล้ว

    Can you do video on Virtualization such as hardware, network and storage Virtualization.

  • @dav2mai
    @dav2mai 7 ปีที่แล้ว +70

    Will it also recognize language?
    because "dag" translates to "day" in Danish

    • @Meg_A_Byte
      @Meg_A_Byte 7 ปีที่แล้ว +31

      Is there anything on this world that recognizes danish?

    • @22RH544
      @22RH544 7 ปีที่แล้ว +10

      Nope, as a Dutch guy i can read it just fine, but when it is spoken.................I quit.

    • @TheDyingFox
      @TheDyingFox 7 ปีที่แล้ว +5

      Same result when translated to Swedish xD

    • @Mr.FastZombie
      @Mr.FastZombie 7 ปีที่แล้ว +3

      I would assume it sticks to one language, but some can probably change their language. Also perhaps some could be able to determine the language based on what it has already recognized.

    • @crewskater06
      @crewskater06 7 ปีที่แล้ว +3

      It's from the movie Snatch

  • @JOELwindows7
    @JOELwindows7 7 ปีที่แล้ว +1

    Wow, I saw this video right near before my National exam days.

  • @jean-lucasymptotic5083
    @jean-lucasymptotic5083 7 ปีที่แล้ว

    Speaking of machine learning..... that would make a good techquickie :D

  • @jamilangon5798
    @jamilangon5798 7 ปีที่แล้ว +1

    well google releases a OCRT (optical character recognition translator). which translate even other character aside from ASCII (chinese, japanese, thai and other non alpha character)... it become useful for those who travel and find themselves trap into a place where no one can speak or understand english.

  • @TheZorch
    @TheZorch 7 ปีที่แล้ว

    I've got a Chrome extension that does OCR within images. Sometimes comes in really handy.

  • @pikotechsolutions
    @pikotechsolutions 2 ปีที่แล้ว

    awesome

  • @mickeyhage
    @mickeyhage 7 ปีที่แล้ว

    OCRs font work ive tried them but they dont properly. They dont read encrypted documents they spit out random incorrect letters.

  • @littletomatomonkeysmeeeeel8324
    @littletomatomonkeysmeeeeel8324 ปีที่แล้ว +2

    Highly recommend PaddleOCR! 80 languages supported! Good performance! Easy to use! It would be great if bloggers could do a comparative evaluation of the popular OCR tools.

  • @feni_1553
    @feni_1553 2 ปีที่แล้ว

    Images in video editing?

  • @stayprofessional2453
    @stayprofessional2453 7 ปีที่แล้ว

    Make an episode on network topologies

  • @Exploreyourlife88
    @Exploreyourlife88 3 ปีที่แล้ว

    Thanks

  • @Jinni_SD
    @Jinni_SD 7 ปีที่แล้ว

    I really like Tesseract withHomebrew on Mac for OCR.

  • @Juiceman777
    @Juiceman777 2 ปีที่แล้ว

    I couldn't help but to think of the line from the movie Snatch when Brad Pitt said "ya like dags?" lol

  • @johneygd
    @johneygd 7 ปีที่แล้ว

    But can OCR ever distinguich hand written numbers and letters from eachother? Such as 0's & o's, G's & 6's, 1's & i's ,H's & 4's , j's & i's, 7's & 1's ,0's & 8's etc,,,, because numbers and letters looks similar to eachother.

  • @bradad1111
    @bradad1111 7 ปีที่แล้ว +10

    Saw OCR and immediately thought it had something to do with the Exam Board.

    • @craigmalcom6294
      @craigmalcom6294 7 ปีที่แล้ว

      bradad111 Lool same

    • @StickyBagel
      @StickyBagel 5 ปีที่แล้ว

      So did youtube, i was watching a revision playlist and here i am??

  • @Mihnea729
    @Mihnea729 7 ปีที่แล้ว

    Interesting !

  • @bassmickey
    @bassmickey 7 ปีที่แล้ว

    Funny used OCR last night. What a coincidence

  • @Golde2Good
    @Golde2Good 7 ปีที่แล้ว

    You should explain core parking in the near future.

  • @_Disi
    @_Disi 7 ปีที่แล้ว

    What about if you're trying to copy the line "D'ya like dags?" from Snatch?

  • @NineToFiveGamer
    @NineToFiveGamer 7 ปีที่แล้ว

    I used to use an augmented translator app for my French tests. Shit just about worked half the time

  • @levingthedream
    @levingthedream 7 ปีที่แล้ว

    Is there any awesome free software that do this? Linux or PC. Besides Google drive that is

  • @Quack201
    @Quack201 7 ปีที่แล้ว +1

    So I guess the real question here is why is Luke only wearing socks while recording this? Doesn't Linus give sandals to all the employees?

  • @unguidedone
    @unguidedone 5 ปีที่แล้ว +1

    we need a firefox plugin that will log what youtube upload has paid promotions, skip past it and end the video when teh promotion happens.
    this video is an example of native advertisting

  • @araddadi2
    @araddadi2 5 ปีที่แล้ว

    Watching this 10 minutes before class because I have a home and I’m a highly functional college student

  • @marcusleung8985
    @marcusleung8985 7 ปีที่แล้ว

    what about Fourier transform?

  • @rinoy_43
    @rinoy_43 7 ปีที่แล้ว +1

    I've tried Tesseract. Its free and pretty accurate.

  • @isabellaereshki
    @isabellaereshki 7 ปีที่แล้ว

    I liked your dancing, ignore dennis. great video.

  • @antonjohansson1384
    @antonjohansson1384 7 ปีที่แล้ว +4

    Dag is in swedish day

  • @DanRobards
    @DanRobards 7 ปีที่แล้ว +1

    Man, the ACR was great. Hardly any recoil

  • @angelstrife
    @angelstrife 7 ปีที่แล้ว +15

    Hi! Could you do a FPS 1%low explaination? I have seen so many tech reviewers use this term but i have no idea what it means.

    • @sniperunrepeat752
      @sniperunrepeat752 7 ปีที่แล้ว +18

      Long Nguyen Games tend to have "stutters" (i.e. briefly running out of VRAM on say, a 1060 3gb) which can temporarily bring the minimum fps incredibly low. So 1% lows are used. All they mean is the minimum fps that doesn't factor in the bottom 1% of frames, to give a more realistic minimum

    • @Bayonet1809
      @Bayonet1809 7 ปีที่แล้ว

      Could also be called the 99th percentile.

  • @todddembsky8321
    @todddembsky8321 7 ปีที่แล้ว

    Luke, you have to tell me when you go on tour -- I need to leave the country at that point....

  • @Brusanan
    @Brusanan 7 ปีที่แล้ว

    Not one mention of neural networks?

  • @AndyPhu
    @AndyPhu 7 ปีที่แล้ว

    This isn't in 4k! :(

  • @donaldfilbert4832
    @donaldfilbert4832 7 ปีที่แล้ว +1

    OneNote has a pretty good built in OCR for small text articles - and it's free !! ABBYY FineReader does an excellent job converting image PDFs into searchable text based PDFs !!

  • @MrTuffarts
    @MrTuffarts 7 ปีที่แล้ว

    Dag is a word OCR software would not pick this up spellcheck does not pickup this also

  • @terrybell898
    @terrybell898 7 ปีที่แล้ว

    Micky: Ya like dags?
    Tommy: Dags?
    Micky: Yea, dags
    Tommy: OH, dogs, sure I like dags

  • @DeppImAll
    @DeppImAll 7 ปีที่แล้ว +1

    I mean tbh ... when I write in OneNote some text and microsoft can figure out what I just wrote and convert it into real characters I'm always astonished since my handwriting is horrible.

  • @182ndNegociator
    @182ndNegociator 7 ปีที่แล้ว

    What if it's supposed to say dag, that's also a completely legitimate word used in Australian English, plus it could also be used to describe a Directed Acyclic Graph, also known as a tree.

  • @zcuipylo
    @zcuipylo 7 ปีที่แล้ว

    TPS reports!!!!!! What a perfect example. Almost an easter egg.

  • @joerider5063
    @joerider5063 7 ปีที่แล้ว +1

    Do speech recognition as fast as possible please.

  • @matthewpurcell5498
    @matthewpurcell5498 7 ปีที่แล้ว

    What did Dennis say?

  • @BenPotts
    @BenPotts 7 ปีที่แล้ว

    Nice dancing, Luke

  • @blingerang
    @blingerang 6 ปีที่แล้ว

    3:33 dag is actualy morning in dutch

  • @nitini.764
    @nitini.764 6 ปีที่แล้ว

    I liked this "don't worry, be happy" in your video. Are you a Meher Baba lover too!!!!

  • @GroovingPict
    @GroovingPict 7 ปีที่แล้ว +3

    do you like dags?

  • @sahotaquack1
    @sahotaquack1 7 ปีที่แล้ว +1

    Oxford Cambridge RSA

  • @UNPhantom93
    @UNPhantom93 7 ปีที่แล้ว

    Would be much better if was a fold able or detachable at least to use it as a tablet

  • @SuperManitu1
    @SuperManitu1 7 ปีที่แล้ว

    Tesseract is the best OCR program out there. It is Open Source and runs on all major OS

    • @94213915
      @94213915 5 ปีที่แล้ว

      How can I run it on Windows

  • @thornejman6467
    @thornejman6467 7 ปีที่แล้ว +2

    Thumbs up if anyone else checked the videoquality at 0:36 xD

  • @Shirojm
    @Shirojm 7 ปีที่แล้ว

    So use a normal "photographic" scanner , then use OCR services such as google drive .

  • @bas116677
    @bas116677 7 ปีที่แล้ว +2

    Dag actually means Hey or day in Dutch!

    • @kdm_6799
      @kdm_6799 7 ปีที่แล้ว

      Bas Roelofs dag means bye too

  • @ThePiGuy24
    @ThePiGuy24 7 ปีที่แล้ว +1

    I WANT INTERPRETIVE DANCE TRANSLATOR NOW!!!

  • @Seag-Gaming
    @Seag-Gaming 7 ปีที่แล้ว

    Who else had nostalgia @ 0:36?

  • @user-ni3cm3uq7t
    @user-ni3cm3uq7t 7 ปีที่แล้ว

    Office Lens cad detect the curve of your document now...

  • @svsrkpraveen
    @svsrkpraveen 6 ปีที่แล้ว

    When did Dan Reynolds start doing tech stuff?

  • @MrEsChannelYT
    @MrEsChannelYT 7 ปีที่แล้ว +2

    d'ya like dags?

  • @saisagarmrcool4610
    @saisagarmrcool4610 2 ปีที่แล้ว +1

    it was the most simpler way to understand

  • @1OldWriter
    @1OldWriter 7 ปีที่แล้ว +1

    Techquickie you do know most scanning software do this as part of their operation. If your's doesn't perhaps you should get a new one.

  • @thepalettewhispererasmr1227
    @thepalettewhispererasmr1227 3 ปีที่แล้ว

    Arizona's audit brought me here 🇺🇸

  • @soriatullah8988
    @soriatullah8988 6 ปีที่แล้ว

    thanks form Bangladesh