IMAGE TEXT TO SPEECH CONVERSION USING OPTICAL CHARACTER RECOGNITION TECHNIQUE IN RASPBERRY PI

แชร์
ฝัง
  • เผยแพร่เมื่อ 24 มิ.ย. 2020
  • Full documentation of the project :
    electronicsworkshops.com/2020...
    instagram link:
    / electronicsworkshop111
    facebook link:
    / electronics-. .
    In our planet of 7.4 billion humans, 285 million are visually impaired out of whom 39 million people are completely blind, i.e. have no vision at all, and 246 million have mild or severe visual impairment (WHO, 2011). It has been predicted that by the year 2020, these numbers will rise to 75 million blind and 200 million people with visual impairment [5]. As reading is of prime importance in the daily routine (text being present everywhere from newspapers, commercial products, sign-boards, digital screens etc.) of mankind, visually impaired people face a lot of difficulties. Our device assists the visually impaired by reading out the text to them. There have been numerous advances in this area to help visually impaired to read without much difficulties. The existing technologies use a similar approach as mentioned in this paper, but they have certain drawbacks. Firstly, the input images taken in previous works have no complex background, i.e. the test inputs are printed on a plain white sheet. It is easy to convert such images to text without pre-processing, but such an approach will not be useful in a real-time system [1][2][3]. Also, in methods that use segmentation of characters for recognition, the characters will be read out as individual letter and not a complete word. This gives an undesirable audio output to the user. For our project, we wanted the device to be able to detect the text from any complex background and read it efficiently. Inspired by the methodology used by Apps such as “CamScanner”, we assumed that in any complex background, the text will most likely be enclosed in a box eg billboards, screens etc. By being able to detect a region enclosing four points, we assume that this is the required region containing the text. This is done using warping and cropping. The new image obtained then undergoes edge detection and a boundary is then drawn over the letters. This gives it more definition. The image is then processed by the OCR and TTS to give audio ouput.
    HIT THE LIKE BUTTON IF YOU LIKE.
    ALL YOUR COMMENT ARE APPRECIATED
    SHARE_IT_IF_YOU_WANT_YOUR_FRIENDS_TO_WATCH.

ความคิดเห็น • 7

  • @dmitchell63
    @dmitchell63 3 ปีที่แล้ว +2

    This is cool, but that piano is much louder than the text being read.

    • @electronicsworkshop7131
      @electronicsworkshop7131  3 ปีที่แล้ว +1

      thank you so much for your compliment we apologize for that it was due to misplacement of our speaker.

  • @navamitelsang5226
    @navamitelsang5226 7 หลายเดือนก่อน

    SIR AN I GET THE CODES PLEASE. I AM DOING THE SAME PROJET

  • @rda2987
    @rda2987 3 ปีที่แล้ว +1

    Sir I am doing a same project can I get codes for the same pls

    • @electronicsworkshop7131
      @electronicsworkshop7131  3 ปีที่แล้ว

      electronicsworkshops.com/2020/06/24/image-text-to-speech-conversion-using-optical-character-recognition-technique-in-raspberry-pi/

    • @dhanushsunkara2980
      @dhanushsunkara2980 3 ปีที่แล้ว +1

      i m also doing same project but doing changes...only one to one lang here..i changing into one to many lang...and title also different and it is used for different purpose

    • @electronicsworkshop7131
      @electronicsworkshop7131  2 ปีที่แล้ว

      @@dhanushsunkara2980 all the best stay tuned with my channel