“Automation 2.0 coming…No more boring data entry job”

แชร์
ฝัง
  • เผยแพร่เมื่อ 27 พ.ย. 2024

ความคิดเห็น • 130

  • @USBEN.
    @USBEN. ปีที่แล้ว +29

    I think yours is the only channel that shows practical usage for gpt and automation with existing tools.
    I learn a lot here, thankyou man.

    • @CodingAfterThirty
      @CodingAfterThirty ปีที่แล้ว +2

      That is the fact, this is my go to channel for learning.

  • @Cygx
    @Cygx ปีที่แล้ว +44

    This is an incredible saas product on its own. Now you just need a easy to use frontend for the user to take pictures and export a well defined excel spreadsheet. Incredible work!

    • @AIJasonZ
      @AIJasonZ  ปีที่แล้ว +6

      Thanks! Good idea to turn this into a micro sass with simple scanning function

    • @amandamate9117
      @amandamate9117 ปีที่แล้ว +7

      the bottleneck is: no company want to send private highly sensitive data as cleartext to openAIs chatGPT to process. Not in USA, not in Europe.

    • @BUILDINGFUTUREX
      @BUILDINGFUTUREX ปีที่แล้ว

      @@amandamate9117 maybe some encrypted solution

    • @antoninleroy3863
      @antoninleroy3863 ปีที่แล้ว

      @@amandamate9117 Any large company could afford to run an open source LLM internally on a private network.
      EDIT: or even private microsolft openAi endpoints

    • @sw4rmify
      @sw4rmify ปีที่แล้ว

      @@amandamate9117the OpenAI API data is never used for training etc…

  • @VRDivision
    @VRDivision ปีที่แล้ว +4

    dude you're on fire! keep it up, I can't wait to apply knowledge from your videos

    • @AIJasonZ
      @AIJasonZ  ปีที่แล้ว

      Thank you!!

  • @ZahedAshkara-q6u
    @ZahedAshkara-q6u ปีที่แล้ว +23

    Hey Jason, you are the greatest teacher I have encountered! This is exactly how people need to learn to build AI apps. You're going to be very successful if you keep teaching us like this. Thank you for all the great work, man!

  • @okt4k
    @okt4k ปีที่แล้ว +2

    Thank you for all of these videos bro, please keep making them!

  • @KundanKumar-xu4kd
    @KundanKumar-xu4kd ปีที่แล้ว +1

    Thank you for exposing me to Make, just signed up. great tool will use this in a lot my projects, and it will make my life a lot easier.

  • @KarlJuhl
    @KarlJuhl ปีที่แล้ว +4

    Great vidoe Jason, you are awesome at explaining these things. I personally support doing more of these guides in core coding format like here, it is super helpful for understanding.

  • @jayhu6075
    @jayhu6075 ปีที่แล้ว

    As a beginner in ML I am very glad to find your channel. I learn a lot and you from each topic everything understandable. Many thanks

  • @salamina_
    @salamina_ 8 หลายเดือนก่อน

    great content! thank you for taking the time to put together and share!

  • @AngusLou
    @AngusLou ปีที่แล้ว +1

    Jason is always giving amazing and practical use cases

  • @jasonfinance
    @jasonfinance ปีที่แล้ว +2

    Thank you Jason. Great work as always. Very practical user case

  • @RyckmanApps
    @RyckmanApps ปีที่แล้ว

    Your videos are pretty helpful. The way you logically explain each tool is helpful.

  • @chetans1557
    @chetans1557 ปีที่แล้ว +1

    I was here before he was subscribed by every AI enthusiast
    Incredible video as always, thank you!

  • @Scooterboy_and_others109
    @Scooterboy_and_others109 ปีที่แล้ว +1

    Fantastic simple walk-thru of e2e Business Scenario

  • @korywilson3005
    @korywilson3005 ปีที่แล้ว +1

    This content is so GREAT. Thank you. Very transparent.

  • @asithakoralage628
    @asithakoralage628 ปีที่แล้ว +1

    Hi Jason, fantastic video, I learned a lot from your content. Please keep up the good work. Cheers

  • @epireve
    @epireve ปีที่แล้ว +1

    Incredible work as always Jason!
    P/s : I just realised Jin Yang and you has over 90% resemblance. What a doppelgänger! Minus the hair of course

  • @AI_News_Français
    @AI_News_Français ปีที่แล้ว

    man, your content is brilliant, by the way the thumbnails ROCK :)

  • @mikepetersen5662
    @mikepetersen5662 ปีที่แล้ว +1

    That is amazing. Thank you so much for this great code and tutorial!

  • @mlg4035
    @mlg4035 ปีที่แล้ว

    Freaking awesome video, Jason! So much info! Keep these videos coming!

  • @ShawnCady
    @ShawnCady ปีที่แล้ว +1

    Another great video, Jason!

  • @lucyn.7501
    @lucyn.7501 ปีที่แล้ว

    Another wonderful tutorial thank you Jason so much ❤. In the perfect world, there should be no manual intervention, the POS machine should just talk to the bank, and AI in the middle transforming the semi/un-structured data into structured data, which then get feed into your online banking and accounting software. Scanning is a serious pain when the transaction gets large and digitalise receipts save a lot of trees and ink too 😂

  • @АндрейБогаев-ы2я
    @АндрейБогаев-ы2я ปีที่แล้ว +1

    Good Job Jason. Top content🔥

  • @MichaelHoughton_
    @MichaelHoughton_ ปีที่แล้ว +1

    AWS has a really good system to extract data from a document and it cods $1.50 per 1000 pages... so its super efficient

    • @AIJasonZ
      @AIJasonZ  ปีที่แล้ว

      oh nice, didnt know that, will give it a try! whats the name of the service?

  • @kevon217
    @kevon217 ปีที่แล้ว +1

    Another banger tutorial, thanks!

  • @harrisongovan7623
    @harrisongovan7623 ปีที่แล้ว +1

    Brother, you’re amazing

  • @kickingnscreaming
    @kickingnscreaming ปีที่แล้ว +2

    Thanks!

  • @micbab-vg2mu
    @micbab-vg2mu ปีที่แล้ว +1

    Thank you for the video.

  • @carkawalakhatulistiwa
    @carkawalakhatulistiwa ปีที่แล้ว +2

    all repetitive work using computers can be automated within 2 years by ai.

  • @miltondavilaharjula
    @miltondavilaharjula ปีที่แล้ว

    Awesome tutorial !! 🎉

  • @avi7278
    @avi7278 9 หลายเดือนก่อน

    when Jason drops a video I can't click fast enough

  • @齐洋-o2s
    @齐洋-o2s ปีที่แล้ว

    Seriously I mean this is great video for educational purposes and I have two specific questions 1’ have you got access to GPT 4 api 2’ they are great educational contents, have you ever thing about productizing your idea such as this one, I mean filling for tax return seems to be a high demand for a lot of people

  • @enceladus96
    @enceladus96 ปีที่แล้ว +1

    Exactly what I’m looking for 😭

  • @readmarketings9061
    @readmarketings9061 ปีที่แล้ว +2

    waiting for this

  • @hazema.6150
    @hazema.6150 ปีที่แล้ว +2

    One of the key takeaways from this amazing tutorial is: AI by itself will not replace you but rather one who uses AI effectively is the one will insha’Allah (God willing). So go learn how to use AI in your day-to-day job now and impress your employers with your ideas.
    Great tutorial Jason.

  • @dhaw
    @dhaw ปีที่แล้ว +1

    This is Amazing !

  • @devklepacki
    @devklepacki ปีที่แล้ว +4

    I'm curious how's the accuracy of pytesseract. I did the exact same project a long time ago (it's in production up to this date) and we used Google Vision API to perform OCR. The biggest issue is that although the accuracy is at idk like 99.9% it's still at least one wrong character recognized in each invoice! And since there's a lot of numeric data (prices, VAT values, amounts, different units of measures) writing validation for this all took more time than the rest of the project. You never actually knew what the OCR will return and you REALLY don't want to put the wrong data for accounting.

    • @devklepacki
      @devklepacki ปีที่แล้ว

      And actually here's the thing, in the video the Transaction ID wasn't recognized 100% correctly

    • @TheParagamer
      @TheParagamer ปีที่แล้ว

      @@devklepacki You're right it's missing an extra W @5:46, eagle eyes🦅! I suppose you could feed this output to another llm checking whether sequences numbers of another run match, repeating until however accuracy you want. It wouldn't ever be perfect tho and would add up quickly💸

    • @andrewxzvxcud2
      @andrewxzvxcud2 ปีที่แล้ว

      yh thats one of the problems w all these ai apps, problems where u need to be 100% accurate or there could be big consequences is hard to actually solve with ai

    • @AIJasonZ
      @AIJasonZ  ปีที่แล้ว

      @@TheParagamer ohh Having 2 OCR service to do text extraction & LLM to validate, this is 🧠

    • @AIJasonZ
      @AIJasonZ  ปีที่แล้ว +1

      @@devklepacki ahh good catch! i really like @TheParagamer idea on having 2 service for validating the result, will give it a try

  • @AkulSamartha
    @AkulSamartha ปีที่แล้ว

    You are a Genuis bro. 👏

  • @ryancoble-neal6186
    @ryancoble-neal6186 ปีที่แล้ว +5

    Hi Jason, when I try to run your code I encounter the following error: PdfiumError: Failed to load document (PDFium: File access error). Do you know what might be causing this and how to rectify it? Thanks

    • @krasimirivanov6627
      @krasimirivanov6627 ปีที่แล้ว +1

      +1 I am facing the same error. Appreciate if someone has an advice on how to solve it

    • @albertalbert5785
      @albertalbert5785 ปีที่แล้ว +1

      i also have the same error :/, someone help pls

    • @chandoyoutube-o1d
      @chandoyoutube-o1d ปีที่แล้ว +1

      facing same error

    • @bibinbalakrishnan
      @bibinbalakrishnan ปีที่แล้ว +1

      The NamedTemporaryFile is getting deleted. You can change it like - with NamedTemporaryFile(suffix='.pdf',delete=False) as f:

    • @kenhtinhthuc
      @kenhtinhthuc ปีที่แล้ว

      Thanks. It worked for me.@@bibinbalakrishnan

  • @nathan_leo
    @nathan_leo ปีที่แล้ว

    This is amazing, love all your content, thank you! Would you be able to make this video’s git public? Also, love the thumbnails 😂

  • @MK-jn9uu
    @MK-jn9uu ปีที่แล้ว +1

    🤬 why am I having so much trouble importing? What am I missing?

  • @umeshtiwari9249
    @umeshtiwari9249 ปีที่แล้ว

    believe me you do fantastic AI use case to handle business processes which anyone can use to get a job in AI. It will be great if you can do more use case in AI. would be really helpful to me and many others. At the end thanks a lot. 😃

  • @alessandroceccarelli6889
    @alessandroceccarelli6889 ปีที่แล้ว

    Best llm content on the web!
    Why OCR instead of native pdf text retrieval though? Don’t you risk to incur into ocr-related mistakes?
    I mean, you already have the “real” text! Thank you

  • @aliphian
    @aliphian ปีที่แล้ว

    Great channel!

  • @rafael_tg
    @rafael_tg ปีที่แล้ว

    Very nice video. Have you tried to use function calling in GPT instead of asking it to return a string json ?

  • @HarshVerma-xs6ux
    @HarshVerma-xs6ux ปีที่แล้ว +1

    Hey Jason, your content is really amazing. Thanks for creating AI related content. I wanted to ask if there's any advantage of saving the image in jpeg format before extracting text because if there's no actual advantage the same can be done with just 3 lines of code which also makes the process faster.
    def parse_pdf(file_path, scale=300/72):
    pdf_file = pdfium.PdfDocument(file_path)
    renderer = pdf_file.render(
    pdfium.PdfBitmap.to_pil,
    scale=scale
    )
    return "
    ".join(image_to_string(img) for img in renderer)

    • @gonorrex_571
      @gonorrex_571 ปีที่แล้ว

      Hey, you seem to understand the field. Looking to launch this idea into the market? Sales guy here looking for a tech cofounder. Cheers!

  • @齐洋-o2s
    @齐洋-o2s ปีที่แล้ว +1

    AI Jason is a must watch, now I wanna make a copycat of him on Chinese web, what about NewAI Jason for my channel 👨🏿‍🔧👨🏿‍🔧👨🏿‍🔧

  • @MANISHPANDEY-q2m
    @MANISHPANDEY-q2m 9 วันที่ผ่านมา

    a life saver man!!

  • @cjbobby
    @cjbobby ปีที่แล้ว +2

    The github link seems to be broken. Could repost the link pls? :)

    • @jmanhype1
      @jmanhype1 ปีที่แล้ว

      he took it down looks like he will be turning it into a micro service

    • @AIJasonZ
      @AIJasonZ  ปีที่แล้ว

      Sorry forgot to set it public, just updated it! github.com/JayZeeDesign/gpt-data-extraction

    • @AIJasonZ
      @AIJasonZ  ปีที่แล้ว

      @@jmanhype1 Sorry forgot to set it public, just updated it! github.com/JayZeeDesign/gpt-data-extraction

  • @ExtK-l7zV
    @ExtK-l7zV ปีที่แล้ว

    Why did you use a simple langchain prompt template instead of using openai’ s function api to get the structured data?

  • @Ascended23
    @Ascended23 ปีที่แล้ว

    Given the thumbnail I have to ask... when do we get the Hot Dog or Not Hot Dog App?

  • @hwzforumsg
    @hwzforumsg 8 หลายเดือนก่อน

    With function calling, is it more convenient for LLMs to extract structured data?

  • @DePhpBug
    @DePhpBug ปีที่แล้ว

    I like the approach above here , as I require to do alot of admin work as well.
    Was wondering is there a way to protect your data ? Bit concern with data privacy!! T.T

  • @faridmohdismail31
    @faridmohdismail31 ปีที่แล้ว

    i was thinking of using this to just extract text from PDF if its better then langchain for embedding, i guess your example is good for forms and invoices, but for instructional document or PDF of wikipedia, the tesseract dont handle some data that well.
    but still its a very good guide.. thx for sharing

  • @autoboto
    @autoboto ปีที่แล้ว

    Surprised could not access the pdf object model to get text from the pages. . But yes tessaract does work well

  • @lukaszl9542
    @lukaszl9542 ปีที่แล้ว

    And are those language model libraries available in Python? You said you Will explain it later in the video but i think you didnt

  • @learningstuff5679
    @learningstuff5679 8 หลายเดือนก่อน

    Awesome. Jason do you offer 1-on-1 consulting?

  • @temirzhanyussupov6997
    @temirzhanyussupov6997 ปีที่แล้ว

    Would not function calling be more appropriate for formatting invoice data into a JSON format you need?

  • @adolphododo
    @adolphododo ปีที่แล้ว

    If the PDF has many pages (for example, a contract), do I need to go through the process of splitting it into smaller chunks, or can I simply insert any PDF, regardless of the text size?

    • @AIJasonZ
      @AIJasonZ  ปีที่แล้ว

      the function auto split them into pages!

  • @staceyjo1752
    @staceyjo1752 ปีที่แล้ว

    when the invoice has subtotal with an indented item, it gets read as duplicate item (as pytesseract doesn't recognized indent) and therefore, the total doesn't match the invoice total... do you have any suggestions for this kid of error?

  • @khirtah
    @khirtah ปีที่แล้ว +1

    This is a great as you.

  • @the.edgemedia
    @the.edgemedia ปีที่แล้ว

    But ocr is built already right? why cant we directly use that

  • @amandamate9117
    @amandamate9117 ปีที่แล้ว

    the bottleneck is: no company want to send private highly sensitive data as cleartext to openAIs chatGPT to process. Not in USA, not in Europe.

    • @AIJasonZ
      @AIJasonZ  ปีที่แล้ว +4

      yea you are right; Im making a new video about how companies can handle data privacy soon, so hopefully it can address that :) But in general, host private cloud, or using opensource LLM should solve that

    • @krasimirivanov6627
      @krasimirivanov6627 ปีที่แล้ว

      Looking forward to this video!

  • @jamesxprosper
    @jamesxprosper ปีที่แล้ว

    Im getting an an error that says Import "dotenv" could not be resolved Pylance (reportMissingImports) [Ln 4_ Col 6], what am I doing wrong?

  • @niharikasingh2541
    @niharikasingh2541 ปีที่แล้ว

    Why are we converting pdf to image instead u can use any python Library to get text from pdf

  • @digital4smallbusiness
    @digital4smallbusiness ปีที่แล้ว

    Hey Jason, this is great! But can you Llama2 to achieve the same?

  • @rverm1000
    @rverm1000 ปีที่แล้ว +1

    wow the python coding tutorials keep getting more and more complicated lately thats good.

  • @tapos999
    @tapos999 ปีที่แล้ว

    not clear yet, what are the output difference from pypdf/langchain pdf to pdf->img->text? do the later one, keep some structure of the info in certain way or what's good/bad from these 2 approach?

    • @AIJasonZ
      @AIJasonZ  ปีที่แล้ว

      When I tried pypdf/langchain unstructured file upload, it only extract like 10~20% of the text from img, so almost unusable

  • @ayusharora2019
    @ayusharora2019 ปีที่แล้ว

    tons of companies have been doing this with OCR. I don't know what are you saying!!

  • @photon2724
    @photon2724 ปีที่แล้ว

    Hi Jason! thanks for the great video. looks like your github link is broken. would love an updated link to access the code!

    • @AIJasonZ
      @AIJasonZ  ปีที่แล้ว +2

      Sorry forgot to set it public, just updated it! github.com/JayZeeDesign/gpt-data-extraction

  • @howtowithtt
    @howtowithtt ปีที่แล้ว

    Hey everyone, im pretty new to all of this. im the type to just dive in and do, i keep getting this error after i pip install anything "is not recognized as the name of a cmdlet, function, script file, or operable program." any help?

  • @markdin2988
    @markdin2988 9 หลายเดือนก่อน

    How does GPT4 vision affect this ? better or worse?

  • @Nurof3n_
    @Nurof3n_ ปีที่แล้ว +1

    hey, github link doesn't work :(

    • @AIJasonZ
      @AIJasonZ  ปีที่แล้ว +1

      Sorry forgot to set it public, just updated it! github.com/JayZeeDesign/gpt-data-extraction

    • @Nurof3n_
      @Nurof3n_ ปีที่แล้ว

      @@AIJasonZ Thanks!

  • @ivanlee7450
    @ivanlee7450 ปีที่แล้ว

    Can you do an assist filling form using langchain tutorial?

  • @user-wr4yl7tx3w
    @user-wr4yl7tx3w ปีที่แล้ว

    would using OpenAI's function calling be useful here?

    • @AIJasonZ
      @AIJasonZ  ปีที่แล้ว

      You can try function calling for data extraction for sure! but still need a way to turn PDF text well first

  • @jreamer0
    @jreamer0 ปีที่แล้ว

    how do I get the file_url to be passed from make to relevance?

  • @DIY_Foodie
    @DIY_Foodie ปีที่แล้ว

    please attach link to medium article

  • @ambitious_nutritious_delicious
    @ambitious_nutritious_delicious 11 หลายเดือนก่อน

    Красавчик

  • @SnapFacts-h3z
    @SnapFacts-h3z ปีที่แล้ว

    Vid content aside,你的声音jimmy o yang是真的很像哈哈哈哈哈

  • @juancasas5532
    @juancasas5532 ปีที่แล้ว

    Jason for presiden 2024

  • @iamseth5253
    @iamseth5253 ปีที่แล้ว

    Each time he says pdffiles 👀

  • @MrBou.
    @MrBou. ปีที่แล้ว

    im a marketer, i just don't understand the whole coding part, it's like chinese for me.

  • @ammadali5799
    @ammadali5799 ปีที่แล้ว +1

    This is nice. maybe deploying these models on MS Azure so we can have their API?
    and for the next video try making a simple streamlit app with that API
    Really appreciate the work you are doing. Thank you very much

  • @saadkassim9729
    @saadkassim9729 7 หลายเดือนก่อน

    Can you do the all SLOWLY.. Again I COULDN'T FOLLOW YOU 😮😮😮😮

  • @gonorrex_571
    @gonorrex_571 ปีที่แล้ว +1

    Anyone with tech background wanna work on this? I'm looking to launch a SaaS company and I have more than 10 years in Sales working on B2B Finance. Reply here and I will get in touch!

  • @napent
    @napent ปีที่แล้ว

    Use new Microsoft office features xs

  • @Supasweet95
    @Supasweet95 ปีที่แล้ว

    What about safety concerns regarding data? Anyway to overcome this? Good video.

  • @udaynj
    @udaynj ปีที่แล้ว

    What you call a boring data entry job feels millions of families around the world where the bread "earner" has no better skills. I find the attitude of CS and esp AI folks distasteful. You guys are so flippant about the destruction of families and communities caused by AI taking over jobs. There will be a day of reckoning I am afraid when the world turns against CS folks. Please watch your language leave the commentary out....

  • @_derive
    @_derive ปีที่แล้ว +1

    Thanks!