Text Mining with R - Part 1 | Importing PDF and Text Detection

แชร์
ฝัง
  • เผยแพร่เมื่อ 7 พ.ย. 2024

ความคิดเห็น • 20

  • @videorprologic
    @videorprologic 2 ปีที่แล้ว +1

    You do a fantastic job keeping this simple, focused and clear. Your presenting skills are impressive

  • @田耕銘
    @田耕銘 ปีที่แล้ว

    Thank you soooooooo much! It works perfectly! Even having around 8k files, R finished the process in less than 20 minutes. I can finally understand after watching your part 2 (The video for AI.) Thanks again!

  • @liancheefoong4309
    @liancheefoong4309 3 ปีที่แล้ว +2

    It's awesome to know that R could do this!

  • @TillThielmann
    @TillThielmann 7 หลายเดือนก่อน

    Hey, thanks for this good instruction. I tried your code but it will only search for my first word? How can I analyze more than one word at the time?

  • @agustincsn
    @agustincsn 2 ปีที่แล้ว

    Thank you very much for this tutorial, really helpful. I am looking forward to Part 2. Thanks

    • @LiquidBrain
      @LiquidBrain  2 ปีที่แล้ว

      You can find part 2 on the channel as " Text mining with R -Part 2"

  • @busyshah
    @busyshah 2 ปีที่แล้ว

    Such a cool stuff.
    I am in HR recruitment and visualizing a use case for screening various CVs for given key words (in job description).
    Let me see how it turns out to be (high hopes).
    Thank you for this.

  • @nida.shamim
    @nida.shamim ปีที่แล้ว

    This is really helpful, however, when I execute the code, it gives me numbers from 1-60 and not the actual word count for the keywords. What to do if I want the word count for keywords from PDFs in a table?

  • @MrKevinsux
    @MrKevinsux ปีที่แล้ว

    My word count is coming up as a some across all papers. Is there a way to modify the code so that I can just see how many times a keyword appears in each of the papers individually?

    • @nida.shamim
      @nida.shamim ปีที่แล้ว

      Hello, did you get the answer for your query from somewhere? Actually, I am looking for the same thing.

  • @vilmantaspedisius5783
    @vilmantaspedisius5783 ปีที่แล้ว

    What's an alternative for txt files rather than pdf ones?

  • @AdOrient
    @AdOrient 2 ปีที่แล้ว

    Would you be able to make a video on how create a similar model to extract information from a large number of PDF files, and export in an excel spreadsheet?

    • @LiquidBrain
      @LiquidBrain  2 ปีที่แล้ว

      Oh I have done something similiar on part 2 oninary classification on journals. However, if you already had an rough idea on the contents on the papers, you can simply setup a matrix and use certain clustering algorithm to seprate them instead of a neural network.
      th-cam.com/video/GihOdZUkH1Y/w-d-xo.html

  • @uzmaashraf5456
    @uzmaashraf5456 2 ปีที่แล้ว

    Hi, I am facing the following error while loading many pdf files in R.PDF
    error: Invalid shared object hint table offset
    PDF error (5393291): insufficient arguments for Marked Content
    PDF error (5393300): insufficient arguments for Marked Content
    PDF error: Invalid shared object hint table offset
    PDF error: Invalid Font Weight
    Please can you help me.

  • @ehecatl3830
    @ehecatl3830 2 ปีที่แล้ว

    Very good thanks

  • @jakoborellieandreasen674
    @jakoborellieandreasen674 2 ปีที่แล้ว

    Is there a part two to this?

    • @LiquidBrain
      @LiquidBrain  2 ปีที่แล้ว

      Ya, it's call text mining part 2 on the channel

  • @abogadorobot6094
    @abogadorobot6094 2 ปีที่แล้ว

    Thank you!

  • @kevinstevenmojicamunoz9629
    @kevinstevenmojicamunoz9629 2 ปีที่แล้ว

    The Github script is not available :(
    Could you please upload it?

    • @LiquidBrain
      @LiquidBrain  2 ปีที่แล้ว

      hmm, that's funny, I have updated the link. if not see if you can access it here (github.com/brandonyph/Text-Mining-With-R)