ScrapeGraphAI - REVOLUTION in WEB SCRAPING!!!

แชร์
ฝัง
  • เผยแพร่เมื่อ 4 ม.ค. 2025

ความคิดเห็น • 35

  • @ShihChiehLee
    @ShihChiehLee 4 หลายเดือนก่อน +1

    can you do a walkthrough to read a given url of news web page to come up with context of the news and come up with a summary through LLM, ideally using Ollama

  • @evolvingtech1
    @evolvingtech1 วันที่ผ่านมา

    Thank you for this video/post. I am trying to use a locally running llama model (lama3.3) running under Ollama. Running the python code complains that the model is not found, but continues to parse and summarize the articles. Can you suggest how to work with locally installed models like those that are Ollama based?

    • @TomsTechAcademy
      @TomsTechAcademy  3 ชั่วโมงที่ผ่านมา

      Hi, perhaps this article can be of help: www.scrapingbee.com/blog/scrapegraph-ai-tutorial-scrape-websites-easily-with-llama-ai/
      Let me know whether you were able to solve your issue :)

  • @MrMoonsilver
    @MrMoonsilver 7 หลายเดือนก่อน +5

    It extracted the titles, not the articles themselves.

    • @raunaqss
      @raunaqss 7 หลายเดือนก่อน

      Thanks for writing this

    • @ShihChiehLee
      @ShihChiehLee 4 หลายเดือนก่อน

      can you do a walkthrough to read a given url of news web page to come up with context of the news and come up with a summary through LLM, ideally using Ollama

  • @iamrrricardo
    @iamrrricardo 6 หลายเดือนก่อน

    Thank you so much for showing the demo! I still have a question:does scrapegraphai support full stack crawling? or it only works for one page

  • @userou-ig1ze
    @userou-ig1ze 7 หลายเดือนก่อน

    Thanks for demo-ing the tool

    • @TomsTechAcademy
      @TomsTechAcademy  7 หลายเดือนก่อน

      Thank you for your comment, much apreciated!

  • @Bbyanar
    @Bbyanar 5 หลายเดือนก่อน +1

    Great video thx !

  • @amandamate9117
    @amandamate9117 7 หลายเดือนก่อน

    this video tutorial was suprising beginner friendly

    • @TomsTechAcademy
      @TomsTechAcademy  7 หลายเดือนก่อน

      Thank you for your comment Amanda, and glad my video was useful for you :) I always try to make my videos as beginner friendly as possible

  • @rubenagurcia906
    @rubenagurcia906 หลายเดือนก่อน

    ModuleNotFoundError: No module named 'fp'

  • @aleksdeveloper698
    @aleksdeveloper698 4 หลายเดือนก่อน

    Can you scrape data from Google maps?

  • @DuhaMasoodi
    @DuhaMasoodi 6 หลายเดือนก่อน

    When i am trying this i get the error asyncio error cannot be called

    • @TomsTechAcademy
      @TomsTechAcademy  6 หลายเดือนก่อน

      Hi, thanks for your comment. I have seen this error before, is it possible you're running Python 3.12? If so, try running it in a venv with 3.11. Good luck! If you need more help, you can reach me on my Discord (link in description).

  • @anishmanandhar1203
    @anishmanandhar1203 7 หลายเดือนก่อน

    whenever I install scrapegraphai I am hit by AttributeError: module 'pkgutil' has no attribute 'ImpImporter'. Did you mean: 'zipimporter'?
    [end of output]

    • @TomsTechAcademy
      @TomsTechAcademy  7 หลายเดือนก่อน +1

      Hi, I think I had this exact same error - are you running Python in a virtual environment? If you run Python 3.12 you can try switching to 3.11

  • @s6yx
    @s6yx 6 หลายเดือนก่อน

    I just give chat gpt the html data as example. Give it what I want scraped and it’ll generate the code for me

    • @TomsTechAcademy
      @TomsTechAcademy  6 หลายเดือนก่อน

      Thanks for Sharing your expertise, that’s another way of doing it 😀

  • @SynergyOfTwo
    @SynergyOfTwo 6 หลายเดือนก่อน +2

    I found ScrapeGraphAI pretty useless, it doesn't except for the simplistic examples.

    • @TomsTechAcademy
      @TomsTechAcademy  6 หลายเดือนก่อน +1

      It’s a great tool for people without Python / BeautifulSoup skills. For most cases which are a bit more complex, the conventional approach is probably the best ;)

  • @videosthatinspireyou
    @videosthatinspireyou 18 วันที่ผ่านมา

    No module named 'scrapergraphai'

    • @TomsTechAcademy
      @TomsTechAcademy  12 วันที่ผ่านมา

      Hi, the name of the module is scrapegraphai (not scrapeRgraphai)

  • @zemmouripopodori
    @zemmouripopodori 6 หลายเดือนก่อน +1

    i get api error

    • @TomsTechAcademy
      @TomsTechAcademy  6 หลายเดือนก่อน

      If you receive any error, feel free to join my discord (link in description) and post your question there 😀

  • @256Services
    @256Services 4 หลายเดือนก่อน

    Video could have been more helpful if it was about scraping helpful marketing information like emails and phone numbers

    • @TomsTechAcademy
      @TomsTechAcademy  4 หลายเดือนก่อน

      Thanks for your comment. It’s impossible for me to make a video about every possible use case. Instead I try to make videos which can be easily applied to other scenarios. In this specific case, it’s just a matter of modifying the prompt and the url and you’re good to go 😉

  • @metazoo4407
    @metazoo4407 7 หลายเดือนก่อน

    I keep getting this error "PermissionError: [Errno 13] Permission denied: 'wired.xlsx'" and I've even typed the code shown in your video exactly and still I get that error, any help would be greatly appreciated

    • @TomsTechAcademy
      @TomsTechAcademy  7 หลายเดือนก่อน +2

      Hi Meta, it looks like you're trying to write a file to a directory where you don't have enough permissions. Assuming you're using Windows, can you try the following?
      Navigate to your Desktop, right click on any of your files there. Click properties. Go to the tab "general" and read location. This should be something like: "C:\Users\thoma\Desktop" (where thoma is your username). Copy this location to VS Code. Now add \wired.xlsx and change all slashes for double slashes (you should now have something like this:
      df.to_excel("C:\\Users\\thoma\\Desktop\\wired.xlsx",index=False)
      Run the code again. Let me know whether this works for you. If not, feel free to join my Discord if you need more help (link in video description).

    • @metazoo4407
      @metazoo4407 7 หลายเดือนก่อน

      @@TomsTechAcademy Hi, I managed to get it to produce two results and now I'm having issues again, It is now giving me this message "Traceback (most recent call last):
      File "c:\scrapperai\main.py", line 26, in
      df = pd.DataFrame(result['pages'])
      ~~~~~~^^^^^^^^^" any help would be greatly appreciated!

  • @videosthatinspireyou
    @videosthatinspireyou 18 วันที่ผ่านมา

    You should take this down.... Their module cannot be imported... As they now charge

    • @TomsTechAcademy
      @TomsTechAcademy  12 วันที่ผ่านมา

      Hi, I just successfully installed the module. Please pip install scrapegraphai, and not scrapeRgraphai as mentioned in your other comment. This is the reason your import doesn't work