Scrape Any Website for FREE Using DeepSeek & Crawl4AI

แชร์
ฝัง
  • เผยแพร่เมื่อ 8 ก.พ. 2025

ความคิดเห็น • 131

  • @satvalite1854
    @satvalite1854 3 วันที่ผ่านมา +10

    man this is how tutorials need to be
    A real use case which is something different from the examples already being used in the Documentation
    this is so much better compared to all other yt channels which simply do whats already mentioned in the docs and adding no value at all

  • @Changheelee-zn2vj
    @Changheelee-zn2vj 2 วันที่ผ่านมา +1

    This is a quality level of class... I wish I knew this year ago....

  • @mbottambotta
    @mbottambotta 3 วันที่ผ่านมา

    Love this video-thanks Brandon for explaining the topic so clearly. Liked, subscribed and joined your Skool community.

  • @dinnupasala-po4yr
    @dinnupasala-po4yr 12 ชั่วโมงที่ผ่านมา +1

    Very good job 👏
    It would be nice to extract the contact information of the venue and email them automatically with a predefined email template for photography business to seek new clients. 😊

  • @Cynthia-cw6kd
    @Cynthia-cw6kd 2 วันที่ผ่านมา

    Enjoyed this! Thanks for the source code too Brandon!

  • @HarshKumar-j2r5g
    @HarshKumar-j2r5g 5 วันที่ผ่านมา

    was searching for this from a week thanks man

  • @abbcc555
    @abbcc555 5 วันที่ผ่านมา +11

    It''s the css_selector which is typically the problematic part. I was kinda hoping this crawler would use AI to somehow magically deal with randomized class names etc. edit: I know there are things like scrapy shell but it's a bit tedious.

    • @rapcod-n8c
      @rapcod-n8c 4 วันที่ผ่านมา

      yeah thats the main problem to solve! if llm can automatically pick the css selector from the prompt then it will be very easy otherwise! we can just use puppeter and other crawling libraries i dont see much difference

    • @jrosmail
      @jrosmail 4 วันที่ผ่านมา +2

      Why to use a llm when the information it’s perfectly structured on the site… probably tagged ln the css so you can assign location phone etc directly to the database. Don’t gent why to use LLM

    • @MarvijoSoftware
      @MarvijoSoftware 4 วันที่ผ่านมา

      @@jrosmail The site's source can change, then your scraping doesn't work anymore

  • @alsaher309
    @alsaher309 วันที่ผ่านมา +1

    Thank you, Guys he give you the first step on the road use ur brain to make that advanced specially with so much Free Advanced AIs.
    Whish you all the best

    • @bhancock_ai
      @bhancock_ai  วันที่ผ่านมา

      Thank you for all the support 😁

  • @RomuloMagalhaesAutoTOPO
    @RomuloMagalhaesAutoTOPO 2 วันที่ผ่านมา +1

    Thank you very much.

  • @rcbrush99
    @rcbrush99 5 วันที่ผ่านมา +7

    You are an excellent teacher with a fabulous talent for clarity. Nice job!

    • @bhancock_ai
      @bhancock_ai  5 วันที่ผ่านมา +3

      Thank you so much! This made my day 😁

    • @ripaire
      @ripaire 5 วันที่ผ่านมา

      Believe me, you are one of the best teachers who made complex things look like nothing .Thanks, brother ​@bhancock_ai

  • @FUNTasticFlutter
    @FUNTasticFlutter 4 วันที่ผ่านมา

    this very video has earned you my subscription dude....... Good job ... I'm subscribed now

  • @shaambhavshankar8782
    @shaambhavshankar8782 4 วันที่ผ่านมา

    loved the explanation. You deserve a sub

  • @LiamCook-b1b
    @LiamCook-b1b 5 วันที่ผ่านมา +23

    I really appreciate the detailed steps in this tutorial. Would love to see a comparison between these methods and HasData's approach to scraping complex web data. Any suggestions?

  • @baltin80
    @baltin80 4 วันที่ผ่านมา

    This is a great ! thanks for sharing mate

  • @nusenyaw
    @nusenyaw 4 วันที่ผ่านมา

    Thanks! was just about get into Scrapy.

  • @jimmysaxblack
    @jimmysaxblack 4 วันที่ผ่านมา

    amazing thanks

  • @Ray_eddi
    @Ray_eddi วันที่ผ่านมา +1

    Absolute Gold!

  • @jrcoimbra
    @jrcoimbra 5 วันที่ผ่านมา

    Amazing content as usual!

  • @TheGenerationGapPodcast
    @TheGenerationGapPodcast 18 ชั่วโมงที่ผ่านมา

    Nice Marketing for Crawl AI

  • @eraclee
    @eraclee 5 วันที่ผ่านมา +32

    Expensive way of doing scraping

    • @deephousemorocco4802
      @deephousemorocco4802 4 วันที่ผ่านมา +8

      care to share a cheaper way please?

    • @graczew
      @graczew 4 วันที่ผ่านมา

      ​@@deephousemorocco4802I believe scrapy will do this faster and cheaper. But you need to play a bit more with selectors and data cleaning.

    • @dolboeb-tz4bw
      @dolboeb-tz4bw 4 วันที่ผ่านมา

      And slow

    • @eraclee
      @eraclee 4 วันที่ผ่านมา

      @@deephousemorocco4802 Just classical HTML Dom parsing. Using CSS selectors or xpath. The field of scraping wasn't invented with LLMs

    • @WeLoveWave
      @WeLoveWave 4 วันที่ผ่านมา

      @@deephousemorocco4802 Use something like Scrapewell - just make API calls for the pages and parse with Python.

  • @UNCPHIL
    @UNCPHIL 8 ชั่วโมงที่ผ่านมา

    is this also able to scrape the images with it? if so, what lines do we need to edit?

  • @darshantank554
    @darshantank554 3 ชั่วโมงที่ผ่านมา

    Is there anyway to automatically pass that css component to make it more dynamic?!

  • @50PlusLife
    @50PlusLife 2 ชั่วโมงที่ผ่านมา

    Way outside of my wheelhouse ! but learning..

  • @aderitocruz6054
    @aderitocruz6054 2 วันที่ผ่านมา +1

    Very nice and clear explain about AI Crawler. Where can I get the code to study on it a lit bit more?

    • @bhancock_ai
      @bhancock_ai  วันที่ผ่านมา

      It's in the first link in the description 😁

  • @ricardosnotes
    @ricardosnotes 4 วันที่ผ่านมา

    Excellent video, what software do you use to record your video tutorials?

  • @alexandraspalato
    @alexandraspalato 6 วันที่ผ่านมา

    Just what I need!

  • @ahmedhatem7566
    @ahmedhatem7566 วันที่ผ่านมา +1

    Is the code from the video a generic code that can serve as a multi-purpose tool for any king of scraping?? or is it just specific for the shown use case in this video??

    • @bhancock_ai
      @bhancock_ai  วันที่ผ่านมา +1

      You can easily adjust the code to work with different websites.
      If I was you, I'd throw the code into cursor and ask it to make tweaks based on the next website you want to scrape.

  • @ИванИванов-б8у4и
    @ИванИванов-б8у4и 5 วันที่ผ่านมา +4

    А зачем нейросеть для такого простого парсинга данных?Это делается питоном без проблем. Покажите как работать с файлами где есть скрытие формы, подгрузка с аяксом по клику и тд.

  • @RajeshSingh-hx2sc
    @RajeshSingh-hx2sc 2 วันที่ผ่านมา +2

    Why I really need to use deepseek here? Isnt it overkill? I mean the webpage is pretty much structured. One can still use python standard libraries to extract the same information right? No need to have powerful processing machine / high computation cost etc, right?

    • @ventricity
      @ventricity วันที่ผ่านมา

      it's a silly example because it's the 'perfect' site to scrape. too easy. in fact this site is used in other youtube web scraping clickbait videos. who hires a programmer for scraping wedding sites antway?

  • @yannyimo
    @yannyimo วันที่ผ่านมา +1

    Crawl4AI is great, I installed it, but I’ve never been able to get it fully working with the API and custom parameters, did you try with an api?

    • @bhancock_ai
      @bhancock_ai  วันที่ผ่านมา

      I've never tried the API either.
      I know a ton of developers also like browser base so I recommend trying that one too if you're not a fan of Crawl4AI

  • @VenkataSaripalli
    @VenkataSaripalli 3 วันที่ผ่านมา

    excellent presentation, but need to watch the tutorial and read understand the docs of GroqCloud, Crawl4AI, and phidata

  • @nagay189
    @nagay189 4 วันที่ผ่านมา +1

    Good tutorial. QQ:
    Can this not be done just by python selenium ? What are we gaining using Deepseek & Crawl4AI ?

  • @VaibhavShewale
    @VaibhavShewale 4 วันที่ผ่านมา +1

    looks interesting

  • @johndoeisyourfriend
    @johndoeisyourfriend 3 วันที่ผ่านมา

    I think it is useful for scraping the image for products in my webshop, right?

  • @stevencrobertson
    @stevencrobertson 3 วันที่ผ่านมา

    This looks great for websites that are directories, but what about a typical company website. Will this technique be effective at scraping all the pages of the website look for company data like locations, phone numbers, etc. and contact info for all partners of a law firm? Currently I use perplexity and n8n to do this?

  • @the-story-reel
    @the-story-reel 2 วันที่ผ่านมา +2

    i cant do anything in deepseek it always give me this error: The server is busy. Please try again later.

    • @bhancock_ai
      @bhancock_ai  2 วันที่ผ่านมา +3

      Try getting an openrouter key. That’s how I had to work around that issue

  • @Headownfocus
    @Headownfocus 5 วันที่ผ่านมา

    Good please recommended more

  • @necuspam
    @necuspam 2 วันที่ผ่านมา

    how it acts against "robots" file that prevents scraping at many websites?

  • @oakgnarl5021
    @oakgnarl5021 5 วันที่ผ่านมา +3

    Why just do things, when you can go off and do them?

  • @v.t.photography1590
    @v.t.photography1590 4 วันที่ผ่านมา

    Can you please advise if what you show in the video violates T&Cs of the websites that are being scraped? As far as I know any automated data scraping usually violates T&Cs. Websites retain the right to ban your IP, or take you to court.

  • @XubiVerse
    @XubiVerse 5 วันที่ผ่านมา

    Nice one!! I also tried to make it uncensored on my channel about the chinese conent

  • @alexnaz
    @alexnaz 4 วันที่ผ่านมา

    But y photobro no just go listing site himself? Y he need haz excel sheet ?
    Nicely structured tutorial btw thanks 🙏🏽

  • @KrishnaSingh-rd6pr
    @KrishnaSingh-rd6pr 3 วันที่ผ่านมา

    Can i use it to scrape twitter

  • @TauvicRitter
    @TauvicRitter 5 วันที่ผ่านมา +11

    It's weird to use AI and then it requires to specify a model. AI is intelligent. So why not just say: extract all relevant data and put that into a suitable format to the AI. If you have to specify the format then you could also take the extra steps and extract the data with some xpath statements. That will save the cost of using AI. I guess technology is not that mature yet.

    • @ziach923
      @ziach923 5 วันที่ผ่านมา

      Technology is well mature to scrape dynamically but I did not get why he specified css selectors 🥲

    • @dixalex02
      @dixalex02 5 วันที่ผ่านมา +1

      Schemas, xml, XPath, sitemaps. Rudimentary concepts can propel the industry. At times, the hype of AI drowns out the simple approaches. I agree with you.

    • @KenDores-oy9mc
      @KenDores-oy9mc 5 วันที่ผ่านมา +2

      To sum it up, true intelligence requires understanding of context. It's when Ai can understand context of instruction that it can do so. Truth is it is that smart, but needs u to systematically give it a prompt that can give it context to execute Nd give u the right result or answer

    • @dixalex02
      @dixalex02 5 วันที่ผ่านมา

      @@KenDores-oy9mc "systematically" 100% this. The capability and technology is already there. We (humans), just have to fine tune and harness that power.

    • @Advisory-of5lf
      @Advisory-of5lf 4 วันที่ผ่านมา

      BrowserUse does exactly this. It’s smart enough to understand the page by using vision AI capabilities and decide the next best action

  • @mnageh-bo1mm
    @mnageh-bo1mm 4 วันที่ผ่านมา +1

    is it me or LLM scraping feels like the most inefficient way to do such task?

  • @JoremzaBeauty
    @JoremzaBeauty 4 วันที่ผ่านมา

    Hi! can I scrape whatsapp contacts with status? I mean, I need to know the details of my tab of contacts in order to delete who are not add me as contact. Thank you.

  • @Cynthia-cw6kd
    @Cynthia-cw6kd 2 วันที่ผ่านมา +1

    How much do you charge for the scraping?😊

    • @pfxstar
      @pfxstar 2 วันที่ผ่านมา

      If you need help, please contact me.

  • @shqipko
    @shqipko 4 วันที่ผ่านมา

    can you scrape data from google?

  • @Careerpod
    @Careerpod 5 วันที่ผ่านมา

    Is it possible to scrape LinkedIn profile data

  • @ravi1341975
    @ravi1341975 5 วันที่ผ่านมา

    Wow amazing

  • @VikashKumar-nn9pu
    @VikashKumar-nn9pu วันที่ผ่านมา +1

    ok if i have to type the logic for the info- container extraction myself
    why we are using ai 😂
    or i am thinking ahead of time😅

    • @bhancock_ai
      @bhancock_ai  วันที่ผ่านมา

      No! Great question!
      When I started creating this tutorial, I was focused on running DeepSeek-r1 locally. However, my computer wasn't the strongest so it really struggled with long context window.
      I could have dropped this when I moved over from the local model to the groq model.

  • @rmt3589
    @rmt3589 2 วันที่ผ่านมา +1

    Only 1033 in, but this sounds so cool! My primary scraping goal is to scrape the lds website, and copy all talks from conferences and posts from ensign to three folders: "presidents", "quarum of the 12", and a default folder. I want to do this checking name & date, then checking a list of when they joined the quarem or became president, and sort them appropriately, where quarem includes the presidents, and the default includes them all. (Yes, up to 3 copies) so if Russell Nelson gave a talk when he was only in the 70, it'd go to default only, despite his current position.

  • @vladoesgrowth
    @vladoesgrowth 6 วันที่ผ่านมา

    LGO! Do you know if it works with proxies and authenticated users ?

    • @HarshKumar-j2r5g
      @HarshKumar-j2r5g 5 วันที่ผ่านมา +1

      let me know if u found that

    • @HarshKumar-j2r5g
      @HarshKumar-j2r5g 5 วันที่ผ่านมา +1

      : Enables dynamic proxy rotation to avoid IP bans and enhance security during web crawling. found this

  • @HenryPham92
    @HenryPham92 5 วันที่ผ่านมา +1

    I have an issue when running main.py that always shows 'Invalid API Key,' but I double-checked that my API KEY is correct.

    • @dolarmoro1339
      @dolarmoro1339 3 วันที่ผ่านมา

      That is an error maybe you miss something. Computer always right and it will show an error if something goes wrong. You can judge which are wrong maybe 1. men/you 2. methode 3. tools

    • @twelsh37
      @twelsh37 2 วันที่ผ่านมา

      I get the same. It works fine if I uxse my local ollamd model, the API key also works fine with a curl request. Im currently investigating this as not everyone is affected. What are you runingf on windows or Mac/Linux?

  • @abdullahwaheed601
    @abdullahwaheed601 5 วันที่ผ่านมา

    code repo??

  • @thomasschlitzer
    @thomasschlitzer วันที่ผ่านมา

    I can't see why I'd need an AI for that. This can be done easily with usual tools. And I guess it would even be 10 times the speed. Where is the benefit?

  • @Web.Scraping
    @Web.Scraping 3 วันที่ผ่านมา

    This seems complex to get the data

  • @ran2207
    @ran2207 4 วันที่ผ่านมา

    I use LM Studio instead of Groq

    • @alsaher309
      @alsaher309 วันที่ผ่านมา

      is it paid?

    • @ran2207
      @ran2207 วันที่ผ่านมา +1

      @ it’s free?

    • @alsaher309
      @alsaher309 วันที่ผ่านมา

      @ thanks buddy

  • @foxasdf888
    @foxasdf888 5 วันที่ผ่านมา

    22:06 Why are the prices just a $ sign?

    • @jet3432
      @jet3432 4 วันที่ผ่านมา

      it literally has dollar signs on the listings. you can probably have it go into each one and grab the Starting Price and anything else you specify instead.

  • @sinayagubi8805
    @sinayagubi8805 4 วันที่ผ่านมา +1

    Wow! I look forward to another useful and easy to follow AI project

  • @PabloSubiabre-g4y
    @PabloSubiabre-g4y 5 วันที่ผ่านมา

    Excellent video. I would appreciate it if you could activate the new TH-cam option for automatic AI audio translation in your videos so that I can listen to your videos in Spanish. Thank you.

  • @jnej89
    @jnej89 5 วันที่ผ่านมา +1

    That early return statement is funny 🤣

  • @snehitvaddi
    @snehitvaddi 5 วันที่ผ่านมา

    Can Crawl4ai scrape Reddit?

  • @Kal-el23
    @Kal-el23 4 วันที่ผ่านมา

    This is a nice example, but I guess you don’t really know how photographers and venues operate. Most wedding venues don’t actually have a house hotographer, instead the person getting married hires a photographer, who then just goes to the wedding venue of choice. So I don’t know how valuable it is in your example to give a photographer a list of wedding venue leads.

    • @rayzorr
      @rayzorr 4 วันที่ผ่านมา

      Just a demo, your use case may be different.

  • @AlainProcs
    @AlainProcs 4 วันที่ผ่านมา

    You gained a new sub and I also joined your Skool community. Amazing content! Thank you.

  • @JNET_Reloaded
    @JNET_Reloaded 5 วันที่ผ่านมา

    :/ I cant save your video to a playlist turn off option made for children / kids or w.e this prevents playlisting thats all it does! ty

    • @bhancock_ai
      @bhancock_ai  5 วันที่ผ่านมา

      Hey! All of my videos are set to “not made for children” so I’m not sure why it’s giving you that issue 🤔

  • @jasonowens4510
    @jasonowens4510 4 วันที่ผ่านมา

    Wish this could scrape LinkedIn!

  • @SiebrechtDigital
    @SiebrechtDigital 4 วันที่ผ่านมา +1

    all this and no phone number and email????? like bro...

  • @AutoKeybo
    @AutoKeybo 5 วันที่ผ่านมา

    AutoKeybo runs DeepSeek.

  • @adamgdev
    @adamgdev 5 วันที่ผ่านมา

    Great video per usual!

  • @orlandoagostinho
    @orlandoagostinho 5 วันที่ผ่านมา

    Amazing! You are a rock star!

  • @jjolla6391
    @jjolla6391 4 วันที่ผ่านมา +1

    Groq is not free

    • @wahibonae
      @wahibonae 4 วันที่ผ่านมา

      it is actually free for a limited usage!

  • @subversionz4919
    @subversionz4919 2 วันที่ผ่านมา +1

    Instead of CSV you said SCV... Too much starcraft! lol

    • @bhancock_ai
      @bhancock_ai  2 วันที่ผ่านมา +1

      😂😂😂 I need to stop watching Clem dominate everyone else in SC2! Lol

  • @rogerdfranko7457
    @rogerdfranko7457 4 วันที่ผ่านมา

    Overkill. Ask AI to build a custom system leveraging free tools and plug the code into a VM. You'd be surprised.

  • @CitizenEa
    @CitizenEa 2 วันที่ผ่านมา

    Sorry but you talk too much. You should cut to the chase and start coding immediately. I don't need the preliminary details. I will get it as you code

  • @harimgarcialamont9140
    @harimgarcialamont9140 5 ชั่วโมงที่ผ่านมา

    Si

  • @avataros111
    @avataros111 3 วันที่ผ่านมา

    I just came to unsubscribe. I do this for all finger pointing videos. Follow the follower!

  • @petec737
    @petec737 5 วันที่ผ่านมา +1

    there are already a million scrapping tools outthere that don't need a dozen interconnected services...like seriously

  • @StealthyLlamaBites
    @StealthyLlamaBites วันที่ผ่านมา +1

    enjoy your 20 years in jail

  • @divyv20
    @divyv20 5 วันที่ผ่านมา

    Hey aiwithbrandon , really nice video! I was wondering if I could help you with more Quality Editing in your videos with good pricing & turnaround time and will also make a highly engaging Thumbnail which will help your video to reach a wider audience ! Lmk what you think ?

  • @SjarMenace
    @SjarMenace 5 วันที่ผ่านมา

    Youre handsome bro i think im in love