Selenium Web Scraping is too Slow. Try This.

แชร์
ฝัง
  • เผยแพร่เมื่อ 27 ธ.ค. 2024

ความคิดเห็น • 42

  • @parisneto
    @parisneto หลายเดือนก่อน +9

    I have been scraping for 6 months and since TH-cam help me find you ( I was not searching ! ) I learned so much and evolved my work in the past 2 weeks that seems like years of experience shared. Thanks for the generosity and dedication.

    • @ibekwevictor1158
      @ibekwevictor1158 หลายเดือนก่อน

      What kind of jobs can you get as a scraper sir?

  • @quintinsweeney2927
    @quintinsweeney2927 8 วันที่ผ่านมา

    I am currently working on a similar project and this tutorial has helped me so much! Quick question, if I am interested in gathering the data from a grid similarly to this, is it necessary to open all the links to the items? I want to scrape the price, item name, category etc. and that can be found directly on the grid. Would the downside be that you won't have access to the data in JSON format?

  • @DanKayser
    @DanKayser หลายเดือนก่อน +5

    Cool! I've always scraped data on a single driver and yeah the process turns slow quickly... This is awesome, but I'm not very familiarized with async/await going to do my research!
    Thanks!

  • @admaxcool
    @admaxcool หลายเดือนก่อน

    is there a keyword utility library on top of selenium like SeleniumBase but without the recorder or demo modes (a slimmed out version with handy utilities)

  • @vladgri7754
    @vladgri7754 หลายเดือนก่อน +2

    You always bring fresh ideas!

  • @GouravMittal-fi4vg
    @GouravMittal-fi4vg 5 วันที่ผ่านมา

    Hey
    Thanks for the video and guidance.
    Can we do with the same dynamic website for live sports data which is updating every second

  • @ebukaume
    @ebukaume หลายเดือนก่อน

    Nice approach! What do you think about using a Semaphore instead of a temporal rate limiter?

  • @switch8291
    @switch8291 หลายเดือนก่อน +1

    Hey john
    We've seen how you use a list of selectors in json file to scrape multiple website
    Is there a library to auto get selectors or this part is manual on each website, or instead is there a way to automate it using json schemas of each website
    Stay Golden!

    • @voinywolnyprod3046
      @voinywolnyprod3046 หลายเดือนก่อน

      You can automate it yourself collecting selectors individually (in set for example) for each website because each website will have different selectors

  • @stephena8965
    @stephena8965 หลายเดือนก่อน

    Great tutorial as always! How do you add headers to the request? The docs are incomplete and I've never used selenium so it's a little confusing with network interceptions. The reason I'm asking is because the page isn't loading fully, it always timesout after loading the navbar, and I think it's because I need some headers or maybe it's because I'm not using any proxies. Thanks in advance! I'll post my solution if I find one

  • @viratchoudhary6827
    @viratchoudhary6827 หลายเดือนก่อน

    hi john, My question is whether using a self-hosted proxy with multiple ports necessitates a Proxyscrape subscription. eg localhost:2001, *:2002, *:2003

  • @adhamkhaled8687
    @adhamkhaled8687 หลายเดือนก่อน +1

    What is better to use seleniumbase or driverless ?

    • @JohnWatsonRooney
      @JohnWatsonRooney  หลายเดือนก่อน +1

      Both seem to work well but I haven’t used either enough to say, right now I am using selenium driverless more

  • @hamed6899
    @hamed6899 หลายเดือนก่อน +1

    Great video, but please make a video about how to find hidden api

  • @StanHordon
    @StanHordon หลายเดือนก่อน

    amazing for beginer in scrapping your video is life saver.thank you

  • @adhishtanaka
    @adhishtanaka หลายเดือนก่อน

    what linux you use?can you make a video about webscrapper pc setup that explain os and tools,ide that you use day to day life in current moment

    • @wiresploit
      @wiresploit หลายเดือนก่อน +1

      He uses Fedora Linux with i3wm (tiling window manager)

  • @attifbhuttoa3384
    @attifbhuttoa3384 หลายเดือนก่อน +1

    I love your video's. They are amazing 💪

  • @MyKnighty
    @MyKnighty หลายเดือนก่อน +1

    Hey John .. Can i run in headless mode

  • @lunaticberserker5869
    @lunaticberserker5869 หลายเดือนก่อน +1

    NoDriver lacks documenation and methods that undetected-chromedriver used to have. but it's more faster than its predecessor.

    • @bill8126
      @bill8126 หลายเดือนก่อน

      its buggy

  • @towhidurrahman8961
    @towhidurrahman8961 หลายเดือนก่อน +1

    Make a video on selenium grid.

  • @atulraaazzz2931
    @atulraaazzz2931 หลายเดือนก่อน

    Can you share the base of code

  • @Volkskomissar
    @Volkskomissar หลายเดือนก่อน +7

    wow never seen a 22 Minutes AD/ Commercial dsiguised as a tutorial,
    + before

    • @truthwillout1980
      @truthwillout1980 หลายเดือนก่อน

      You haven't? That's every video.

    • @kareemyoussef2304
      @kareemyoussef2304 หลายเดือนก่อน +2

      isnt this entire channel now? everytime I see his title saying "something is too slow/detectable try this instead" i know its a 20 min waste of time

    • @genericname1296
      @genericname1296 หลายเดือนก่อน

      I don't know what you are so upset by here. He clearly states in the description who he is affiliated with and spends the entire video teaching you how to do what his title says. No where does he say only one proxy service can do this.

  • @zaidyousaf4449
    @zaidyousaf4449 หลายเดือนก่อน

    make a video on some online tools for scraping like phantom buster. Like how do they do it, because they also go for platforms like linkedin where login is needed plus the js rendering is very much involved. How do they it from their cloud services. i want to know the technique so that we people can also replicate such things at least of 10 % of theirs instead of just using selenium, scrapy or puppeteer.

  • @saadkhan883
    @saadkhan883 หลายเดือนก่อน

    Can you share a code with us ?

  • @Aidas_Li
    @Aidas_Li หลายเดือนก่อน +1

    Nice video.

  • @bakasenpaidesu
    @bakasenpaidesu หลายเดือนก่อน +2

    .

  • @bluekeybo
    @bluekeybo หลายเดือนก่อน +3

    It's very distracting to watch so many typos as you type, and deleting/correcting them. Not sure what's a way to fix that? Maybe showing the code chunks already typed and explaining them instead of typing? Thank you!

    • @JohnWatsonRooney
      @JohnWatsonRooney  หลายเดือนก่อน +3

      I understand and that’s come up before, I can copy/paste chunks which I have done in the recent past but I walked to show my working how I got there etc. I should just practice typing more…

    • @ScalpersInc
      @ScalpersInc หลายเดือนก่อน +6

      @@JohnWatsonRooneyno ur good brother

    • @profbiyi
      @profbiyi หลายเดือนก่อน

      ​@@JohnWatsonRooneyyou are doing an excellent job. Thanks so much

    • @cybern9ne
      @cybern9ne หลายเดือนก่อน

      then don't watch the channel. you just want to copy and not learn.

    • @carlos-ferreira
      @carlos-ferreira หลายเดือนก่อน +1

      ​@JohnWatsonRooney I disagree with the person above. I actually like that you're coding while recording and showing how you actually work.