I have been scraping for 6 months, and since TH-cam helped me find you (I was not searching!) I have learned so much and evolved my work so much in the past 2 weeks that it feels like years of experience shared. Thanks for the generosity and dedication.
What kind of jobs can you get as a scraper sir?
I am currently working on a similar project and this tutorial has helped me so much! Quick question: if I am interested in gathering data from a grid similar to this, is it necessary to open all the links to the items? I want to scrape the price, item name, category, etc., and that can be found directly on the grid. Would the downside be that you won't have access to the data in JSON format?
Cool! I've always scraped data on a single driver and yeah, the process gets slow quickly... This is awesome, but I'm not very familiar with async/await, so I'm going to do my research!
Thanks!
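As a starting point for that async/await research, here is a minimal sketch of fetching several pages concurrently with asyncio and httpx; the URLs and the fetch helper are placeholders, not anything from the video:

```python
import asyncio
import httpx

async def fetch(client: httpx.AsyncClient, url: str) -> str:
    # While this request waits on the network, the other fetches keep running
    resp = await client.get(url)
    resp.raise_for_status()
    return resp.text

async def main() -> None:
    urls = [f"https://example.com/page/{i}" for i in range(1, 6)]  # placeholder URLs
    async with httpx.AsyncClient() as client:
        # gather() schedules every fetch at once instead of one after another
        pages = await asyncio.gather(*(fetch(client, u) for u in urls))
    print(len(pages), "pages downloaded")

asyncio.run(main())
```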
Is there a keyword utility library on top of Selenium like SeleniumBase but without the recorder or demo modes (a slimmed-down version with handy utilities)?
You always bring fresh ideas!
Hey
Thanks for the video and guidance.
Can we do the same with a dynamic website for live sports data that updates every second?
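For what it's worth, one simple way to handle data that refreshes every second is to poll it in a loop; a rough sketch, where the endpoint URL and the JSON shape are made up for illustration:

```python
import time
import requests

URL = "https://example.com/api/live-scores"  # hypothetical endpoint, not from the video

while True:
    data = requests.get(URL, timeout=10).json()
    print(data)       # replace with your own diffing / storage logic
    time.sleep(1)     # poll roughly once per second
```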
Nice approach! What do you think about using a Semaphore instead of a temporal rate limiter?
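For context, a Semaphore caps how many requests are in flight at the same time rather than how often they start; a minimal sketch assuming an asyncio + httpx scraper (the limit and URLs are illustrative):

```python
import asyncio
import httpx

MAX_IN_FLIGHT = 5
sem = asyncio.Semaphore(MAX_IN_FLIGHT)

async def fetch(client: httpx.AsyncClient, url: str) -> str:
    # At most MAX_IN_FLIGHT tasks get past this point at once; unlike a
    # time-based limiter, it says nothing about requests per second
    async with sem:
        resp = await client.get(url)
        return resp.text

async def main() -> None:
    urls = [f"https://example.com/item/{i}" for i in range(50)]  # placeholder URLs
    async with httpx.AsyncClient() as client:
        await asyncio.gather(*(fetch(client, u) for u in urls))

asyncio.run(main())
```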
Hey John
We've seen how you use a list of selectors in a JSON file to scrape multiple websites.
Is there a library to automatically get selectors, or is this part manual for each website? Or is there a way to automate it using JSON schemas for each website?
Stay Golden!
You can automate it yourself by collecting selectors individually (in a set, for example) for each website, because each website will have different selectors.
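To illustrate that reply, one way to keep the per-site selectors organised is a JSON config keyed by domain, so the scraper just looks up the right set at runtime; the file name, domains, and selectors below are only examples:

```python
import json
from urllib.parse import urlparse

# Hypothetical selectors.json:
# {
#   "books.toscrape.com":  {"title": "h1", "price": ".price_color"},
#   "quotes.toscrape.com": {"quote": ".text", "author": ".author"}
# }
with open("selectors.json") as f:
    SELECTORS = json.load(f)

def selectors_for(url: str) -> dict:
    # Each site gets its own selector set because the markup differs per site
    return SELECTORS[urlparse(url).netloc]

print(selectors_for("https://books.toscrape.com/catalogue/some-book/"))
```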
Great tutorial as always! How do you add headers to the request? The docs are incomplete and I've never used Selenium, so network interception is a little confusing. The reason I'm asking is that the page isn't loading fully; it always times out after loading the navbar, and I think it's because I need some headers, or maybe because I'm not using any proxies. Thanks in advance! I'll post my solution if I find one.
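Not sure which Selenium setup is being used here, but with a plain Chromium driver one possible route is the DevTools Protocol's Network.setExtraHTTPHeaders; a sketch, with example header values:

```python
from selenium import webdriver

options = webdriver.ChromeOptions()
driver = webdriver.Chrome(options=options)

# Enable the Network domain, then merge extra headers into every request the page makes
driver.execute_cdp_cmd("Network.enable", {})
driver.execute_cdp_cmd(
    "Network.setExtraHTTPHeaders",
    {"headers": {"Accept-Language": "en-US,en;q=0.9", "Referer": "https://example.com/"}},
)

driver.get("https://example.com/")
```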
Hi John, my question is whether using a self-hosted proxy with multiple ports necessitates a Proxyscrape subscription, e.g. localhost:2001, *:2002, *:2003.
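If the proxy really is self-hosted, rotating through your own ports shouldn't need any third-party subscription; a minimal sketch with requests, where the ports and test URL are placeholders:

```python
import itertools
import requests

PORTS = itertools.cycle([2001, 2002, 2003])  # your own local proxy ports

def get(url: str) -> requests.Response:
    # Pick the next port for each request so traffic is spread across the proxies
    proxy = f"http://localhost:{next(PORTS)}"
    return requests.get(url, proxies={"http": proxy, "https": proxy}, timeout=15)

print(get("https://httpbin.org/ip").json())
```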
Which is better to use, SeleniumBase or driverless?
Both seem to work well, but I haven't used either enough to say; right now I am using Selenium Driverless more.
Great video, but please make a video about how to find hidden APIs.
Amazing for a beginner in scraping, your video is a life saver. Thank you!
What Linux do you use? Can you make a video about your web scraper PC setup, explaining the OS, tools, and IDE that you use in your day-to-day work at the moment?
He uses Fedora Linux with i3wm (tiling window manager)
I love your videos. They are amazing 💪
Hey John, can I run in headless mode?
Yes, absolutely.
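For anyone wondering what that looks like in code, a minimal headless setup with Selenium and Chrome (recent Chrome versions take the --headless=new flag):

```python
from selenium import webdriver

options = webdriver.ChromeOptions()
options.add_argument("--headless=new")            # run Chrome without a visible window
options.add_argument("--window-size=1920,1080")   # some pages render differently at tiny default sizes

driver = webdriver.Chrome(options=options)
driver.get("https://example.com/")
print(driver.title)
driver.quit()
```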
NoDriver lacks documentation and methods that undetected-chromedriver used to have, but it's faster than its predecessor.
It's buggy.
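Buggy or not, for anyone who wants to try it, the basic NoDriver flow (as far as I can tell from its README, so treat this as a sketch rather than gospel) is async end to end:

```python
import nodriver as uc

async def main():
    browser = await uc.start()                     # launches its own Chrome instance
    page = await browser.get("https://example.com/")
    html = await page.get_content()                # rendered page source
    print(len(html))

if __name__ == "__main__":
    uc.loop().run_until_complete(main())
```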
Make a video on Selenium Grid.
Can you share the base code?
Wow, never seen a 22-minute ad/commercial disguised as a tutorial,
+ before
You haven't? That's every video.
Isn't this the entire channel now? Every time I see his title saying "something is too slow/detectable, try this instead" I know it's a 20-minute waste of time.
I don't know what you are so upset by here. He clearly states in the description who he is affiliated with and spends the entire video teaching you how to do what his title says. Nowhere does he say only one proxy service can do this.
Make a video on some online tools for scraping, like PhantomBuster. How do they do it? They also go for platforms like LinkedIn where login is needed, plus the JS rendering is heavily involved. How do they do it from their cloud services? I want to know the technique so that we can also replicate such things, at least 10% of theirs, instead of just using Selenium, Scrapy, or Puppeteer.
Can you share the code with us?
Nice video.
It's very distracting to watch so many typos as you type, and the deleting/correcting of them. I'm not sure what the way to fix that would be. Maybe showing the code chunks already typed and explaining them instead of typing? Thank you!
I understand, and that's come up before. I can copy/paste chunks, which I have done in the recent past, but I wanted to show my working and how I got there, etc. I should just practice typing more…
@JohnWatsonRooney No, you're good, brother.
@JohnWatsonRooney You are doing an excellent job. Thanks so much.
Then don't watch the channel. You just want to copy and not learn.
@JohnWatsonRooney I disagree with the person above. I actually like that you're coding while recording and showing how you actually work.