Introduction to Scrapy API | Practical Python Web Scraping Tutorial (Part 1 of 2)

แชร์
ฝัง
  • เผยแพร่เมื่อ 2 ธ.ค. 2024

ความคิดเห็น • 36

  • @kevinz1991
    @kevinz1991 4 ปีที่แล้ว +2

    Really, really impressive how simple and clear this introduction to Scrapy is. Absolutely phenomenal thank you so much. So many gold nuggets in there too (e.g. always go to page 2 with pagination to see the full url). Thanks a lot for the time you put into this. Definitely subscribing to the channel blindly just because this video was so damn good.

    • @SATSifaction
      @SATSifaction  4 ปีที่แล้ว

      Than you Kevin

    • @kevinz1991
      @kevinz1991 4 ปีที่แล้ว

      @@SATSifaction is there any place on the channel that might address how you would scrape (e.g. this exact same tutorial demo book website if hypothetically it made you send a username/password first?)...? Googling around a bit it seems like I need to send the keys using Selenium in the parse() method that is generated by Scrapy when you create the spider. Just wondering if you could point me in the right direction possibly. Can't wait to see what other videos you come out with!

    • @SATSifaction
      @SATSifaction  4 ปีที่แล้ว +1

      Hi Kevin check out this. It’s similar to what you’re looking for I believe - th-cam.com/video/HsA0mJ4kNKE/w-d-xo.html

    • @kevinz1991
      @kevinz1991 4 ปีที่แล้ว +1

      @@SATSifaction thank you so much!!

  • @OptionGal
    @OptionGal 5 ปีที่แล้ว +2

    Yes, please! Would like to see also a tutorial on more advanced lesson on web scraping, beyond the basics ;-) Thanks!

  • @rhkina
    @rhkina 4 ปีที่แล้ว +1

    Great tutorial!!! Thank you!

  • @current1710
    @current1710 4 ปีที่แล้ว +1

    Amazing and simple tutorial. It was useful for me. Thank you so much!

  • @leventbozkurt9796
    @leventbozkurt9796 3 ปีที่แล้ว

    Perfect. Thank you.

  • @lucindaamicis6606
    @lucindaamicis6606 2 ปีที่แล้ว

    Thank you very much for this tutorial, I am really new to web scraping. When I tried to implement your code I got an error pseudo class attr() is unknown, has scrapy changed? Thank you again.

  • @emreilksu5344
    @emreilksu5344 4 ปีที่แล้ว

    Thanks for the simple tutorial. Would it be possible to handle a dynamic web-site using with splash together?

  • @Aprild174
    @Aprild174 5 ปีที่แล้ว

    Thank you for making this video. I'd like to see more on scraping.

    • @SATSifaction
      @SATSifaction  5 ปีที่แล้ว

      Stay tuned I always change things up

  • @subrinalazad3215
    @subrinalazad3215 4 ปีที่แล้ว

    Thank you for the video

  • @CurtlyTalks
    @CurtlyTalks 4 ปีที่แล้ว

    Hey, lovely tutorial. Could anyone point me to the packages you're using in sublime to write this, and the syntax/style settings ?

  • @xaviruiz8345
    @xaviruiz8345 5 ปีที่แล้ว

    Great tutorial! I'd like to see more videos about Scrapy, pls! Specially when the web target has no pages with number. Thanks!

    • @SATSifaction
      @SATSifaction  5 ปีที่แล้ว +1

      +Xavi Ruiz thank. Stay tuned I’m always adding new stuff

  • @SuperPaco0o
    @SuperPaco0o 4 ปีที่แล้ว +1

    Great tutorial. Thank you! Can you extent this tutorial with more advance features of scrapy? :)

  • @hayathbasha4519
    @hayathbasha4519 3 ปีที่แล้ว

    Hi,
    Please advice me on how to improve / speed up the scrapy process

    • @SATSifaction
      @SATSifaction  3 ปีที่แล้ว

      What specifically is bottleneck

    • @hayathbasha4519
      @hayathbasha4519 3 ปีที่แล้ว

      I am having a spider which runs upto 6 hr to scrape the datas
      So looking for options in scrapy so that atleast I can reduce half the time

  • @kazakhstanitprojects7657
    @kazakhstanitprojects7657 3 ปีที่แล้ว

    show work with user-agent proxi and scrapyd

  • @אריאלליפנהולץ
    @אריאלליפנהולץ 4 ปีที่แล้ว

    i'm a little bit confused because i'm trying to get info but the info is sitting inside a div in a div in a div in a tag so i don't understand how to get this

    • @antonsavitskiy7124
      @antonsavitskiy7124 4 ปีที่แล้ว

      check xpath method

    • @SATSifaction
      @SATSifaction  4 ปีที่แล้ว

      I’m general you should be able to search by div id in order to avoid multiple divs in divs

  • @iKrufkaPsotka
    @iKrufkaPsotka 4 ปีที่แล้ว +1

    liked as soon as you told me too. Look no thumbs down XD

  • @danishmallick920
    @danishmallick920 4 ปีที่แล้ว

    why am i getting duplicate data while scraping, I followed all the steps as shown here in the video

    • @SATSifaction
      @SATSifaction  4 ปีที่แล้ว

      duplicates are sometimes a result of an improperly formatted loop. Make sure you check indenting

    • @danishmallick920
      @danishmallick920 4 ปีที่แล้ว

      @@SATSifaction hey thanks i was able to fix it. Can you also make tutorial in how to scrape different kinds of website e.g static/dynamic? and is scrapy enough to scrape information from all kinds of website?

  • @josemadarieta865
    @josemadarieta865 4 ปีที่แล้ว

    gwd that background music was annoying af. sooooo imappropriate for a tutorial vid