ไม่สามารถเล่นวิดีโอนี้
ขออภัยในความไม่สะดวก

NLP Tutorial in Python - Spam Classification

แชร์
ฝัง
  • เผยแพร่เมื่อ 8 ก.พ. 2022
  • In this video we implement an email spam classifier using NLTK (natural language processing toolkit) in Python. We use the bag of words (BOW) approach to building the model, after performing tokenizing, lemmatization / stemming, and removing stop words. You'll learn a ton about NLP in just 20 minutes!
    Link to the Colab notebook: colab.research...
    Thank you for watching the video! You can learn data science FASTER at mlnow.ai!
    Master Python at mlnow.ai/cours...!
    Learn SQL & Relational Databases at mlnow.ai/cours...!
    Learn NumPy, Pandas, and Python for Data Science at mlnow.ai/cours...!
    Become a Machine Learning Expert at mlnow.ai/cours...!
    Don't forget to subscribe if you enjoyed the video :D

ความคิดเห็น • 32

  • @GregHogg
    @GregHogg  ปีที่แล้ว

    Take my courses at mlnow.ai/!

  • @duchahapsari7081
    @duchahapsari7081 2 ปีที่แล้ว +2

    Wow! This is an amazing tutorial. Learn so much about fundamental NLP, while keeping it so practical to follow. Looking for more crisp contents from you, Greg!

    • @GregHogg
      @GregHogg  2 ปีที่แล้ว

      Really glad to hear that Ducha!

  • @ashathotan
    @ashathotan 10 หลายเดือนก่อน

    I enjoyed watching your illustration on the email spam.

  • @DarkTobias7
    @DarkTobias7 2 ปีที่แล้ว +2

    Amazing video, please do more NLP projects

    • @GregHogg
      @GregHogg  2 ปีที่แล้ว +1

      Thanks so much, and will do!

  • @Brocollipy
    @Brocollipy 2 ปีที่แล้ว

    Great video , just what I needed. Need to test out some models quickly and really can't do another 3hr course!!

  • @arsheyajain7055
    @arsheyajain7055 2 ปีที่แล้ว +1

    This is awesome 👏

    • @GregHogg
      @GregHogg  2 ปีที่แล้ว

      I thought so too!

  • @gustavojuantorena
    @gustavojuantorena 2 ปีที่แล้ว +1

    Great video!

    • @GregHogg
      @GregHogg  2 ปีที่แล้ว

      Thanks so much Gustavo!

  • @akshitadixit_1068
    @akshitadixit_1068 ปีที่แล้ว +1

    Thanks Greg for this amazing video, however i have a doubt, how are we determining that the tokens with the maximum frequency are the ones contributing to the spamy nature of the message, their is a possibility that tokens with a low frequency are equally malicious.
    Thanks!

    • @GregHogg
      @GregHogg  ปีที่แล้ว

      Yes that's very true, you could absolutely change how I did things

  • @jspetrolina
    @jspetrolina ปีที่แล้ว

    Hi Greg! Old but gold, fabulous video, let me ask you, is that structure that you build what packages like spacy is doing behind the scenes? Thanks again

    • @GregHogg
      @GregHogg  ปีที่แล้ว

      Thank you :) and what structure sorry?

  • @mikekertser5384
    @mikekertser5384 2 ปีที่แล้ว +2

    Thank you! Can you please make a video with the review of the word embeddings models and corresponding transfer learning examples?
    And some nlp feature engineering as well. :)

    • @GregHogg
      @GregHogg  2 ปีที่แล้ว +1

      You're very welcome - and this is in the works :)

  • @iqrarkhan8129
    @iqrarkhan8129 2 ปีที่แล้ว

    thanks that was quite helpful bus can you also please do a malware detection and classification using machine learning algo? if yes please do and upload as soon as possible

    • @GregHogg
      @GregHogg  2 ปีที่แล้ว

      You're very welcome! Probably eventually, but won't be able to do that for awhile sorry.

  • @InfernalPasquale
    @InfernalPasquale ปีที่แล้ว

    12:00 Why is features = set() converted to a list, rather than just being a list to begin with?

    • @rahulnayak8866
      @rahulnayak8866 11 หลายเดือนก่อน

      using a list instead of set will result in getting redundant values, so in order to get unique values set() is used which is then converted into a list.

  • @e_hossam96
    @e_hossam96 2 ปีที่แล้ว

    This is great

    • @GregHogg
      @GregHogg  2 ปีที่แล้ว

      Thanks so much Hossam!! :)

  • @writabratadey8048
    @writabratadey8048 ปีที่แล้ว

    nltk.download() is not working. showing WinError 10060 everytime ie. connection attempt failed...plz provide a solution

    • @GregHogg
      @GregHogg  ปีที่แล้ว

      Probably slow internet unfortunately

  • @MyStockz
    @MyStockz 2 ปีที่แล้ว

    Hi Greg Hogg! Hope all is well! Have you tried or heard of a website called logikbot? If yes, what do you think of it?

    • @GregHogg
      @GregHogg  2 ปีที่แล้ว

      Never heard :)

    • @MyStockz
      @MyStockz 2 ปีที่แล้ว

      @@GregHogg Cool!

  • @tareq8109
    @tareq8109 2 ปีที่แล้ว

    Want NLP series

    • @GregHogg
      @GregHogg  2 ปีที่แล้ว +1

      Working on it!

  • @dannyrodin1151
    @dannyrodin1151 2 ปีที่แล้ว

    I enjoy your videos, but this one's way too fast. I'll need to watch it 3 times with 0.5 speed.

    • @GregHogg
      @GregHogg  2 ปีที่แล้ว

      Hmm, I appreciate the feedback here.