NLP Demystified 3: Basic Preprocessing (case-folding, stop words, stemming, lemmatization)

Future Mojo

Views: 12,143


Published on 27 Nov 2024

Comments

@futuremojo 2 years ago ⁺³
Timestamps:
00:00:00 Basic Preprocessing
00:00:35 Case-folding and its tradeoffs
00:02:40 Stop word removal (tradeoffs and how it can go wrong)
00:04:40 Stemming (tradeoffs and things to watch out for)
00:06:28 Lemmatization and its advantages over stemming
00:07:52 DEMO: basic processing with spaCy
00:10:37 Basic preprocessing recap
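
As a rough illustration of the steps listed in the timestamps (case-folding, stop word removal, stemming), here is a minimal pure-Python sketch. It is not the spaCy demo from the video: the stop-word list, the suffix list, and the function names are all hypothetical and chosen only for illustration. The toy stemmer is deliberately crude, so it also shows how stemming can produce non-words.

```python
import re

# Hypothetical, tiny stop-word list for illustration; real lists are much longer.
STOP_WORDS = {"a", "an", "the", "is", "are", "was", "and", "or", "of", "to", "in", "on"}

# Suffixes stripped by this toy stemmer, checked in this order (longest first).
SUFFIXES = ("ing", "ed", "es", "s")


def case_fold(text):
    """Lowercase aggressively so 'Garden' and 'garden' become the same token."""
    return text.casefold()


def tokenize(text):
    """Split on runs of word characters; punctuation and whitespace are dropped."""
    return re.findall(r"\w+", text)


def remove_stop_words(tokens):
    """Drop tokens that appear in the (assumed) stop-word list."""
    return [t for t in tokens if t not in STOP_WORDS]


def naive_stem(token):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text):
    """Case-fold, tokenize, remove stop words, then stem each remaining token."""
    tokens = tokenize(case_fold(text))
    tokens = remove_stop_words(tokens)
    return [naive_stem(t) for t in tokens]


print(preprocess("The cats are running in the Garden"))
# → ['cat', 'runn', 'garden']
```

Note that "running" becomes "runn", which is not a word; a lemmatizer backed by a dictionary would be expected to return "run" instead. Case-folding has a cost too: `case_fold("US")` returns `"us"`, which collapses the country and the pronoun into one token.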
@khalidnaveed1077 3 months ago
Great concise intro, I see you getting big in the future. Keep up with the work.
@nisargpatel1443 5 months ago
Concise and easily understandable. Thanks a lot for the series.
@YashodPerera-b9j a year ago
This is the best NLP series I have ever watched
@YashodPerera-b9j a year ago ⁺¹
This content is simple and easy to understand.
@somerset006 a year ago ⁺¹
Well done, thanks!
@rishidixit7939 months ago
I have a bunch of reviews (about 20 million) of places like restaurants, cafes, pet groomers, cleaners and other services.
Now I have to categorize them into service categories like food, pet grooming, cleaning, etc. A heavy model like BERT takes a lot of time and resources.
The data is not labelled by service, so I was thinking about clustering with food or not-food as the only classes, kind of like aspect-based classification.
@rishidixit7939 months ago
I also have one more question: with so many product reviews (around 20 million), how do I analyze and clean my data? In some places the punctuation is wrong, some have too many spaces, etc. It is not possible to see all the errors in the reviews.
In that case, how should I preprocess the data?
