Text Preprocessing | NLP Course Lecture 3

แชร์
ฝัง
  • เผยแพร่เมื่อ 4 ต.ค. 2024

ความคิดเห็น • 155

  • @harinair3002
    @harinair3002 9 หลายเดือนก่อน +50

    Anyone following this playlist, my recommendation to them is to please do the assignment, I was shocked at how little we learn by just watching, I did the assignment and what can I say, I was stuck a lot of times and at the end, I completed and now I regularly do Text Preprocessing by making my datasets from Rapid APIs, It gives one soo much flexibility to work on a dataset they created.

    • @surajnikam3327
      @surajnikam3327 9 หลายเดือนก่อน

      Mam can you explain me or refer some notes or videos on using API's and Create own Dataframe

    • @komalkumbhare4789
      @komalkumbhare4789 8 หลายเดือนก่อน +1

      Hey Hari! The assignment links given above are not directing to the tmdb website, and if I search of TMDB directly on google, it doesn't work as well. Can you tell me how you did that?

    • @sampath4150
      @sampath4150 2 หลายเดือนก่อน +1

      hello have you saved that code ,its been removed i need it immediately

    • @venualli3917
      @venualli3917 2 หลายเดือนก่อน

      Would you please let me know resources for practice

    • @divyatagupta597
      @divyatagupta597 หลายเดือนก่อน

      @@surajnikam3327 it is already mentioned in ml playlist created by sir himself

  • @GamerBoy-ii4jc
    @GamerBoy-ii4jc 2 ปีที่แล้ว +8

    Again Sir your are a great person on you tube.. your explanation in every domain and for every topic is great...i followed you ML playlist A-Z and now i start watching NLP.. i hope you will complete your ML series soon and this too and also making great series for us with new and needed emerging thigs ...Thanks Alot Sir!

  • @usmanhaider5255
    @usmanhaider5255 2 หลายเดือนก่อน +2

    Session Was SO Good.
    Assignment Was SO SO SO SO Amazing To Do.
    Thank For Your Hard Work Sir.

  • @shikhasoni9346
    @shikhasoni9346 2 ปีที่แล้ว +6

    your lectures really help me to understand NLP Text Preprocessing , Thank you so much!

  • @sukantb1980
    @sukantb1980 2 ปีที่แล้ว +21

    You are a rare gem , I can simply put that in clear short words❤️❤️

    • @bhanu0925
      @bhanu0925 2 ปีที่แล้ว +3

      Exactly, rarest !!

  • @siddharthbhardwaj7664
    @siddharthbhardwaj7664 2 ปีที่แล้ว +4

    Hi, Could you please make the next video on the same IMDB data set and show us how to analyze the linguistic features of the training dataset? I have recently gone through your previous NLP (Movie Review Sentiment Analysis) videos. However, I was quite interested in finding out how can we analyze the linguistic features and what all different algorithms can we apply apart from the Naive Bayes on the same IMDB dataset. PS - your videos are amazing!!! the way you teach the concepts has helped me to understand the basics of NLP. Thank you so much!!

  • @prashantlakde
    @prashantlakde 2 ปีที่แล้ว +1

    Ur way of explaination shows ur concept clearity and ur efforts to prepare this topic...keep it up.

  • @sarithajaligama9548
    @sarithajaligama9548 6 หลายเดือนก่อน

    Very good explanation. your explaining every single details. it's very helpful for beginners. and assignements also very intresting.
    i feel like why im not found your channel before but lucky to have right now

  • @ayushsachdeva4635
    @ayushsachdeva4635 หลายเดือนก่อน +2

    56:58 can we use the spelling corrector with Stemming ?? we can get better efficiency with correct spellings and no mistake

  • @Riya-zb1iz
    @Riya-zb1iz ปีที่แล้ว +2

    This series is amazing!

  • @pralaymondal3324
    @pralaymondal3324 2 ปีที่แล้ว +4

    Thank you, you are just awesome. Much waited for this video. You explain things better than other youtubers. Keep it up...!!!

  • @sachi-4750
    @sachi-4750 2 ปีที่แล้ว +1

    You are really a great teacher, thank you so much for coming up with such informative videos, Thanks a lot

  • @mohaiminrahat4974
    @mohaiminrahat4974 2 ปีที่แล้ว +2

    Sir you are a lifesaver.Thankyouuuuuu

  • @satyamtiwari7680
    @satyamtiwari7680 ปีที่แล้ว +1

    Easy way to remove punctuations.
    import string
    import re
    def remove_punctuation(text):
    # Define the set of punctuation characters
    punctuations = string.punctuation
    # Remove punctuation using regular expressions
    text_no_punct = re.sub('[' + re.escape(punctuations) + ']', '', text)
    return text_no_punct

  • @pankajbeldar9799
    @pankajbeldar9799 ปีที่แล้ว +1

    You are God for me in learning data science

  • @rafibasha4145
    @rafibasha4145 2 ปีที่แล้ว +3

    please tag notbook in description,also please complete NLP playlist

  • @manishachaurasia3405
    @manishachaurasia3405 ปีที่แล้ว +1

    Series is amazing sir 👏 kindly provide the regex lecture in the description

  • @BTStechnicalchannel
    @BTStechnicalchannel 2 ปีที่แล้ว +2

    Thanks! for the great content!! One small suggestion can you also give us sometime to write code you are explaining otherwise it becomes theoritical.

  • @abhishek_iith
    @abhishek_iith ปีที่แล้ว +3

    Your videos are full of knowledge. Thanks a lot for this 🙏 you deserve more subscribers... it can attract more viewers if you divide your videos into smaller parts. People generally don't want to engage with long lectures.

  • @saurabhdeshmane8714
    @saurabhdeshmane8714 2 ปีที่แล้ว +5

    sir could you please share notebook, it is not available on given link

  • @samt5682
    @samt5682 2 ปีที่แล้ว

    Literally, All In One !

  • @Akashphs7217
    @Akashphs7217 4 หลายเดือนก่อน +1

    Hi Sir. Regarding the assignment, how can we meagre genre id and genre type with movies data-frame?
    I got stuck there.

  • @gautampatadiya6096
    @gautampatadiya6096 8 หลายเดือนก่อน +2

    Thanks!

  • @anupprasad695
    @anupprasad695 2 ปีที่แล้ว +1

    One suggestion: sir, ek udemy course banaiye.... Data science bootcamp...

  • @rajeevranjan5007
    @rajeevranjan5007 2 ปีที่แล้ว +1

    Nice assignment Sir. Thankyou

  • @miteshkumar7739
    @miteshkumar7739 2 ปีที่แล้ว

    Your lecture are really helpful...all consept are very clear

  • @shipradhiman08
    @shipradhiman08 2 ปีที่แล้ว +1

    Awesome lecture 🤗🤗🤗❤️❤️❤️❤️

  • @mridang2064
    @mridang2064 2 ปีที่แล้ว +3

    Dhanyavaad. Can you also start a series on web development ?
    You're just an excellent teacher

    • @Codingon_lup
      @Codingon_lup ปีที่แล้ว

      hey

    • @Codingon_lup
      @Codingon_lup ปีที่แล้ว

      are you working in NLP or other in python?
      i need your help
      can you help me?

  • @MRBAM
    @MRBAM 2 ปีที่แล้ว +1

    Its helpful for me ❤️

  • @NaryVip
    @NaryVip 2 ปีที่แล้ว +2

    You didn't link the video for regular expression in description, can u update it

  • @stunninghealer7442
    @stunninghealer7442 7 หลายเดือนก่อน

    You are the best sir😊.

  • @anjalihansda437
    @anjalihansda437 2 หลายเดือนก่อน

    Bahot acha smjhate ho :)

  • @abdulqadar9580
    @abdulqadar9580 ปีที่แล้ว

    You are Amazing Sir Love from Pakistan.

  • @raj-nq8ke
    @raj-nq8ke 2 ปีที่แล้ว

    Gold contents. Thanks for the video

  • @NishantKumar-dw5er
    @NishantKumar-dw5er ปีที่แล้ว

    very detailed explanation. Kudos to you.

  • @siddharth4251
    @siddharth4251 ปีที่แล้ว +1

    Thank a lot Nitish ....i dont have enough words to express my gratitude.

  • @faizahmed8015
    @faizahmed8015 ปีที่แล้ว

    56:30 with 'e' probable hai...
    I understand but it was confusing me.
    And Thank you Sir such a good video ❤

  • @manucmgowda
    @manucmgowda ปีที่แล้ว +1

    Sir the notebook link is dysfunctional .....pls upload the notebook discussed in the video

  • @piyushpathak7311
    @piyushpathak7311 2 ปีที่แล้ว +1

    Sir when you will start series on Deep learning..

  • @jandaabdulla9335
    @jandaabdulla9335 2 ปีที่แล้ว

    Congo sir for third video🥳🥳

  • @raj4624
    @raj4624 2 ปีที่แล้ว

    so far so good.....awesome x 100

  • @ajitkulkarni1702
    @ajitkulkarni1702 ปีที่แล้ว +1

    Hello Sir, can you reshare code, the link you shared has no code....Thanks !

  • @dilipkumarbk7657
    @dilipkumarbk7657 ปีที่แล้ว

    The way of teaching is cool loved it.
    One doubt 12:00 in remove_html_tags() it only removes the tags but in real time when we scrap data from a website it contains tags like style, script etc which aren't required in the text mining or NLP process.
    Just wanted to know is there any other better approach or method that could solve this thing.
    Thanks in advance for everyone who tries to solve this.

  • @Auruenjuhshsh1999
    @Auruenjuhshsh1999 2 หลายเดือนก่อน

    Ek doubt tha though data set mai chat words dictionary banake bhi nikal sakte hai but agar naya data mila toh there should be a way to identify the chat words then put that in dictionary. or tokenization karke hi we can identify these words?

  • @romanahmed4754
    @romanahmed4754 หลายเดือนก่อน

    Need, your regular expression TH-cam video,link please

  • @pankajnaik1574
    @pankajnaik1574 ปีที่แล้ว

    You are the best

  • @imamasafeer4536
    @imamasafeer4536 7 หลายเดือนก่อน

    Where is the video on Regular Expressions?

  • @unknown-ho4wk
    @unknown-ho4wk 9 หลายเดือนก่อน

    that was awsome tutorial can you pls link to your Regular expression video ?

  • @anshuman_madhav
    @anshuman_madhav 2 ปีที่แล้ว +3

    While using the lowercase conversion function shown at 7:23 , I am getting below warning,even though conversion is successful. Can you let me know if any other way is there to do conversion or we can ignore the warning?
    A value is trying to be set on a copy of a slice from a DataFrame.
    Try using .loc[row_indexer,col_indexer] = value instead

  • @siddharthkarale3100
    @siddharthkarale3100 6 หลายเดือนก่อน

    Getting problem while doing assignment as I have no idea how to get data into a dataframe using api.

  • @cipher4811
    @cipher4811 2 ปีที่แล้ว +1

    Sir I have been following you for long time and glad that I found your channel and learning so much from you and for that I am greatful and thank you from bottom of my heart.
    Till now I was working with Google colab but as I am moving towards deep learning now I think it's time for me to buy high end laptop..
    But I am at a loss which one should I pic if I go for rtx 3080 then the price is way to much for me ... Having this confusion for past few weeks can you please please please suggest me a laptop for ml&Al&dl learning projects and my budget is 1400-1500$
    I will be greatful .
    Or you may make a video on this topic

  • @potjason2132
    @potjason2132 7 หลายเดือนก่อน

    actually tokenization doesn't work in dataset. can u write code to tokenize only the reviews in ur dataset

  • @deepankarmullick3121
    @deepankarmullick3121 2 ปีที่แล้ว

    Amazing video but from where can i download the notebooks.
    I would also request you to share the notebook url's in the video description.

  • @riiyyyaaaa
    @riiyyyaaaa 7 หลายเดือนก่อน

    Hi Sir, Can you please re add the data links here as unable to load it.

  • @rahulrajbhar7012
    @rahulrajbhar7012 2 ปีที่แล้ว

    How to explain a data science project in interview for fresher please make it one video.

  • @kumarabhishek1064
    @kumarabhishek1064 2 ปีที่แล้ว +1

    where is the template notebook?

  • @anitabhandari3886
    @anitabhandari3886 6 หลายเดือนก่อน

    @campusX : can you please suggest how can we use text for regression (for eg. use comments to predict number of subscribers)

  • @mdaliarmaghan8292
    @mdaliarmaghan8292 4 หลายเดือนก่อน

    Can you please provide solution for this assignment

  • @shaiksalavuddin5976
    @shaiksalavuddin5976 2 ปีที่แล้ว

    Sir thank you so much😊

  • @SLADE-VA
    @SLADE-VA 8 หลายเดือนก่อน

    Couldn't find the Notebook link!

  • @tanmayshinde7853
    @tanmayshinde7853 2 ปีที่แล้ว +1

    Does anyone know how to apply word/sentence tokenizer on columns? if you know please reply.

  • @rishabhvarshney2234
    @rishabhvarshney2234 2 ปีที่แล้ว +1

    Can we get the pdf of code that you have written in ths vedio

  • @jasonbourn29
    @jasonbourn29 ปีที่แล้ว

    I checked both methods (removing punctuation)but they are similar in speed sometimes the second one is slower why is it so

  • @kislaykrishna5599
    @kislaykrishna5599 2 ปีที่แล้ว

    great content

  • @trackbackresearch
    @trackbackresearch 2 ปีที่แล้ว

    Thankyou Sir .

  • @sachin2725
    @sachin2725 ปีที่แล้ว

    please tag notebook used in this video in description,

  • @swet_gokugod9382
    @swet_gokugod9382 11 หลายเดือนก่อน

    Great

  • @ashishsom3849
    @ashishsom3849 3 หลายเดือนก่อน

    I am not able to find the notebook of the code.
    Could anyone please help?

  • @AshishSharma-tf3fy
    @AshishSharma-tf3fy 4 หลายเดือนก่อน

    sir TMDB website is blocked in india

  • @anooshkaa
    @anooshkaa 7 หลายเดือนก่อน

    notebook ka koi saved version nahi dikhara hai.

  • @youtubekumar8590
    @youtubekumar8590 ปีที่แล้ว

    Thanku Bhaiya

  • @piyushpawar75
    @piyushpawar75 9 หลายเดือนก่อน

    I got an error by using spacy library which is OSError

  • @ambarkumar7805
    @ambarkumar7805 หลายเดือนก่อน

    the code link is not found?

  • @bhushanbowlekar4539
    @bhushanbowlekar4539 ปีที่แล้ว

    Sir at timestamp 3.30 you said you will provide notebook , can you please provide that , Thank you

  • @ritakathrotiya
    @ritakathrotiya 6 หลายเดือนก่อน

    In the assignment, Can anyone have the solution on how to change genres ID to it's Name ?

  • @tusarmundhra5560
    @tusarmundhra5560 11 หลายเดือนก่อน

    awesome

  • @SurajitDas-gk1uv
    @SurajitDas-gk1uv 8 หลายเดือนก่อน

    Thank you

  • @snrmedia8965
    @snrmedia8965 2 ปีที่แล้ว

    Nice video👍

  • @bhushanbowlekar4539
    @bhushanbowlekar4539 ปีที่แล้ว +1

    can you please share the colab file

  • @anshumanmahabhoi5771
    @anshumanmahabhoi5771 9 หลายเดือนก่อน

    where is the notebook ?

  • @bibasrai752
    @bibasrai752 ปีที่แล้ว +1

    do you have videos on Nlp with deep learning ?

  • @bhanuprakash5060
    @bhanuprakash5060 ปีที่แล้ว

    where is notebbok of this lecture?? could u please just upload the notebook

  • @surajnikam3327
    @surajnikam3327 9 หลายเดือนก่อน

    Can Anyone explain me how to create dataframe for assignment using thia API . PLEASE!🙏

  • @miteshkumar7739
    @miteshkumar7739 2 ปีที่แล้ว

    Hello Sir ,
    make a video for R programming language plezzz......

  • @abhishekvashistha2398
    @abhishekvashistha2398 3 หลายเดือนก่อน

    code used is not available in the link. if anyone has please share.

  • @shyamtyagi95
    @shyamtyagi95 2 ปีที่แล้ว

    Nice video

  • @gauravverma3700
    @gauravverma3700 2 ปีที่แล้ว

    Awesome

  • @MRBAM
    @MRBAM 2 ปีที่แล้ว +1

    👍

  • @zkhan2023
    @zkhan2023 2 ปีที่แล้ว

    Thanks sir

  • @adityasoni1639
    @adityasoni1639 2 ปีที่แล้ว

    the notebook/code is not available .!!!

  • @kalpesh_saindane108
    @kalpesh_saindane108 29 วันที่ผ่านมา

    sir stemmer kyu use karna o aapne bataya nahi...root words me kyu lana hai o bataya nahi apne...we are reducing dimensionality of our data.is that correct?

    • @IqraKhan-xh2cp
      @IqraKhan-xh2cp 21 วันที่ผ่านมา +1

      We use stemmer because tokenization k time hm same meaning wale words ko more than once consider na kre... if hm stemming nhi krnge toh hmara algorithm walk and walking ko different words consider krega..jo ki hai same..which is not good for our model...isiliye we use stemming...moreover it is not dimensionality reduction..we are not reducing the no. of columns here....we are cleaning our data..we are following the principle of "GARBAGE IN GARBAGE OUT"

    • @kalpesh_saindane108
      @kalpesh_saindane108 21 วันที่ผ่านมา

      @@IqraKhan-xh2cp same context ka word deke koi matlab nahi hai.. Usase algo me koi change nahi anevala.. Its just increasing our dimensions ye bhi ek reason hai.. Or stemmer se meaningful word se koi matlab nahi hai.. O to sirf root word me convert karta hai jo ki meaning less bhi ho sakta hai.. Jinke root same hai unhe ek consider karna taki more imp dimension mile.. 👍🏻

  • @rishabhvarshney2234
    @rishabhvarshney2234 2 ปีที่แล้ว +2

    I don't get the code

    • @Shobhitchoudhary321
      @Shobhitchoudhary321 3 หลายเดือนก่อน

      colab.research.google.com/drive/1sAjdLZStcavDt4ktHe3j_NUllO_yAZ-v?usp=sharing

  • @ayushroy6208
    @ayushroy6208 2 ปีที่แล้ว

    Sir i m making a project using the concepts u told in NLP..... Ek week mein heroku ka ek link comment pe dunga sir plz dekhke batana aap kaisa hua hai.... Plz sir.... And thank u 🙏for teaching us so much

    • @campusx-official
      @campusx-official  2 ปีที่แล้ว

      Sure

    • @ritakathrotiya
      @ritakathrotiya 6 หลายเดือนก่อน

      In the assignment, How can i change genres ID to its Name after creating both datasets?@@campusx-official

  • @pradumankumar7607
    @pradumankumar7607 2 ปีที่แล้ว

    sir can you please share the link of "chatword" used in chatword treatment

  • @ahmedullahkhan9166
    @ahmedullahkhan9166 ปีที่แล้ว

    where is the notebook link?
    the above link only showing csv file.

  • @shrutianand285
    @shrutianand285 2 ปีที่แล้ว

    How to use textblob for a large dataset?

  • @vijayraghuwanshi4486
    @vijayraghuwanshi4486 ปีที่แล้ว

    anyone tried the assignment? if please reply I have few doubts

  • @ShivaniSharma-tk4bl
    @ShivaniSharma-tk4bl ปีที่แล้ว

    @campusX I cant find the codes. can you plz plz give the link?

  • @allwithinone1345
    @allwithinone1345 2 ปีที่แล้ว

    thank sir

  • @furry2fun
    @furry2fun ปีที่แล้ว

    can anyone send the link to the notebook, the given link does not work