Naive Bayes algorithm in Machine learning Program | Text Classification python (2018)

แชร์
ฝัง
  • เผยแพร่เมื่อ 24 ม.ค. 2025

ความคิดเห็น • 169

  • @mohanarajjagadesan8940
    @mohanarajjagadesan8940 4 ปีที่แล้ว +1

    Your way of explanation is too good

  • @sandipdikshit3736
    @sandipdikshit3736 3 ปีที่แล้ว +4

    This is one of the best clear example setting videos with step-by-step architecture than that of any "Edtech platforms". Yes, this is what we want as an explanation. It's that simple rather than something more to it. I am surely subscribing to your channel for more explanations in near future. "Beautifully broken down and explained"

  • @shaminmohammed672
    @shaminmohammed672 3 ปีที่แล้ว +1

    You are awesome. You are better than my professor.. thank you

  • @Aminulislam-jf2ct
    @Aminulislam-jf2ct 4 ปีที่แล้ว

    Everyone who want to start text classification research should watch this video...
    Really well explained

    • @CodeWrestling
      @CodeWrestling  4 ปีที่แล้ว

      Thank you so much!! It means a lot

  • @sapanabasukala2291
    @sapanabasukala2291 3 ปีที่แล้ว +1

    Perfect Explanation on countvectorizer, tfidfvectorizer and all the metrics with good examples.

  • @jkore2554
    @jkore2554 4 ปีที่แล้ว

    This is exactly what I was looking for. I learned how to build a binary classifier (e.g. ham, spam), but needed to learn how to build a model that would predict an outcome for more than two categories. Thank you for the tutorial!

    • @romanjaxx1600
      @romanjaxx1600 3 ปีที่แล้ว

      i know it is quite randomly asking but do anyone know of a good place to watch newly released tv shows online ?

    • @mohaab8786
      @mohaab8786 ปีที่แล้ว

      is this program (Web Document Classification Using Naïve Bayes in advanced data mining)? i need to know please

  • @vamsikrishna1131
    @vamsikrishna1131 6 ปีที่แล้ว +1

    better than a few videos i have seen trying to understand the basics of NB+python

  • @inzayngirl
    @inzayngirl 4 ปีที่แล้ว

    Firstly i would like to thank for your video.
    Finally found something for text classification with proper explanation

  • @nikhithasrinivas
    @nikhithasrinivas 6 ปีที่แล้ว +3

    You have a got a very detailed explanation of concept. This is a very big asset for my project implementation.
    Thank you so much and please make more videos with more machine learning algorithms.

    • @CodeWrestling
      @CodeWrestling  5 ปีที่แล้ว +1

      soon we will make 😄 #codewrestling

  • @GLocarso
    @GLocarso 3 ปีที่แล้ว +1

    THANKYOUUUU 1 LIFE SAVED 🙏🙌😁

  • @MrEvergreen777
    @MrEvergreen777 4 ปีที่แล้ว

    very nice superb explanation would like to see more of data science and text mining videos

  • @dbiswas
    @dbiswas 4 ปีที่แล้ว

    Your explanation is excellent. Keep up with such good teaching. Thanks !

  • @Trouble.drouble
    @Trouble.drouble 4 ปีที่แล้ว +1

    Good explanation keep it up champ

  • @_shreyagarg
    @_shreyagarg 3 ปีที่แล้ว

    Very helpful. Totally recommend seeing this.

  • @wsgsantos
    @wsgsantos 5 ปีที่แล้ว +2

    Congrats from Brazil!

  • @abelyosua4235
    @abelyosua4235 3 ปีที่แล้ว

    Amazing so easy to understand. Thank You

  • @Iffe4
    @Iffe4 5 ปีที่แล้ว +5

    Well explained and properly coded implementation. Thank you for explaining in this manner.

  • @atriraha
    @atriraha 6 ปีที่แล้ว +1

    Great narrative and explaination. Keep up the good work brother !

  • @kapiljain4234
    @kapiljain4234 4 ปีที่แล้ว +1

    Awesome explanation. Keep uploading. Thanks a lot :)

  • @pavani3830
    @pavani3830 4 ปีที่แล้ว +1

    Thank you🙏very well explained.please make a video🎥on other algorithms too.please.or else suggest some best channel for it

  • @manjula6942
    @manjula6942 5 ปีที่แล้ว +2

    good explanation and very useful for beginners.

  • @viralentertainmentnetwork2171
    @viralentertainmentnetwork2171 4 ปีที่แล้ว +1

    Hi , very nicely explained. Basically I need to understand what will be the output of this model like if I want to return the text as result of text analysis so how can we display the output in excel sheet again in further classification like neutral, positive,negative ..thanks

  • @ayushk1666
    @ayushk1666 4 ปีที่แล้ว

    Superb explanation .. keep it up

  • @unio-yourofficialcollegeap3361
    @unio-yourofficialcollegeap3361 6 ปีที่แล้ว

    Excellent explanation clearly and succinctly - Very well done..

    • @CodeWrestling
      @CodeWrestling  6 ปีที่แล้ว

      Thanks and Stay Tuned with us!! :-)

  • @sofluzik
    @sofluzik 4 ปีที่แล้ว +1

    Awesome brother continue the good work 🙂

  • @roha12
    @roha12 2 ปีที่แล้ว +1

    wow good job go ahead!

  • @anilkumar-dm8om
    @anilkumar-dm8om 5 ปีที่แล้ว +1

    Nice and clear explanation

  • @harshitvishwakarma310
    @harshitvishwakarma310 3 ปีที่แล้ว

    what if i want to test for single document and want to predict its target_names ?

  • @JiminPark-ld2xx
    @JiminPark-ld2xx 3 ปีที่แล้ว

    Can you tell me if there's a way I can do the same classifier with Excel or CSV data sheet?

  • @naz-kh6lj
    @naz-kh6lj 4 ปีที่แล้ว

    hi , i would like to ask you something. what techniques should i use to find some keyword in my csv file and then if match with the keyword, i want to assign it to another keyword. the output something like this,
    column A Keyword
    DUMPBLT:TESTING FAILED: OPERATOR PUSHED STOP BUTTON SYSTEM FAILED
    if i found keyword of 'DUMPBLT' and 'PUSHED STOP BUTTON' in column A, i want to assign it to "SYSTEM FAILED" and put to other column. can you help me about this ?

  • @Ankit-hs9nb
    @Ankit-hs9nb 5 ปีที่แล้ว +2

    Thanks, dude, nice explanation!

    • @CodeWrestling
      @CodeWrestling  5 ปีที่แล้ว

      Thanks a lot. Stay Tuned #CodeWrestling

  • @amitjuyal63
    @amitjuyal63 5 ปีที่แล้ว +1

    Great explanation really helpful thanks

  • @mohaab8786
    @mohaab8786 ปีที่แล้ว

    Thanks sir. is this program (Web Document Classification Using Naïve Bayes )?

  • @nesreddine7103
    @nesreddine7103 4 ปีที่แล้ว

    Thank you, it's a very nice explanation, great helpful

  • @prashantkumarjharia
    @prashantkumarjharia 4 ปีที่แล้ว +1

    Hi, I want to build text to intent classification program. Can you suggest which algorithm should be used and how to achieve the same.

  • @rianasmaraputra
    @rianasmaraputra 5 ปีที่แล้ว +4

    can you explain please, what should i do if i want to classify twitter dataset from csv file ?
    thanks

    • @guelibbouchra1115
      @guelibbouchra1115 5 ปีที่แล้ว

      Did you find how to do that ?

    • @rianasmaraputra
      @rianasmaraputra 5 ปีที่แล้ว

      @@guelibbouchra1115 yes, i'm using pandas dataframe and preprocessing and classifiy

    • @avinashprasad4181
      @avinashprasad4181 5 ปีที่แล้ว

      @@rianasmaraputra can you help me with how to classify twitter data from a text file

    • @rianasmaraputra
      @rianasmaraputra 5 ปีที่แล้ว

      @@avinashprasad4181 yeah, find me on twitter @rianasmara_p

    • @gasmisafa2979
      @gasmisafa2979 5 ปีที่แล้ว

      can you help me please Rian Asmara Putra ?

  • @haritar9053
    @haritar9053 5 ปีที่แล้ว

    The concepts are very well explained. But the classifier isn't able to distinguish between alt.atheism and soc.religion.christian. How would you fine tune the model?

  • @lakkojufam366
    @lakkojufam366 3 ปีที่แล้ว

    Can we classify someother pdf files as datasets using this implementation?? Please Answer bro, I am in a similar kind of project

  • @amritasengupta4260
    @amritasengupta4260 4 ปีที่แล้ว

    Why are we not removing the punctuations and stopwords?

  • @shankar3109
    @shankar3109 5 ปีที่แล้ว +2

    Nicely explained magician :)

  • @ashishpandey4072
    @ashishpandey4072 4 ปีที่แล้ว

    Good explanation bro..

  • @GelsYT
    @GelsYT 5 ปีที่แล้ว +1

    can someone tell me what is news_train["target"] for? same with the testing set
    THANKSSS

    • @manthanadmane7812
      @manthanadmane7812 5 ปีที่แล้ว +1

      Out of all the categories, we restricted ourselves to 4 categories while importing data. [By passing categories list while importing]
      Now news_train["target"] shows the same 4 categories.
      Note: "target_names" is a key in the dictionary of news_train containing only our 4 categories.
      Hope this helps.

    • @GelsYT
      @GelsYT 5 ปีที่แล้ว

      @@manthanadmane7812 THANK YOU SOO MUUUUCHHHH =) GOD BLESS YOU :)

    • @manthanadmane
      @manthanadmane 5 ปีที่แล้ว +1

      @@GelsYT Ah! No worries bud, happy learning :)

  • @sharonelijah
    @sharonelijah 5 ปีที่แล้ว +1

    Excellent & very impressive !

    • @CodeWrestling
      @CodeWrestling  5 ปีที่แล้ว

      Thanks for appreciating.
      #CodeWrestling

  • @ShaheenMirja
    @ShaheenMirja 5 ปีที่แล้ว

    My dataset is not categorical and i want to detect the novelty from news archive.Can any one help me to catch the right approach?i want to use SVM.

  • @infopedia_life_facts
    @infopedia_life_facts 5 ปีที่แล้ว +1

    This was really great. Can you make a whole video on Next word prediction .......This will help us a lot bro..

    • @CodeWrestling
      @CodeWrestling  5 ปีที่แล้ว

      Yeah, I will try to, Just for a reference, you can use word2vec skip gram model or continuous bag of words for predicting next word... Stay Tuned.. #CodeWrestling

  • @MitoVault
    @MitoVault 5 ปีที่แล้ว +1

    when I run the accuracy at 20:07 is get the following error:
    ValueError: Found input variables with inconsistent numbers of samples: [1502, 2]
    do you know how to solve this?

    • @CodeWrestling
      @CodeWrestling  5 ปีที่แล้ว

      Can you please elaborate a little more?

  • @nandeshnarayannaik3793
    @nandeshnarayannaik3793 5 ปีที่แล้ว

    Greate Content and Nicely explained

  • @vcrmartinez
    @vcrmartinez 6 ปีที่แล้ว

    Hi, really appreciated your video. Nice explanations.

  • @jyrust1713
    @jyrust1713 5 ปีที่แล้ว

    class_prior parameters in naive bayes what mean? i dont understand in documentation

  • @mauriciorey8609
    @mauriciorey8609 2 ปีที่แล้ว

    Very nice video. Question, besides writing less code, what do I gain using the pipeline method? Do I gain computational time?

  • @nanlirmullah7944
    @nanlirmullah7944 4 ปีที่แล้ว

    Nice one! Well done!!

  • @Nadhine
    @Nadhine 5 ปีที่แล้ว +1

    i'm using python django and my datasets were stored in django db how can I use that dataset as the training datasets aside from creating a folder put the documents there

    • @Eebbistuu
      @Eebbistuu 5 ปีที่แล้ว

      I am also looking for such activity

  • @ripandeb7227
    @ripandeb7227 5 ปีที่แล้ว +1

    How to use the model( pickle file) in a separate module to predict a new set of data. How to transform the new data to be predicted?

    • @Kiddzzvideos
      @Kiddzzvideos 4 ปีที่แล้ว

      Hi, I have same doubt. can you help me with this? How to predict new dataset?

  • @kekkettoful
    @kekkettoful 3 ปีที่แล้ว

    hello could you also show an implementation of a binary neural network always with this dataset?

  • @ashrafulhoquemiraj7294
    @ashrafulhoquemiraj7294 5 ปีที่แล้ว

    how they are preparing text file as a dataset inside training & test data.what us the format.

  • @rap60d
    @rap60d 5 ปีที่แล้ว +1

    Very helpful, thanks and God bless

    • @CodeWrestling
      @CodeWrestling  5 ปีที่แล้ว

      Thanks a lot. Stay Tuned. #CodeWrestling

  • @amitmalik_who
    @amitmalik_who 5 ปีที่แล้ว

    I want to label Quora question of dataset around 30k rows in CSV .
    How would you test this train model of 20 newsgroup on such a dataset .
    Much needed help / any vedio on this will be very helpful .

  • @gamasacademy1416
    @gamasacademy1416 5 ปีที่แล้ว +1

    can you make a video on one example where we can learn how to use text classification multivariate problem...like i have 10 features with categorical output. out of 10 variables couple of them are text....can you please share one example how to use all 10 features applying text mining on 2 and finally use classification model to predict results...

  • @faizanali3394
    @faizanali3394 5 ปีที่แล้ว +1

    i am working Urdu news text can u guide me. i am working in python with pycharm.

  • @ojhamanvi
    @ojhamanvi 4 ปีที่แล้ว

    Are these training dataset are .txt files?

  • @mounikab9830
    @mounikab9830 4 ปีที่แล้ว

    Can I use emotion dataset of having attributes I'd, text, emotions

  • @manumathew8502
    @manumathew8502 4 ปีที่แล้ว

    Well explained

  • @moshithaarunachalam3932
    @moshithaarunachalam3932 5 ปีที่แล้ว

    Hello! Once the accuracy of Mutlinomial Naive Bayes is calculated, could you tell me how to predict the class of unseen data/test data using same classifier Mutlinomial Naive Bayes?

    • @CodeWrestling
      @CodeWrestling  5 ปีที่แล้ว

      load the new dataset, use the pipeline to classify the data in the same way as I have mentioned the video. Also, refer the code, a link is given in the description of the video

  • @ojhamanvi
    @ojhamanvi 4 ปีที่แล้ว

    I have lots of text file for model training how to train model for classifiction?

  • @umamadisha287
    @umamadisha287 3 ปีที่แล้ว

    Can you please make a video on text similarity measurements using cosine similarity ?

  • @RimujiFoods
    @RimujiFoods 4 ปีที่แล้ว

    10:20
    Feature Selection
    12:30
    Term Frequency

  • @Abhi-bv2eb
    @Abhi-bv2eb 5 ปีที่แล้ว +1

    please do a video on decision tree on iris dataset without using sklearn

    • @CodeWrestling
      @CodeWrestling  5 ปีที่แล้ว

      The video is on the way.. Stay Tuned

  • @abhishekraj8677
    @abhishekraj8677 4 ปีที่แล้ว

    Hey bro.. after finding the result of precision recall and F1 score. How we'll write this in text?? Will u help me out from this?

  • @sumanvey3934
    @sumanvey3934 5 ปีที่แล้ว

    hi brother i love ur explanation. but i have one question. When i tried to increase the size of target names to 6 from 4, it produces error which says Number of classes, 6, does not match size of target_names, 4. Try specifying the labels parameter
    how to solve these????

  • @manumathew8502
    @manumathew8502 4 ปีที่แล้ว

    Is there any datasets for feedback classification?

  • @sumanshu.nankana
    @sumanshu.nankana 5 ปีที่แล้ว

    Hello Bro, what is the use of using argument categories = categories while load_files()?

    • @CodeWrestling
      @CodeWrestling  5 ปีที่แล้ว +1

      Bro, by default fetch20newsgroup will take all the categories, but I want to work on only 4 categories, so I have created a list containing only those, you can give any name to this list variable. And finally assign to the fetch20 newsgroup categories. I hope, I was able to solve your query.

  • @nuraisyah9509
    @nuraisyah9509 5 ปีที่แล้ว

    Hi. did anyone know how's to create a transliteration machine learning that can solved homograph disambiguation using python?

  • @samyaknayak5731
    @samyaknayak5731 4 ปีที่แล้ว

    Great video!!! Can we get the github code for the process which is not the magic one?

  • @saudnaeem
    @saudnaeem 5 ปีที่แล้ว

    why you are using count vectorizer and tfidif both in your implementation ? isn't tfidf enough for both of the tasks (counting and transforming)?

    • @CodeWrestling
      @CodeWrestling  5 ปีที่แล้ว

      Try to get the result without using count vectorizer, you might understand then. Anyway in the end we have used something else to make the code shorter

  • @talhatariq4069
    @talhatariq4069 5 ปีที่แล้ว

    why sci.med is not included??

    • @CodeWrestling
      @CodeWrestling  4 ปีที่แล้ว

      It was just for explanation. Not any specific reason behind that.

  • @zahrasiraj766
    @zahrasiraj766 4 ปีที่แล้ว

    hi can we have a word regarding the system specifications for machine learning

  • @ronaldalbertoromero7291
    @ronaldalbertoromero7291 5 ปีที่แล้ว +1

    Very thanks

  • @ashoknp
    @ashoknp 5 ปีที่แล้ว +1

    superb

  • @carellek1060
    @carellek1060 4 ปีที่แล้ว

    Very helpful, thank you. I have one question though, will the process be the same if some document had more than one categorie?

  • @adityaghosh8601
    @adityaghosh8601 5 ปีที่แล้ว

    Can make a playlist on Mathematics of machine learning , like probability , Linear Algebra , differentiation

  • @aarthipugal89
    @aarthipugal89 4 ปีที่แล้ว

    how do u train ur data to the model

    • @CodeWrestling
      @CodeWrestling  4 ปีที่แล้ว

      Use google colab to train the data.

  • @Kiddzzvideos
    @Kiddzzvideos 4 ปีที่แล้ว

    Hi..Amazing video! If I have new file, how to predict the category of that file.
    Can you please provide the step after the following step?
    predicted =clf.predict(X_test_tfidf)

  • @ariesjayveeganzon8774
    @ariesjayveeganzon8774 4 ปีที่แล้ว

    Question, can I use the result of MultinomialNB as an input to another machine learning? I'm doing a classification model that requires other features for prediction.
    By the way, great job for this video. Learned a lot. Appreciate it!

    • @CodeWrestling
      @CodeWrestling  4 ปีที่แล้ว

      Determining which algorithm to use, totally depends on what kind of problem it is. Sometimes other algorithms works better. So maybe first step is to understand that which kind of algorithm you should use for a particular type of dataset and then use the appropriate algorithm.
      Thanks for the appreciation.

  • @sunitareddy8717
    @sunitareddy8717 5 ปีที่แล้ว +1

    Very good explanation. Do you have videos for KNN,Decision tree,SVN models?

    • @CodeWrestling
      @CodeWrestling  5 ปีที่แล้ว

      yes we have lots of videos in queue coming soon stay tuned #codewrestling

  • @raghavendramani9795
    @raghavendramani9795 4 ปีที่แล้ว

    superb bro

  • @randythamrin5976
    @randythamrin5976 4 ปีที่แล้ว

    I can not find the dataset. by the way thank u for your video

  • @mandathejaswee9927
    @mandathejaswee9927 5 ปีที่แล้ว

    can u give explanation on different algorithms like knn algorithm, decession trees with the same data set

    • @CodeWrestling
      @CodeWrestling  5 ปีที่แล้ว

      Working on it.

    • @mandathejaswee9927
      @mandathejaswee9927 5 ปีที่แล้ว

      @@CodeWrestling I am doing a project on that data set. It will be helpful to me

  • @athulyac6599
    @athulyac6599 5 ปีที่แล้ว

    Can I use this same code in windows....???

  • @uzwalgutta
    @uzwalgutta 3 ปีที่แล้ว

    Super bro

  • @chetanahadadi3089
    @chetanahadadi3089 5 ปีที่แล้ว

    I am not getting micro avg row in the output pls help

    • @r.avinashkumar5372
      @r.avinashkumar5372 4 ปีที่แล้ว

      although you might have noticed now that according to the formula of micro avg there is no need of showing it under all the three. And it has been named to accuracy now which shows 0.83

  • @sheetalirappabetgeri9423
    @sheetalirappabetgeri9423 6 ปีที่แล้ว

    hello sir i am not getting that data set on internet can u help me for this

    • @CodeWrestling
      @CodeWrestling  6 ปีที่แล้ว

      you can find the dataset on the following link:
      qwone.com/~jason/20Newsgroups/20news-bydate.tar.gz
      #codewrestling

  • @tsegalemhailu1211
    @tsegalemhailu1211 5 ปีที่แล้ว

    It is an amazing video. Am working my thesis on Amharic language classification. could u guide me on doing so?

  • @sneakyblinder982
    @sneakyblinder982 5 ปีที่แล้ว +1

    Nice Video

  • @hailemariamtesfa8440
    @hailemariamtesfa8440 5 ปีที่แล้ว

    is very nice vedio.But what about the csv dataset.

  • @sabiyafatima56
    @sabiyafatima56 5 ปีที่แล้ว

    thanks a lot

  • @radhikathorbole6929
    @radhikathorbole6929 5 ปีที่แล้ว

    How would I get another dataset?

  • @AdityaShukla-uu8ut
    @AdityaShukla-uu8ut 5 ปีที่แล้ว

    Thanks, bro

  • @siddheshgawali7764
    @siddheshgawali7764 6 ปีที่แล้ว

    can u pls upload video for KMeans method in python

  • @amitmistry2946
    @amitmistry2946 4 ปีที่แล้ว

    Hi CodeWrestler I want to get in touch with you regarding some doubts based on a project. Could you please get back to me. Thank you

  • @GeorgeAlin
    @GeorgeAlin 6 ปีที่แล้ว

    Hi mate,
    Can you made a Naive Bayes algorithm in Machine learning Program but with Document Classification?
    Thx.

    • @CodeWrestling
      @CodeWrestling  6 ปีที่แล้ว

      Sure, I will definitely look into it #codewrestling