Hat's of to your teaching skills sir! As a beginner, this was my 1st ML project & I have completed it successfully & understood each & everything. Now, I am pretty sure that the pure golden content is hidden deeply in TH-cam & as a viewer it's our responsibility to find it & share it with others who need it! Thanks a lot sir!
It's so satisfying when someone blessed with talent such as yours is able to simplify a project enough for the average student to get a grasp of what the project is all bout. Thank you so much!
Outstanding teaching style you possess sir, I can't thank you enough for this pure gold-like content. I appreciate your idea of bringing these project-related videos and making them understandable to all students. Thank you.
This series is really useful for beginners like me. If you could do a video on deploying these models for real-world use then it would be a great help.
man, im new to this and I cant thank you enough for this material. I understood everything you did, but i still have trouble doing this on my own due to my lack of experience in the syntax and what modules to import :/
Awesome so clear Understanding of Machine Learning....Best Machine learning Channel on youtube where we can truly learn Others are there for Views only.
in case anyone gets error in stemming function or the loc news_dataset['content'] = news_dataset['content'].apply(stemming) use the below code just before stemming function (it worked for me) news_dataset['content']=news_dataset['content'].apply(str)
if u are facing error in news_dataset['content'] = news_dataset['content'].apply(stemming) it's likely u did not add .stem at the back of port_stem so use this stemmed_content = [port_stem.stem(word) for word in stemmed_content if word not in stopwords.words("english")]
The error while applying stemming function can be solved by using news_dataset['content']=news_dataset['content'].apply(str) this line of code before applying the stemming function.
Thank you so much sir, For providing us such good videos in the field of Ai & ML. All your videos helped me a lot, my interest in this field has boosted a lot since watching your videos. Keep posting more videos like this sir you are inspiring us a lot 🥰
Dear, I have a question here. So how i can predict for a piece of new news, i mean it should be predicted directly without setting the label from ourselves. so how we can predict that.
Hi siddhardhan nice explanation Pls make some end to end project where you discuss on eda and features engineering, features selection ,remove outlier I want this type of project pls make these type of project
@@Siddhardhan I am doing my masters in Queen mary London and i was just surfing random videos ..even though i knew all the concepts in this i didn't feel like skipping a bit.. truly well explained..i have also sent u the LinkedIn request ! U will have bright future 😄
Great tutorial ! And i have a question while we were making predictive system we have used data from X_test and we said we are using this because the model have not seen these datas.But we also write a code ''X_test_prediction=model.predict(X_test)''.Didn't it see the test datas after we predicted that ? thx.
thanks for the tutorial. but can you suggest how can we related the model with realtime world. like if I pass any news title to it, it should return the Real/False.
I am having trouble applying the Logistic Regression model on the test data that is provided on the Keggle page along with the training data given. How should I go about doing that??
sir while data preprocessing an error keeps coming in the file downloaded from kaggle, ParserError: Error tokenizing data. C error: EOF inside string starting at row 1259 what to do? i have done various online solutions from stackoverflow but they arent helping
That stemming function... No error occurred But I select wrong ^ symbol What that cause... Every English later erased and symbols left.... So whoever going to try this project... Please print the content after applying stemming. To verify And thanks siddhardhan for sharing colab file❤
Bro i have used this dataset and applied random forest everything is perfect accuracy precision but ig its overfited any outside news its displaying as fake ??????
Yes! You just need to prepare the dataset accordingly, similar to the data we trained. Then train the model. You may need to do some tweaking to the model to increase it's accuracy.
@@Siddhardhan sorry, I didn't get, just an example if I have trained data of last year, how can it predict present news whether fake or real.. if someone posts IPL is postponed today, can this be predicted or confirmed as fake..
Yes. It can. But we cannot say for sure that it can predict correctly. So, what you need to do is, train a Machine Learning model with some real news dataset. Larger the dataset, better is the prediction.
And we can use a better model. Logistic regression is a basic Machine Learning model. Instead, you can use Deep Learning models like Recurrent neural networks (RNN). (RNN is used to train on text data & CNN is used to train on image data). By that way, you can make better prediction.
hello sir, when I try to download the dataset from link that you provided, it asks me to accept competition's rules before downloding file. What should I do?
If I enter latest news headlines from the internet will it be able to predict real or fake news. My college professor is asking for this feature. Please help me
Really liked the video and that recap part is very nice... I wanted to ask one thing: Can we show a graph of fake and real news? If yes, then please tell me the procedure for it! Thank you !! in advance
thanks a lot 😇 go through this to plot the data in logistic regression: stackoverflow.com/questions/46085762/sklearn-logistic-regression-plotting-probability-curve-graph
while using a different dataset for testing the accuracy i am getting error due to the total no. of words are not same in the train dataset and test data set. Can you provide a way out,please.
@siddhardhan sir I'm getting error in import bumpy as no it s telling syntax error.. Could u please help me out of that....I have tried so much it's not happening
Great work @Siddhardhan, can you suggest steps to write a simple rule based news recommendation system (Popularity & Content based) with summary metrics (like page views, timestamp etc) from analytics as inputs. Much appreciated!!
Thank you for the video !! I have a question that: 1) I f I have a news (New Query) and I want to predict that is fake or real , How can I do with this model? Regards Ankur
can anyone please explain me why merging the author and title column is done when we can directly feed the model without merging them as 2 different features to learn from
Actual accuracy parameter is like this "accuracy_score(y_true, y_pred)". But in your video, you are giving the predicted value in y_true secion and true value in y_pred section. Please check on it and let me know if am wrong
sir i am facing an issue at data preprocessing, i have uploaded the above given dataset, it is stored as train.csv.zip, but when i execute it i am getting an error like badzipfile .How can i rectify it sir?
Once you upload the zip file completely, use the below code to extract the file: # extracting the compessed Dataset from zipfile import ZipFile dataset = '/content/train.csv.zip' with ZipFile(dataset,'r') as zip: zip.extractall() when you run this, your zip file will be extracted. it will contain multiple files. in that, use train.csv alone for the project.
PC to an external monitor and I'm watcNice tutorialng tNice tutorials video on one screen and working through exactly what Michael is doing and it is a big
my accuracy score on test data is coming out to be 0.53 which is a very poor value. I copied the same code what could be the reason behing this please explain?
hey, if i make any changes in the dataset that you are providing then program will not work . why ? please tell me the solution guys my teacher are asking me to check the latest news these days and if i edit the dataset with the latest news the program will not work , please help me out tomorrow is my final practical
i got many issues like in uploading file, in train_test_split etc but it suddenly worked . i tried to run all many times , but suddenly it got worked. i don't know what to say or how it will help
Hello sir I uploaded the data set which u have provided above .....my doubt is when I uploaded it's saved as train.csv.zip file rather than train.csv ....can I complete this project ....or any error occurs
hi! you can extract the file using the following code: # extracting the compessed Dataset from zipfile import ZipFile dataset = '/content/train.csv.zip' with ZipFile(dataset,'r') as zip: zip.extractall() when you run this, your zip file will be extracted. it will contain multiple files. in that, use train.csv alone for the project.
X_train, X_test, Y_train, Y_test = train_test_split(X, Y, test_size = 0.2, stratify=Y, random_state=2) if any error arises in this section kindly once all run simultaneously then no error will arises.
Hi! You can join this Telegram group for regular updates about my videos: t.me/siddhardhan
Thank you!
Where is link of hands on data science course ?? 12:02
Sir one doubt where is output of this project ☝🏻☝🏻☝🏻
@@vimal554 output itself is shown ..either a news is fake or true.. if u can elaborate ur doubt ..let me know.. i hope i can help u 😄
Fake Supporters Detection how can we do this using ml ? need your help
@@vishalkumarpal9897 sir can u pls send fake job posting prediction document
Hat's of to your teaching skills sir! As a beginner, this was my 1st ML project & I have completed it successfully & understood each & everything. Now, I am pretty sure that the pure golden content is hidden deeply in TH-cam & as a viewer it's our responsibility to find it & share it with others who need it! Thanks a lot sir!
It's so satisfying when someone blessed with talent such as yours is able to simplify a project enough for the average student to get a grasp of what the project is all bout. Thank you so much!
Hii
You have provided so much projects while the paid courses worth of 30000 provides 6-7 projects.
Great work💯
you are most welcome 😇
Your programming part is really skilled and logics are also good i was searching for these kinds of topics
for unicodedecode error for reading csv use this:
news_dataset = pd.read_csv('train.csv', encoding="unicode_escape",error_bad_lines=False)
thanks a lot buddy....wasted more than an hour but got resolved by you
It is showing parserError how to solve
@@sweety6734 same how to solve?
@@sweety6734 news_dataset = pd.read_csv('/content/train.csv', quoting=3, on_bad_lines='skip') use this
Outstanding teaching style you possess sir, I can't thank you enough for this pure gold-like content. I appreciate your idea of bringing these project-related videos and making them understandable to all students. Thank you.
This series is really useful for beginners like me.
If you could do a video on deploying these models for real-world use then it would be a great help.
Sure thing! will do after deep learning course
@@Siddhardhan wow great.
BTW did you already started deep learning series?
man, im new to this and I cant thank you enough for this material. I understood everything you did, but i still have trouble doing this on my own due to my lack of experience in the syntax and what modules to import :/
In case u have an error “EOF inside string etc.”, just wait until the train file finishes loading. It is pretty huge
Thanks man
Awesome so clear Understanding of Machine Learning....Best Machine learning Channel on youtube where we can truly learn Others are there for Views only.
most useful project as it helps me learns a lot . Thanks alot sir
Thanks a lot Siddhardhan. I will let my friends know about your channel.
Nice tutorial. The best part is that the video doesn't have annoying ads.
Thanks 😇
in case anyone gets error in stemming function or the loc news_dataset['content'] = news_dataset['content'].apply(stemming)
use the below code just before stemming function (it worked for me)
news_dataset['content']=news_dataset['content'].apply(str)
lovely good shit man
Really appreciate your explaining each and every line. Thank you for your time.
My pleasure!😇
Another amazing project sir! Loved it.
Thank you so much for all your efforts sir ,it means alot...
if u are facing error in news_dataset['content'] = news_dataset['content'].apply(stemming)
it's likely u did not add .stem at the back of port_stem
so use this stemmed_content = [port_stem.stem(word) for word in stemmed_content if word not in stopwords.words("english")]
thanks for this but unfortunately still there is error how to fix bro please tell
Hi sir while running model.fit(x_train,y_train) output it showing LogisticRegression()
In this way empty can you tell my error
no. it's fine. it's not an error
@@Siddhardhan tq sir
Nicely done! I like the recap at the end of the video. Keep up the good work!
Thank you so much 😇
The error while applying stemming function can be solved by using
news_dataset['content']=news_dataset['content'].apply(str)
this line of code before applying the stemming function.
Can u tell me how long this stemming take ?
Thank you so much sir, For providing us such good videos in the field of Ai & ML. All your videos helped me a lot, my interest in this field has boosted a lot since watching your videos.
Keep posting more videos like this sir you are inspiring us a lot 🥰
My file is showing its too large to display .What can I do now?
Your video is incredible. but is there a means to check the prediction by inserting textual news instead of numerical index?
30:41 The shortcut for the character is ctrl + 6
Best video for Beginners
Hi, Can we able to put the exact news text and find the prediction rather than using index in predective system.if so can you explain?
Nicely explained . Can you please make a video regarding the mathematics behind Logistic regression in details
sir i really love the way u explain each and everything in detail, thank u
hello I am unable to import the data set due to "utf-8".encoding issuePlease provide the exact code for that.Nowhere it is mentioned
same
Thank you very much. Wish you all the best.
Dear, I have a question here.
So how i can predict for a piece of new news, i mean it should be predicted directly without setting the label from ourselves. so how we can predict that.
Hey even i am having same doubt do you got the solution
Hi siddhardhan nice explanation
Pls make some end to end project where you discuss on eda and features engineering, features selection ,remove outlier I want this type of project pls make these type of project
Thank you for this amazing video. I hope to use it for my project defense
You are just superb explaining each and everything properly 🙌🙏 thank you
Thanks and welcome😇
@@Siddhardhan I am doing my masters in Queen mary London and i was just surfing random videos ..even though i knew all the concepts in this i didn't feel like skipping a bit.. truly well explained..i have also sent u the LinkedIn request ! U will have bright future 😄
Thankyou so much Sir. Helped me a lot.
Good job Siddhardhan!
thanks a lot 😇
hello sir. I am getting an error in the line -news_dataset['content'] = news_dataset['content'].apply(stemming).what to do sir??
such good videos. Quality content. Thank you
Great tutorial ! And i have a question while we were making predictive system we have used data from X_test and we said we are using this because the model have not seen these datas.But we also write a code ''X_test_prediction=model.predict(X_test)''.Didn't it see the test datas after we predicted that ? thx.
Your actual test output that you are predicting is in Y_test variable. Basically your X_test_predictions are predicted Y-values. I hope it will help.
Can you tell how can i attach this code to frontend???
Did you get the solution?
thanks for the tutorial. but can you suggest how can we related the model with realtime world. like if I pass any news title to it, it should return the Real/False.
I am having trouble applying the Logistic Regression model on the test data that is provided on the Keggle page along with the training data given. How should I go about doing that??
sir while data preprocessing an error keeps coming in the file downloaded from kaggle,
ParserError: Error tokenizing data. C error: EOF inside string starting at row 1259
what to do?
i have done various online solutions from stackoverflow but they arent helping
Is this problem solved or not
Nltk.download(‘stopword’) not working
Give idea to fix that
Make sure that all the imports are correctly written
Thank u a lot! Sorry show us how we apply part of speech tagging(POS) to hate speech detection using machine learning
thanks soo muxh for this awsm explaination
That stemming function...
No error occurred
But I select wrong ^ symbol
What that cause...
Every English later erased and symbols left....
So whoever going to try this project...
Please print the content after applying stemming. To verify
And thanks siddhardhan for sharing colab file❤
Thank you so much brother. Loved this one. You taught better than my trainer.
Good contents in your channel. Just subscribed to your channel and looking forward for more hands one projects.
thanks a lot
I cannot find the dataset you used this this code, please send me
See in description
Thank you it was really helpful
Bro i have used this dataset and applied random forest everything is perfect accuracy precision but ig its overfited any outside news its displaying as fake ??????
Hi, How can this model be used for real world news..
Yes! You just need to prepare the dataset accordingly, similar to the data we trained. Then train the model. You may need to do some tweaking to the model to increase it's accuracy.
@@Siddhardhan sorry, I didn't get, just an example if I have trained data of last year, how can it predict present news whether fake or real.. if someone posts IPL is postponed today, can this be predicted or confirmed as fake..
Yes. It can. But we cannot say for sure that it can predict correctly. So, what you need to do is, train a Machine Learning model with some real news dataset. Larger the dataset, better is the prediction.
And we can use a better model. Logistic regression is a basic Machine Learning model. Instead, you can use Deep Learning models like Recurrent neural networks (RNN). (RNN is used to train on text data & CNN is used to train on image data). By that way, you can make better prediction.
But the approach is same as I have explained in the video.
hello sir, when I try to download the dataset from link that you provided, it asks me to accept competition's rules before downloding file. What should I do?
Thanx man this really helped me 😭❤️
sir how to generate a confusion matrix for this code?
can we also check the new news which is not present in the data set
yesss you can check it but the result will be based on the dataset which you have trained.
Thanks bro for all these projects
you're most welcome 😇
Amazing Project
thank you so much your video helped me a lot
Glad it helped!😇
Please tell me feasibility study (economic, technical, operational) for This project (fake news prediction)...
Please 🥺🙏 reply as soon as possible
how to download dataset file
If I enter latest news headlines from the internet will it be able to predict real or fake news. My college professor is asking for this feature. Please help me
Really liked the video and that recap part is very nice... I wanted to ask one thing: Can we show a graph of fake and real news? If yes, then please tell me the procedure for it! Thank you !! in advance
thanks a lot 😇 go through this to plot the data in logistic regression: stackoverflow.com/questions/46085762/sklearn-logistic-regression-plotting-probability-curve-graph
How we can modify this project to giving title(text) of the News as input and the model give it is fake or not fake news?
Take another variable and put the news there as string then vectorize 49:25 that to another variable, then again feed it in x_new
hello i wanna ask
which approached is using here?
is it supervised or unsuperrvised? thanks
supervised. this is a classification problem as we are classifying whether the news is real or fake
What is spesific method? Is it k means, random foress, svm or else sir?
while using a different dataset for testing the accuracy i am getting error due to the total no. of words are not same in the train dataset and test data set. Can you provide a way out,please.
Thanks 😊, just question, if we have unlabeled dataset , what we do in this case ? How can we label it ?
pd.read_csv( ' file path ', header= None )
hii @siddhardhan ....how to implement SVM and ANN in this project?
can you please try to make some projects with tensorflow and keras .
as I also want to learn those libraries
hi hope you are fine how much time it will take the excution of only text data and is there any change required to excute only the text data?
@siddhardhan sir I'm getting error in import bumpy as no it s telling syntax error.. Could u please help me out of that....I have tried so much it's not happening
u mean numpy ?
Sir can we convert text to number by using label encoding?????
Sir, when i fit the logistic regression model the error "Unknown label type: 'unknown'" is coming. I stuck at that point. What should i do.
I got an error while uploading data set(parser error).Any solution?please reply🛑🛑🛑🛑
Wait for long time data set would be loading..... Just click run run every 5 mins and check the number if it's increasing
@@plazmaplayz2499 thank u
Great work @Siddhardhan, can you suggest steps to write a simple rule based news recommendation system (Popularity & Content based) with summary metrics (like page views, timestamp etc) from analytics as inputs. Much appreciated!!
Hi! you can follow the same procedure. You just need a dataset that contains details about popularity score.
@@Siddhardhan sure will do that, have learned a lot. Thanks a ton!
Thanks a lot for your positive words 😇
hi i am getting error like "expected string or bytes-like object " in the stemming process can you help me
If we are getting new data/news, we have to convert that data through all the process before inputting in our model??
Thank you for the video !! I have a question that:
1) I f I have a news (New Query) and I want to predict that is fake or real , How can I do with this model?
Regards
Ankur
Take it as input , transform the data to Tf-idf verctorizer using transform function and then proceed with same steps.
can anyone please explain me why merging the author and title column is done when we can directly feed the model without merging them as 2 different features to learn from
How to create a web application for this project
Yes sir I have a same doubts but how
same please help us sir
Actual accuracy parameter is like this "accuracy_score(y_true, y_pred)". But in your video, you are giving the predicted value in y_true secion and true value in y_pred section. Please check on it and let me know if am wrong
Yeah. You are right. I made a mistake there. But the value won't change much...
sir but the thing is how to develop an app for this with frontend application lilke to give the random news for the project review
sir i am facing an issue at data preprocessing, i have uploaded the above given dataset, it is stored as train.csv.zip, but when i execute it i am getting an error like badzipfile .How can i rectify it sir?
Hi! your file may not be fully uploaded
Once you upload the zip file completely, use the below code to extract the file:
# extracting the compessed Dataset
from zipfile import ZipFile
dataset = '/content/train.csv.zip'
with ZipFile(dataset,'r') as zip:
zip.extractall()
when you run this, your zip file will be extracted. it will contain multiple files. in that, use train.csv alone for the project.
@@Siddhardhan okay, thank you
Hi! I got error when read trai.csv . "ParserError: Error tokenizing data. C error: EOF inside string starting at row 5976". Can you help ?
hi! use this line to read the CSV file:
news_dataset=pd.read_csv('train.csv', engine='python', encoding='utf-8', error_bad_lines=False)
What are the modules are there in this project.?
why is the text column of the dataset not taken??
PC to an external monitor and I'm watcNice tutorialng tNice tutorials video on one screen and working through exactly what Michael is doing and it is a big
great. But include data visualization of dataset in the project.
Where is the title column?why we didn’t feed it into model ?
my accuracy score on test data is coming out to be 0.53 which is a very poor value. I copied the same code what could be the reason behing this please explain?
hey, if i make any changes in the dataset that you are providing then program will not work . why ?
please tell me the solution guys my teacher are asking me to check the latest news these days and if i edit the dataset with the latest news the program will not work , please help me out tomorrow is my final practical
stemming is not working poperly. it removed almost all words. why?
i got many issues like in uploading file, in train_test_split etc but it suddenly worked . i tried to run all many times , but suddenly it got worked. i don't know what to say or how it will help
Hello sir I uploaded the data set which u have provided above .....my doubt is when I uploaded it's saved as train.csv.zip file rather than train.csv ....can I complete this project ....or any error occurs
hi! you can extract the file using the following code:
# extracting the compessed Dataset
from zipfile import ZipFile
dataset = '/content/train.csv.zip'
with ZipFile(dataset,'r') as zip:
zip.extractall()
when you run this, your zip file will be extracted. it will contain multiple files. in that, use train.csv alone for the project.
Thanks a lot sir🌸
I'm getting error after neww_dataset['content'] =news_dataset['content'].apply(stemming)
hi! mail the details to datascience2323@gmail.com
@@Siddhardhan ok sir 1min
X_train, X_test, Y_train, Y_test = train_test_split(X, Y, test_size = 0.2, stratify=Y, random_state=2)
if any error arises in this section kindly once all run simultaneously then no error will arises.
Run entire codes?
@@aslah4188 yes
Still not working I'm stuck
Like str and int type conflict that's the error thrown,
NameError: name 'train_test_split' is not defined
this error is shown and after removing this error,
TypeError: '