Principal Component Analysis (PCA): With Practical Example in Minitab

แชร์
ฝัง
  • เผยแพร่เมื่อ 25 พ.ย. 2024

ความคิดเห็น • 116

  • @alsteiner7602
    @alsteiner7602 4 ปีที่แล้ว +5

    This is clear, concise, and presented well and in a logical sequence. OUTSTANDING!

    • @learnandapply
      @learnandapply  4 ปีที่แล้ว +1

      Thank you so much for your valuable comments and appreciation 🙏

  • @thaynaalmeida7055
    @thaynaalmeida7055 ปีที่แล้ว +1

    Thank you for this simple and objective explanation!

    • @learnandapply
      @learnandapply  ปีที่แล้ว

      You're welcome!
      Thank you for your valuable comments and appreciation! 🙏☺️

  • @kausalyaakannan7064
    @kausalyaakannan7064 3 ปีที่แล้ว +15

    Universities shall pay half of the tuition fees to youtubers for delivering contents with such an awesome explanation😁 Thank you so much sir for this video. How to know sir whether we have to standardize the data based on the output?

    • @learnandapply
      @learnandapply  3 ปีที่แล้ว

      Thank you so much for your valuable comments and appreciation!
      Subject matter expertise is required in that case. If you don't have it, then need to consult with people with related expertise.

  • @jensonrozario
    @jensonrozario 3 ปีที่แล้ว +3

    Super informative video. I was looking all over the internet, finally... You did it...

    • @learnandapply
      @learnandapply  3 ปีที่แล้ว

      Thank you so much for your valuable comments and appreciation 😊🙏

  • @patrunikiran
    @patrunikiran 2 ปีที่แล้ว +1

    Thank you, indeed good example you have taken for explanation. I am a new learner for PCA

    • @learnandapply
      @learnandapply  2 ปีที่แล้ว

      That's great! Thank you so much for your valuable comments and appreciation ☺🙏

  • @mv829
    @mv829 3 ปีที่แล้ว +2

    The best explanation on TH-cam so far, thank you!!

    • @learnandapply
      @learnandapply  3 ปีที่แล้ว

      Thank you so much for your valuable comments and appreciation ☺🙏

  • @jayrajjavheri8740
    @jayrajjavheri8740 2 ปีที่แล้ว +1

    Fantastic!!.. speechless keep it up! you are serving the people.. god bless you.

    • @learnandapply
      @learnandapply  2 ปีที่แล้ว

      Comments like this make my day☺🙏
      Thank you so much for your valuable comments and appreciation 🙏☺

  • @jdo9102
    @jdo9102 4 ปีที่แล้ว +2

    Vijay, thanks for your invaluable videos. I am a green belt certified now. I am looking forward for more tutorials from you up to the Black Belt .Level. Bless you

    • @learnandapply
      @learnandapply  4 ปีที่แล้ว

      Thank you so much for your valuable comments.

  • @deepakmoda3401
    @deepakmoda3401 3 หลายเดือนก่อน +1

    Superb way of teaching, Sir!

    • @learnandapply
      @learnandapply  3 หลายเดือนก่อน

      Thank you for your valuable comments and appreciation! 🙏☺️

  • @terryliu3635
    @terryliu3635 2 ปีที่แล้ว +1

    Wow! One of the best explanations on PCA!!

    • @learnandapply
      @learnandapply  2 ปีที่แล้ว

      Thank you so much for your valuable comments and appreciation ☺🙏

  • @marlonmojica7473
    @marlonmojica7473 2 ปีที่แล้ว +1

    The presentation is well explained. Very helpful to all students and instructors.

    • @learnandapply
      @learnandapply  2 ปีที่แล้ว

      Thank you for your valuable comments and appreciation 🙏😊

  • @FaizalKasim_UNG
    @FaizalKasim_UNG ปีที่แล้ว +1

    Thank you. Your tut is excellent, with clear in steps but concise

    • @learnandapply
      @learnandapply  ปีที่แล้ว

      Thank you for your valuable comments and appreciation! 🙏☺️

    • @learnandapply
      @learnandapply  ปีที่แล้ว

      Thank you for your valuable comments and appreciation! 🙏☺️

  • @ramvemula4336
    @ramvemula4336 2 ปีที่แล้ว +1

    Excellent explanation. Thank You very much.

    • @learnandapply
      @learnandapply  2 ปีที่แล้ว

      Thank you so much for your valuable comments and appreciation ☺🙏

  • @marciabelldbampaha5149
    @marciabelldbampaha5149 3 ปีที่แล้ว +3

    Good presentation and the baby music is too cute.

    • @learnandapply
      @learnandapply  3 ปีที่แล้ว

      Thank you so much for your valuable comments and appreciation 😊🙏

  • @mdmahmudulhasanmiddya9632
    @mdmahmudulhasanmiddya9632 2 ปีที่แล้ว +2

    Outstanding performance sir.Ur teaching is adorable sir. Don't say please like. U deserve more than like or subscribe.

    • @learnandapply
      @learnandapply  2 ปีที่แล้ว +1

      Comments like this make my day 🙏🙏☺
      Thank you so much for your valuable comments and appreciation 🙏☺

  • @ubhalerao
    @ubhalerao 3 ปีที่แล้ว +1

    Very useful video.
    My doubts have got cleared.

    • @learnandapply
      @learnandapply  3 ปีที่แล้ว

      That's great! Thank you so much for your valuable comments and appreciation!

  • @wangjessica1275
    @wangjessica1275 8 หลายเดือนก่อน +2

    How do you explain PC3 ? The third component has large negative associations with income, education, and credit cards, so this component primarily measures the applicant's academic and income qualifications

    • @wangjessica1275
      @wangjessica1275 8 หลายเดือนก่อน

      Does it mean increasing income, education and credit card will decrease PC3?

    • @learnandapply
      @learnandapply  8 หลายเดือนก่อน

      Please look at the contribution of income, education and credit card - it's lower 13%. Need to focus on 1st two components as they are the major contributors.

    • @qsdqdqd123
      @qsdqdqd123 7 หลายเดือนก่อน +1

      @@learnandapplyso it means that we need to drop the two original variables (income and education)? you said in the video that sometimes we need more than 90% of the variance explained = 4 principal components. But in the end we only have 2 principal components to analyze the loan applications? I’m quite confused…

    • @learnandapply
      @learnandapply  7 หลายเดือนก่อน

      It's like a Pareto principle. How much data do you want to consider for taking action?

  • @zeeshanazam5104
    @zeeshanazam5104 9 หลายเดือนก่อน +1

    very informative, really apperciated

    • @learnandapply
      @learnandapply  9 หลายเดือนก่อน

      Glad it was helpful!
      Thank you for your valuable comments and appreciation. 😊🙏
      You can also learn it in detail with my mentoring support at vijaysabale.co/multivariate

  • @ammabadi747
    @ammabadi747 4 ปีที่แล้ว +1

    Very nice explanation. Thank you very much sir

    • @learnandapply
      @learnandapply  4 ปีที่แล้ว

      Thank you so much for your valuable comments and appreciation!

  • @harasaragajadeera7940
    @harasaragajadeera7940 3 ปีที่แล้ว +1

    Very clear and informative. Keep up the good work !!!

    • @learnandapply
      @learnandapply  3 ปีที่แล้ว

      Thank you so much for your valuable comments and appreciation 😊🙏

  • @priyamishra9893
    @priyamishra9893 2 ปีที่แล้ว +1

    Very informative thank u sir

    • @learnandapply
      @learnandapply  2 ปีที่แล้ว

      Most welcome! Thank you so much for your valuable comments and appreciation 🙏😊

  • @manzoorahmad-mu3xv
    @manzoorahmad-mu3xv 2 ปีที่แล้ว +1

    Fantastic Fantastic

    • @learnandapply
      @learnandapply  2 ปีที่แล้ว

      Thank you so much for your valuable comments and appreciation ☺️🙏

  • @yamikanikaliwo2084
    @yamikanikaliwo2084 ปีที่แล้ว +1

    well explained big up buddy

    • @learnandapply
      @learnandapply  ปีที่แล้ว

      Thank you so much for your valuable comments and appreciation! 🙏☺️

  • @conanannisa1811
    @conanannisa1811 4 ปีที่แล้ว

    Terimakasih atas penjelasannya, sangat membantu

    • @learnandapply
      @learnandapply  4 ปีที่แล้ว

      Thank you so much for your valuable comments.

  • @capecoaster69
    @capecoaster69 3 ปีที่แล้ว +1

    good explanation !

    • @learnandapply
      @learnandapply  3 ปีที่แล้ว

      Thank you so much for your valuable comments and appreciation!

  • @apekshatiwari9290
    @apekshatiwari9290 3 ปีที่แล้ว +4

    Great presentation. Thank you! So we know PC1 is positively correlated with 4 variables and PC2 is negativley correlated with 2 variables. What next? What do we do with this information?

    • @learnandapply
      @learnandapply  3 ปีที่แล้ว

      Thank you for your valuable comments and appreciation 🙏😊
      Please use this information (PC1 and PC2) to the group variables as per their similarities and you can name them as a meaningful criterion to take a decision.
      I have explained it in very detail in the video. I will request you to please revisit to understand it in more detail. 😊

  • @sreelaxmib8941
    @sreelaxmib8941 ปีที่แล้ว +1

    Hi, great video, this is extremely helpful. I had a few doubts,
    1. In my data set when I do the same, I do not get eigen vector values close to the proportion value. What does that mean?
    2. I have another data set with 96 variables. Can i use this method in minitab for this high number of variables?
    3. You had said 4 principal components have been chosen, what do you do with the rest of the 3 principal components chosen?
    Thank you in advance.

    • @learnandapply
      @learnandapply  ปีที่แล้ว

      Thank you for your valuable comments and appreciation! 🙏☺️
      1. Eigenvalues and proportion are different things, but both are indicators of the contribution of the respective Principal Component. One explains the value, whereas another explains the percentage.
      2. Of course, please try it.
      3. We are selecting the most contributing Principal Components like Pareto.

  • @saynaislamdibasaynaislamdi8875
    @saynaislamdibasaynaislamdi8875 ปีที่แล้ว +1

    Thank you

    • @learnandapply
      @learnandapply  ปีที่แล้ว

      You're welcome!
      Thank you for your valuable comments and appreciation. 🙏😊

  • @navadeepkalita456
    @navadeepkalita456 3 ปีที่แล้ว +2

    Hello. I joined as a member today. Kindly let me know how do we interpret PC2 and PC3 results

    • @learnandapply
      @learnandapply  3 ปีที่แล้ว

      Hi Navadeep, thank you for being a part of the community.
      The principal components mean a category of variables that we are grouping by their similarities.
      PC2 and PC3 are the 2nd and 3rd groups of variables.

  • @shafiqulislam2663
    @shafiqulislam2663 4 ปีที่แล้ว +1

    I don't know how many thanks should I give you. From last 10 days I have seen more than 15 videos and read many papers.but none of them was easy to understand.thx,thx,thx. Is there any free version of minitab sir? And when to perform PCA and when PCoA sir?

    • @learnandapply
      @learnandapply  4 ปีที่แล้ว

      Thank you so much for your valuable comments and appreciation!
      PCA is used when your are having too many variables and you want to group them logically for easy interpretation.

  • @Shabbir2749
    @Shabbir2749 4 ปีที่แล้ว +1

    Nice work bro. Please make video on GLM

    • @learnandapply
      @learnandapply  4 ปีที่แล้ว +1

      Thank you so much for your valuable comments.
      Sure, I will do it in future videos.

    • @Shabbir2749
      @Shabbir2749 4 ปีที่แล้ว

      @@learnandapply thanks all the best

  • @NicholeRojas-r8i
    @NicholeRojas-r8i 2 ปีที่แล้ว +1

    Hello! what criteria do you use to eliminate outliers?

    • @learnandapply
      @learnandapply  2 ปีที่แล้ว

      This is based on Mahalanobis distance. The Mahalanobis distance measures the distance from each point in multivariate space to the overall mean or centroid, utilizing the covariance structure of the data.

  • @omerfarukunal110
    @omerfarukunal110 3 ปีที่แล้ว +1

    Great Presentation, I have a question. My matrix has 162*2076 dimensions. Can I analyze this matric in minitab? How can I do ?

    • @learnandapply
      @learnandapply  3 ปีที่แล้ว +1

      Thank you for your valuable comments and appreciation ☺🙏
      Please use Factor Analysis for this analysis. Use the path: Minitab-Stat-Multivariate

    • @omerfarukunal110
      @omerfarukunal110 3 ปีที่แล้ว

      @@learnandapply Thank you, matrix is the BOM List ( Products * Materials). So, I'm not sure to use factor analysis. Actually, I want to do k-means but you know see again dimension error :(

  • @nathalieramos5942
    @nathalieramos5942 3 ปีที่แล้ว +1

    Thank you very much I am from Peru and it helped me a lot. I just have one question, how can I assign weights to my variables. Can the highest results for component one be the weights for my financial stability indicators? and if so, what would I do with the values that come out with a negative sign? Thank you so much for everything.

    • @learnandapply
      @learnandapply  3 ปีที่แล้ว +1

      Thank you so much for your valuable comments and appreciation 🙏☺.
      Yes, absolutely. The components having higher eigen values need to be select first. The negative sign indicates that it is impacting adversely.

    • @nathalieramos5942
      @nathalieramos5942 3 ปีที่แล้ว +1

      @@learnandapply THANKS FOR YOUR HELP :)

    • @learnandapply
      @learnandapply  3 ปีที่แล้ว

      You're welcome and thank you for your valuable comments ☺🙏

  • @localguy123
    @localguy123 3 ปีที่แล้ว +1

    I have a large Dataset consisting of two variables, Voltage and Time. Can I do PCA on it?

    • @localguy123
      @localguy123 3 ปีที่แล้ว +1

      And can we do curvilinear component analysis?

    • @learnandapply
      @learnandapply  3 ปีที่แล้ว

      For 2 variables with large data set, you won't need to go for PCA. Just use a regression model. If you want it in more detail, please use nonlinear regression.

  • @mdmahmudulhasanmiddya9632
    @mdmahmudulhasanmiddya9632 2 ปีที่แล้ว +1

    Sir in .41mint in this vedio shows the correlation between original variable and PCA component.or it is different thing.please reply sir.

    • @learnandapply
      @learnandapply  2 ปีที่แล้ว

      We are grouping variables as per their similarities for easy understanding and interpretation of results. This grouping is called as Principal Components.

    • @mdmahmudulhasanmiddya9632
      @mdmahmudulhasanmiddya9632 2 ปีที่แล้ว +1

      @@learnandapply this is Eigen vectors

    • @learnandapply
      @learnandapply  2 ปีที่แล้ว

      Yes, this is weighted and grouped based on eigenvalues 👍

    • @mdmahmudulhasanmiddya9632
      @mdmahmudulhasanmiddya9632 2 ปีที่แล้ว +1

      @@learnandapply thank u sir

  • @uzmafatima2588
    @uzmafatima2588 3 ปีที่แล้ว +1

    Sir I have indoor air pollution data of 9 pollutants, and questionnaire data of households(socioeconomics, house features and product,health conditions )....how can I use this for my data ....Kindly please guide.

    • @learnandapply
      @learnandapply  3 ปีที่แล้ว

      Hi Uzma, you can use both the options Factor Analysis or Principal Components Analysis in this case.
      If you have some response y, on that you want to see the impact of all these 9 pollutants, then please use regression analysis in that case.

  • @FractalswithAbhi
    @FractalswithAbhi 3 ปีที่แล้ว +1

    How can I export the output graphs from Minitab?

    • @learnandapply
      @learnandapply  3 ปีที่แล้ว

      Just right-click on the graph and export it to Word or PowerPoint.

  • @yagusti_n
    @yagusti_n 2 ปีที่แล้ว +1

    Sir....how to analyze principal components manually, and how to get eigenvector values ​​by manual calculation

    • @learnandapply
      @learnandapply  2 ปีที่แล้ว +1

      That's a great question. We can calculate them by using formulae for eigenvectors and eigenvalues. I think I should create a video on this topic.

    • @yagusti_n
      @yagusti_n 2 ปีที่แล้ว +1

      @@learnandapply I will wait for the video sir. thanks very much.

    • @yagusti_n
      @yagusti_n 2 ปีที่แล้ว +1

      @@learnandapply sir.... make a video of the manual calculation of the principal component analysis (eigenvector and eigenvalue) using the data in this video.
      Thank You so much Sir...

    • @learnandapply
      @learnandapply  2 ปีที่แล้ว

      I think this video can help you to know how it is coming: th-cam.com/video/XvV7AwLF8v4/w-d-xo.html

  • @cyrilsantos3610
    @cyrilsantos3610 3 ปีที่แล้ว +1

    Sir, how did you know what each component measures? You said that age, residence, employ, and savings have large positive loadings on component 1 so this component measures long term financial stability. How did you arrive at long financial stability? Thanks :-)

    • @learnandapply
      @learnandapply  3 ปีที่แล้ว

      In Principal components, we need to look at the variables having higher eigenvector values. So, if you look at the first principal component, the variables you had mentioned are having higher eigenvectors.
      Now, how to name them? Well, you must be a subject matter expert in that field. If not, you need to take a help from subject matter experts 😊

    • @cyrilsantos3610
      @cyrilsantos3610 3 ปีที่แล้ว +1

      @@learnandapply Many thanks Sir :-)

    • @learnandapply
      @learnandapply  3 ปีที่แล้ว

      Your welcome 😊

    • @andpinto1
      @andpinto1 ปีที่แล้ว

      Your answer is sound, because eigenvectors only tell you how relevant the variables in the overall variance. So what you do is checking which have the highest eigenvectors and go check independently what they correlate to. This depends on your expertise. Here, the Manager would see he/she would have to look into residence, employment, age and savings. That´s as far as PCA goes. It also tells you which samples are more similar, ie cluster together.

  • @sonamchavan9346
    @sonamchavan9346 8 หลายเดือนก่อน

    Can you please explain the MNIST Handwritten Digits with PCA

    • @learnandapply
      @learnandapply  8 หลายเดือนก่อน

      Can you please elaborate on your question? Thank you.

  • @emiliooo2877
    @emiliooo2877 2 ปีที่แล้ว +1

    te amo

  • @sankar_riser
    @sankar_riser 4 ปีที่แล้ว +1

    Sir I've a doubt can I use replicated data for one variable

    • @learnandapply
      @learnandapply  4 ปีที่แล้ว

      Thank you for your valuable comments.
      Try not to use replicated data. It will create an error.

  • @nabilanursafha
    @nabilanursafha 4 ปีที่แล้ว +1

    Im still confused how did u know the variabel correlate with the principal component? It bases on proportion? So the nearst variable to propotion is correlated?

    • @learnandapply
      @learnandapply  4 ปีที่แล้ว

      Please check for the highest values of the eigenvectors.

  • @pravakirandash9758
    @pravakirandash9758 3 ปีที่แล้ว +1

    Thanks for the video,Sir..How can I install minitab software? Is it free.

    • @learnandapply
      @learnandapply  3 ปีที่แล้ว +1

      Please check my video on statistical software to get all the details. This is a 30-Days FREE trial.
      Anything else that I can help with?

    • @pravakirandash9758
      @pravakirandash9758 3 ปีที่แล้ว +1

      @@learnandapply I will check, Sir.. Thanks a lot for your help.

  • @razheer100
    @razheer100 4 ปีที่แล้ว

    My issue is that I downloaded minitab express per my Universities free trial. Yet under stats, no multivariate option is available to do a PCA. Any suggestions?

    • @learnandapply
      @learnandapply  4 ปีที่แล้ว +1

      Please try reinstalling it, otherwise, this software is with some fewer options.

  • @ytubeleo
    @ytubeleo 2 ปีที่แล้ว

    The same Christmas song again on repeat?! Other than this it was very good.

    • @learnandapply
      @learnandapply  2 ปีที่แล้ว

      Thank you for your valuable comments. This is a video uploaded a year before. Sorry for the inconvenience caused to you. 🙏🙏

  • @DevanshiHingrajiya
    @DevanshiHingrajiya ปีที่แล้ว

    can you please share the data

    • @learnandapply
      @learnandapply  ปีที่แล้ว

      Thank you for your interest in learning this important topic and your valuable comments.
      For in-detail learning of this topic with data, notes, videos, and my mentoring support, please visit -
      vijaysabale.co/multivariate

  • @recepgunyuz3121
    @recepgunyuz3121 4 ปีที่แล้ว +1

    where is the scores ?

    • @learnandapply
      @learnandapply  8 หลายเดือนก่อน

      These are eigenvector values. You can get it from PCA output table.