Understanding and Applying Factor Analysis in R

แชร์
ฝัง
  • เผยแพร่เมื่อ 1 ม.ค. 2025

ความคิดเห็น • 36

  • @alonsomadronal2919
    @alonsomadronal2919 3 ปีที่แล้ว +3

    Excelent video! Keep up with the good work, you are helping a lot of distressed students!

  • @aditiawasthi4900
    @aditiawasthi4900 3 ปีที่แล้ว +1

    Very nicely explained

  • @recordyao
    @recordyao 3 ปีที่แล้ว +1

    Good work!

  • @0banana751
    @0banana751 2 ปีที่แล้ว +1

    목소리가 너무 매력적이에요! 우와

  • @pakwidi531
    @pakwidi531 ปีที่แล้ว +1

    Ah thank you so much sir 🙏

  • @shaimaadoma
    @shaimaadoma 3 ปีที่แล้ว +1

    Thank you, that was very helpful! ^^

  • @gorraf70
    @gorraf70 2 ปีที่แล้ว +1

    thanks so much...I'm working on my thesis and this was extremely helpful... 🙃

  • @VISHESHKULSHRESHTHA3
    @VISHESHKULSHRESHTHA3 3 ปีที่แล้ว +1

    how to use output of factor analysis output/factors in clustering or other ML algo like classification?

    • @SpencerPaoHere
      @SpencerPaoHere  3 ปีที่แล้ว

      Great question!
      Until I do a topic on Factor Analysis Regression (FCR), you can take a look at this video: th-cam.com/video/H45NWCzIDkY/w-d-xo.html
      (I go over the logic on how to use the transforms for other modeling procedures -- its the same process)

  • @nataliaesquenazi7280
    @nataliaesquenazi7280 7 หลายเดือนก่อน

    After factor analysis, how we can create an index or indicator?

  • @FleurChelsea
    @FleurChelsea 2 ปีที่แล้ว

    Thank you for this video! Do you have any suggestions on how to handle missing data in factor analysis?

    • @SpencerPaoHere
      @SpencerPaoHere  2 ปีที่แล้ว

      I have just the video for you :) (data imputations)
      th-cam.com/video/MpnxwNXGV-E/w-d-xo.html

  • @TheBlomb.
    @TheBlomb. 2 ปีที่แล้ว +1

    How do you know the # of factors that you look for? Is this based from your research context, or can you read the is from the date by experimenting with more or less factors? & what if you have contrasting factor loadings, can you invert these? (I.E.: I have question on how challenging something is, and after how easy it is which is in contrast)

    • @SpencerPaoHere
      @SpencerPaoHere  2 ปีที่แล้ว +1

      I arbitrarily stated that the number of factors be 3 in this video. However, a more "scientific" way of determining number of factors is either 1) using a scree plot (elbow method) or you can check out this paper here: ruscio.pages.tcnj.edu/files/2016/08/Ruscio-Roche-2012-PA-Factor-Analysis.pdf on a more mathy way of finding out number of factors to optimize for. (factor loadings)
      You can also attempt to gridsearch for the number of factors (brute force based on model results)
      I'm not sure what you mean by contrasting loading factors?

    • @TheBlomb.
      @TheBlomb. 2 ปีที่แล้ว

      ​@@SpencerPaoHere Thank you for your response and the source. By contrasting factors I meant, variables, and I have solved the problem by inverting them. It should be sorted.
      With regards to the factor loading, after some reading, I used a parallel analysis to determine I need two factors. The new problem now is that the FA model does not satisfy RSMEA 0.9. I only achieve the satisfaction of those two requirements when I use 4 factors. What would be the best to listen to, the parallel analysis or adding/removing factors until the FA model makes sense w.r.t. RSMEA & Tucker's.
      Thanks again for your input, very much appreciated.

    • @SpencerPaoHere
      @SpencerPaoHere  2 ปีที่แล้ว +1

      ​@@TheBlomb. Haha yeah. Regarding which "better" method to use, I'd defer to the testing data. If there is a better "score" of some sorts, I'd more or less default to that specific method.
      However, if you want to avoid the black box algorithm output, and assuming they both have similar results, I'd go with the method that has the most explanatory power.
      Could you do some additional analysis and try and determine what the factors might represent your original features and determine if the representations make sense?

  • @Vhizooo
    @Vhizooo 4 หลายเดือนก่อน

    Hi, I'm having trouble using the data set in the link. It downloads as a .arff file and I've tried using both foreign and Weka packages to import the data with no success

    • @Vhizooo
      @Vhizooo 4 หลายเดือนก่อน

      Fixed this. Have to manually change the data types in the .arff file to Numeric from integer

  • @intentionallivingph7279
    @intentionallivingph7279 3 ปีที่แล้ว

    Do you have a video explaining Tucker’s? Thanks

    • @SpencerPaoHere
      @SpencerPaoHere  3 ปีที่แล้ว

      Hmm I do not. I am not even sure what Tucker's is haha. Sounds very niche.

  • @jaelee6093
    @jaelee6093 3 ปีที่แล้ว +1

    Why would you consider absolute BIC values only? Whether the values are positive or negative, you should consider the model that gives you smaller BIC: see en.wikipedia.org/wiki/Bayesian_information_criterion. Please let me know if I misunderstood anything and thanks in advance for your input. And thanks for the great tutorial.

    • @SpencerPaoHere
      @SpencerPaoHere  3 ปีที่แล้ว +1

      Great question! Sometimes, you have negative BIC values. You'd want to take the absolute value of these values to have an 'objective' valuation of this model with other models that you have. If you have a -inf BIC value.. this is as 'bad' as a +inf value.

  • @ramziraspaty6854
    @ramziraspaty6854 2 ปีที่แล้ว

    I have a question. So, you arbitrarily stated the number of factors you used in this video and you said that a more 'scientific' way of determining numbers of factors could be done by using scree plot (elbow method). You know that some data don't have a good scree plot diagram, right? If that's the case, can I use like the Eigenvalue thing? Thank you so much!

    • @SpencerPaoHere
      @SpencerPaoHere  2 ปีที่แล้ว

      Yes! You can use just the eigenvalues to determine if you want to keep X features. If the eigenvalue < 1 can be eliminated. However, this approach is more appropriate for smaller models and can restrict larger models.

    • @ramziraspaty6854
      @ramziraspaty6854 2 ปีที่แล้ว

      @@SpencerPaoHere Ah now I get it. THANK YOU SO MUCH! My lecturer had given me this material but I didn't get a single thing she said T_T

  • @tomthomas9138
    @tomthomas9138 ปีที่แล้ว

    while applying the pa and ml method the values for the h2 and u2 is not showing and also i have 10 variables but for 3 factors the cumulative variance is 0.43. so what to do?

    • @SpencerPaoHere
      @SpencerPaoHere  ปีที่แล้ว

      What is the variance explained with you add up the 4th factor? Also, you might have NaN values in your dataset which may have led to not having any h2 and u2 values? (speculating)

  • @Mayorex_21
    @Mayorex_21 2 ปีที่แล้ว

    My analysis for
    factor analysis is returning NA- not available

  • @sumbomarcus
    @sumbomarcus 3 ปีที่แล้ว +5

    His voice

  • @gaoelnlaojehc8913
    @gaoelnlaojehc8913 3 ปีที่แล้ว

    where did u learn bro?

    • @SpencerPaoHere
      @SpencerPaoHere  3 ปีที่แล้ว +2

      The internet :P

    • @gaoelnlaojehc8913
      @gaoelnlaojehc8913 3 ปีที่แล้ว

      @@SpencerPaoHere Any specific sites or blogs sir? BTW I like ur vids man... pls continue

    • @SpencerPaoHere
      @SpencerPaoHere  3 ปีที่แล้ว +2

      @@gaoelnlaojehc8913 I appreciate your kind words. :) I usually start off at .edu websites. Stanford has a lot of great theory related to ML. Medium articles have application oriented scripts that can hammer home understanding. And, of course there is stack overflow.

    • @gaoelnlaojehc8913
      @gaoelnlaojehc8913 3 ปีที่แล้ว

      @@SpencerPaoHere Thanks man. Appreciate it

  • @unpopularmoviereviews3622
    @unpopularmoviereviews3622 ปีที่แล้ว

    Half of me is thinking "You're just making this all up" haha