Multiple correspondence analysis on an incomplete dataset

  • Published on 2 Dec 2024

Comments • 10

  • @christiaanpauw 11 years ago +1

    Thanks, very informative. This is a great accompaniment to the book.

  • @HussonFrancois 11 years ago

    I think the softImpute package is intended for continuous variables. With continuous variables, the first simulations we have done show better results for imputePCA.

  • @HussonFrancois 11 years ago

    I use ncp=2 because the function estim_ncpMCA returned a number of dimensions that is too large. With 0 dimensions, missing values are imputed with the "mean" of the category, so with 2 dimensions I use more information. Perhaps even more information could be used, i.e. more dimensions, but it is better to impute with too few dimensions than with too many (in the latter case, you add noise to your data).
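
A minimal R sketch of the choice described in this comment, assuming the missMDA package and its bundled `vnf` example data (the dataset is an assumption made for illustration; the comment does not say which data were used). It estimates the number of dimensions by cross-validation, then imputes with a deliberately small `ncp = 2`. The reduced `nbsim` is only there to shorten the run and is also an assumption.

```r
library(missMDA)

# Example categorical data with missing values shipped with missMDA
# (assumed here for illustration only).
data(vnf)

# Cross-validated estimate of the number of dimensions.
# nbsim is lowered from its default to keep the run short; this step can still be slow.
est <- estim_ncpMCA(vnf, ncp.min = 0, ncp.max = 5, nbsim = 10)
est$ncp  # suggested number of dimensions

# Impute with few dimensions, as the comment recommends when the
# estimate looks too large.
imp <- imputeMCA(vnf, ncp = 2)

head(imp$completeObs)  # completed categorical data set
dim(imp$tab.disj)      # completed (fuzzy) indicator matrix
```

Comparing the imputation obtained with `ncp = 2` against the one obtained with `est$ncp` is one way to see how sensitive the completed data are to this choice.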

  • @Hyakuman27 10 years ago

    Is it possible to impute values and also use supplementary variables? I can't seem to figure out how to do this...

    • @HussonFrancois 10 years ago

      It is possible to impute the data set by applying the imputeMCA function to the whole data set (treating all the variables as active) and then to perform the MCA on the completed data set (the completeObs object), declaring the supplementary variables at that stage.
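
The two steps in this reply can be sketched in R as follows, assuming the missMDA and FactoMineR packages and the `vnf` example data from missMDA. Which column is treated as supplementary is a hypothetical choice made only for illustration.

```r
library(missMDA)
library(FactoMineR)

data(vnf)  # example data from missMDA, assumed for illustration

# Step 1: impute using every variable as active.
imp <- imputeMCA(vnf, ncp = 2)

# Step 2: run MCA on the completed data, declaring supplementary variables.
# Using the last column as supplementary is an arbitrary, hypothetical choice.
sup_idx <- ncol(vnf)
res <- MCA(imp$completeObs, quali.sup = sup_idx, graph = FALSE)

head(res$eig)              # eigenvalues of the active analysis
head(res$quali.sup$coord)  # coordinates of the supplementary categories
```

Because the supplementary variable also took part in the imputation step, it is not fully independent of the active analysis; that is a consequence of the approach described in the reply, not something specific to this sketch.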

  • @inmaalvarez5172 9 years ago

    That's really useful, thanks. But when I run my data in the step
    nb

    • @HussonFrancois 9 years ago

      +Inma Alvarez
      It is difficult to help you because I have never seen this error.
      Are all your variables categorical, and do you have missing values?
      Best
      FH

    • @inmaalvarez5172 9 years ago

      +François Husson
      Yes, all the variables are factors or booleans and all of them have missing values. Thank you very much

    • @inmaalvarez5172 9 years ago

      +Inma Alvarez
      I'm sorry, Dr. Husson. I tried again and now the error is this:
      Error in apply(tabdisj[, (vec[i] + 1):vec[i + 1]], 1, which.max) :
      dim(X) must have a positive length
      Thank you very much for your time

    • @MyLab87 5 years ago

      @inmaalvarez5172 Have you fixed the problem/error? I came across the same error. Can anyone help me, please?
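
The thread does not resolve the error, so the following is only a guess at one possible cause, not a confirmed diagnosis. In base R, selecting a single column from a matrix drops it to a plain vector, and `apply()` then fails with exactly the message "dim(X) must have a positive length". In the quoted call this could happen if a variable contributes only one column to the indicator matrix, for instance a factor with a single observed category. The toy objects below (`tabdisj`, `df`) are hypothetical and use base R only.

```r
# Toy indicator matrix: variable v1 has one category, v2 has two.
tabdisj <- matrix(c(1, 1, 1,  1, 0, 1,  0, 1, 0), nrow = 3,
                  dimnames = list(NULL, c("v1_a", "v2_x", "v2_y")))

# Selecting the single column of v1 drops the matrix to a vector ...
one_col <- tabdisj[, 1:1]
is.null(dim(one_col))  # TRUE: no dimensions left

# ... so apply() stops with the message quoted in the thread.
msg <- tryCatch(apply(one_col, 1, which.max),
                error = function(e) conditionMessage(e))
msg

# With drop = FALSE the one-column matrix is kept and apply() works.
kept <- tabdisj[, 1:1, drop = FALSE]
apply(kept, 1, which.max)

# Hypothetical pre-check on a data frame: count the observed categories per
# variable (logical columns are treated as two-level factors here).
df <- data.frame(a = factor(c("x", NA, "x")),
                 b = factor(c("u", "v", NA)),
                 c = c(TRUE, NA, FALSE))
n_levels <- sapply(df, function(col) nlevels(droplevels(factor(col))))
names(n_levels)[n_levels < 2]  # variables with fewer than two observed categories
```

If such a variable turns up in real data, removing it before imputation is one thing to try. Converting logical columns to factors beforehand may also be worth testing, since the thread mentions booleans among the variables.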