Logistic regression : the basics - simply explained

แชร์
ฝัง
  • เผยแพร่เมื่อ 23 ม.ค. 2025

ความคิดเห็น • 59

  • @ivicaacivi6579
    @ivicaacivi6579 3 ปีที่แล้ว +13

    Dear Editor of the TileStats series, sincere thanks for this video! I have read and heard many different explanations of the logistic regression model, but never really understood the intuition behind it. This is greatly done, I finally understood the sense of the model. I look forward to see other videos of yours.

    • @tilestats
      @tilestats  3 ปีที่แล้ว +1

      Thank you!

  • @stellazhou2834
    @stellazhou2834 2 ปีที่แล้ว +2

    Your illustration is easy to understand and also cover all the important point! And the subtitle is extremely helpful for a non native English speakers like myself.

  • @nikeforo2612
    @nikeforo2612 3 ปีที่แล้ว +6

    OMG you made it so enjoyable and easy to follow. I re-learned in few minutes what it took me hours over hours to understand reading relevant literature. Thanks a lot

  • @SnoopTomm
    @SnoopTomm 2 ปีที่แล้ว +1

    This channel is pure gold. Very clear explanation, thank you.

    • @tilestats
      @tilestats  2 ปีที่แล้ว +2

      Thank you!

  • @sisibocitytv
    @sisibocitytv 9 หลายเดือนก่อน +1

    The best explanation ever encountered.

  • @ratnakarbachu2954
    @ratnakarbachu2954 3 ปีที่แล้ว +3

    Great job and really awesome videos.
    We owe you and god bless to u and ur's family.

    • @tilestats
      @tilestats  3 ปีที่แล้ว +1

      Thank you!

  • @gr8potatosaurusofthunderfart
    @gr8potatosaurusofthunderfart 6 หลายเดือนก่อน +1

    does all scenarios probability form the sigmoid curve when plotted ?

  • @FarizDarari
    @FarizDarari 3 ปีที่แล้ว

    Many many thanks for this wonderful video with clear explanation!

    • @tilestats
      @tilestats  3 ปีที่แล้ว +1

      Thank you!

  • @helenadesoba8894
    @helenadesoba8894 2 ปีที่แล้ว

    You did a great job with your explanation. Thanks a lot.

  • @roopaperuri6721
    @roopaperuri6721 3 ปีที่แล้ว +1

    Very clearly explained... Thank you 🥰

  • @basbees
    @basbees 2 ปีที่แล้ว

    Amazing and very simple to understand, thanks for this great video :)

    • @tilestats
      @tilestats  2 ปีที่แล้ว +1

      Thank you!

  • @giovannibrufani3603
    @giovannibrufani3603 6 หลายเดือนก่อน

    Nice video. I have a question about using logistic regression with low prevalence (23:25): does NPV decrease, due to so many false negatives? However, in the example of the video dedicated to PPV and NPV, false negatives decrease and false positives increase with low prevalence

    • @tilestats
      @tilestats  6 หลายเดือนก่อน

      Do you mean low prevalence in the sample or in the population? With a low prevalence in the sample, you can adjust for this by changing the cutoff value.

    • @giovannibrufani3603
      @giovannibrufani3603 6 หลายเดือนก่อน

      @@tilestats ok, i'm agree. Is PPV calcolated considering prevalence in population? or in the sample? In last case, should I take into account the prevalence of the population when i'm sampling?

    • @giovannibrufani3603
      @giovannibrufani3603 6 หลายเดือนก่อน

      @@tilestats Please, tell me if I'm right: even considering a low prevalence in the population, I take a sample with a prevalence of 50% and I set the cutoff value that maximizes accuracy. Finally, I calculate PPV considering the prevalence in the population.

    • @tilestats
      @tilestats  6 หลายเดือนก่อน

      @giovannibrufani3603 yes, sounds right to me. I would also try to calculate the accuracy based on a test data set that I explain in the video about validation.

    • @giovannibrufani3603
      @giovannibrufani3603 6 หลายเดือนก่อน

      @@tilestats Sure. I'm not missing any videos in the playlist. Thank you very much for your work and for clarifyng my doubt!!!

  • @izb1275
    @izb1275 9 หลายเดือนก่อน +1

    Amazing video and explanation

  • @matilda7570
    @matilda7570 หลายเดือนก่อน

    where did you get the b0 and b1?,

  • @গোলামমোস্তফা-শ৮থ
    @গোলামমোস্তফা-শ৮থ 7 หลายเดือนก่อน

    How can we estimate the parameters of this model?
    Can we just use ols method by using the linear model (b+b1.x)? Which is used as power of "e" here?

    • @tilestats
      @tilestats  7 หลายเดือนก่อน

      No, have a look at this video:
      th-cam.com/video/J0yuLu3oLuU/w-d-xo.html

  • @koustubhmuktibodh4901
    @koustubhmuktibodh4901 6 หลายเดือนก่อน

    Sir, how much of statistics is required for the business analytics program?

  • @raghuveerbongu
    @raghuveerbongu 3 ปีที่แล้ว +1

    Great videos can I have the slides to refer with the transcript

    • @tilestats
      @tilestats  3 ปีที่แล้ว +2

      I'm planning to put the lectures as pdfs on my homepage after the summer.

  • @AhmedShaaban1
    @AhmedShaaban1 ปีที่แล้ว

    Thanks a lot for the videos ... very helpful. Wondering if the data used in this video is available to download to replicate the analysis being done? Thanks

    • @tilestats
      @tilestats  ปีที่แล้ว

      The data is the one you see in the video.

    • @AhmedShaaban1
      @AhmedShaaban1 ปีที่แล้ว

      Thanks, I guess I can use the data presented in the tables (middle of the video)@@tilestats

  • @WzRDxDiamond
    @WzRDxDiamond ปีที่แล้ว +1

    Thanks a lot for this great video!
    I understand how we get from probability to odds to log-odds. However, I don't understand what the purpose of this is. In maximum likelihood estimation, we adapt b1 so that the log-likelihood is maximized. But this process does not seem to depend on log-odds, right? Is log-odds only necessary for better intepretation of b0 and b1?

    • @tilestats
      @tilestats  ปีที่แล้ว +1

      You actually fit a linear model to the data, which explains why the response variable must be expressed as logged odds. See for example this page:
      arunaddagatla.medium.com/maximum-likelihood-estimation-in-logistic-regression-f86ff1627b67

    • @WzRDxDiamond
      @WzRDxDiamond ปีที่แล้ว

      @@tilestats Thanks a lot, I really appreciate your response! I have read the article and other articles from the author. However, I don't understand why it is necessary to fit a linear model to the data?

    • @tilestats
      @tilestats  ปีที่แล้ว +1

      You can fit a nonlinear model to the data, the sigmoid function in this case, but then you have to use nonlinear regression which is not that easy to work with. It is, for example, hard to find the global minimum of the error function for large nonlinear models. I'm actually working on a video about nonlinear regression.

    • @WzRDxDiamond
      @WzRDxDiamond ปีที่แล้ว

      @@tilestats Thank you, looking forward!

  • @Ruichen8104
    @Ruichen8104 2 ปีที่แล้ว +1

    super fucking clear explanation, I am so glad i learned knowledge from you sir, thank you

  • @mikeszymczuk8623
    @mikeszymczuk8623 2 ปีที่แล้ว

    How do you determine the quality of the fitted curve ?

    • @tilestats
      @tilestats  2 ปีที่แล้ว

      Not sure what you mean with quality but maybe this video might help
      th-cam.com/video/J0yuLu3oLuU/w-d-xo.html

  • @pablop.7635
    @pablop.7635 2 ปีที่แล้ว

    How can this apply to qualitative variables. For instance Im reading an article on how social determinants can affect the probability of an adolescent girl being pregnant, but I don't really get how this can be interpreted. There is for example a determinant called "Age of onset of sexual relations" and there is an "estimate value" that is negative 0. And other values are positive and so on. I don't get it. Help

    • @tilestats
      @tilestats  2 ปีที่แล้ว

      Let's say that we have a variable gender (men and women). If women are set as baseline (coded as zero), men are coded as one, then the estimated parameter say how much larger, or less, the value of the parameter is for the men compared to the women. If that value is positive, the OR is greater than one. If that value is negative, the OR is less than one (see 18:18 for how to calculate and interpret the OR).

  • @nishanttailor4786
    @nishanttailor4786 2 ปีที่แล้ว

    Thank for the amazing video!!

  • @HRVS_DZ
    @HRVS_DZ ปีที่แล้ว

    How did you get -5.75 and 2.75 ?
    I used the least square formula and I got -0.34 and 0.39 !

    • @tilestats
      @tilestats  ปีที่แล้ว

      You should use the maximum likelihood method.
      th-cam.com/video/J0yuLu3oLuU/w-d-xo.html

  • @twingsiacor8285
    @twingsiacor8285 2 ปีที่แล้ว

    What statistical software ate u referring to?

    • @tilestats
      @tilestats  2 ปีที่แล้ว

      I use R and SPSS, but other tools also work fine.

    • @twingsiacor8285
      @twingsiacor8285 2 ปีที่แล้ว

      Can u give the exact formula for ur coefficients (b0 and b1) because we badly need it for a manual computation 😭

    • @tilestats
      @tilestats  2 ปีที่แล้ว

      th-cam.com/video/J0yuLu3oLuU/w-d-xo.html

    • @tilestats
      @tilestats  2 ปีที่แล้ว

      You estimate based on maximizing the likelihood. There is no simple formula to estimate the parameters like in linear regression.

  • @SalemAdel-lf3le
    @SalemAdel-lf3le 2 ปีที่แล้ว +1

    thank you so much

  • @md.musfiqueanwar226
    @md.musfiqueanwar226 2 ปีที่แล้ว

    Do you have the slides?

    • @tilestats
      @tilestats  2 ปีที่แล้ว

      If you go to my home page www.tilestats.com, you can buy some of the vidoes as PDFs

  • @learnfrommistakes9554
    @learnfrommistakes9554 ปีที่แล้ว

    How to calculate b1 and b 0

    • @tilestats
      @tilestats  ปีที่แล้ว

      By the maximum likelihood method:
      th-cam.com/video/J0yuLu3oLuU/w-d-xo.html

  • @sufianbadar
    @sufianbadar 9 หลายเดือนก่อน

    Please check the voice of your video before uploading the video. Please increase it if it is too low.