yuzaR Data Science
yuzaR Data Science
  • 70
  • 439 138
ROC Curves, AUC & Optimal Cutoffs: Master Decision-Making in Machine Learning & Medicine!
IF YOU WOULD LIKE TO SUPPORT, PLEASE JOIN THE CHANNEL: th-cam.com/channels/cGXGFClRdnrwXsPHi7P4ZA.htmljoin
IF YOU WANT JUST RAW R CODE PLEASE JOIN THE CHANNEL AND ASK ME TO POST A CODE ON THE COMMUNITY SPACE: th-cam.com/channels/cGXGFClRdnrwXsPHi7P4ZA.htmljoin
Enjoy! 🥳
Welcome to my VLOG! My name is Yury Zablotski & I love to use R for Data Science = "yuzaR Data Science" ;)
This channel is dedicated to data analytics, data science, statistics, machine learning and computational science! Join me as I dive into the world of data analysis, programming & coding. Whether you're interested in business analytics, data mining, data visualization, or pursuing an online degree in data analytics, I've got you covered. If you are curious about Google Data Studio, data centers & certified data analyst & data scientist programs, you'll find the necessary knowledge right here. You'll greatly increase your odds to get online master's in data science & data analytics degrees. Boost your knowledge & skills in data science and analytics with my engaging content. Subscribe to stay up-to-date with the latest & most useful data science programming tools. Let's embark on this data-driven journey together!
มุมมอง: 1 988

วีดีโอ

Multivariable Logistic Regression in R: The Ultimate Masterclass (4K)!
มุมมอง 5K2 หลายเดือนก่อน
IF YOU WOULD LIKE TO SUPPORT, PLEASE JOIN THE CHANNEL: th-cam.com/channels/cGXGFClRdnrwXsPHi7P4ZA.htmljoin IF YOU WANT JUST RAW R CODE PLEASE JOIN THE CHANNEL AND ASK ME TO POST A CODE ON THE COMMUNITY SPACE: th-cam.com/channels/cGXGFClRdnrwXsPHi7P4ZA.htmljoin Enjoy! 🥳 Welcome to my VLOG! My name is Yury Zablotski & I love to use R for Data Science = "yuzaR Data Science" ;) This channel is dedi...
Not Linear Relationship Between Numeric Predictor and Binary Outcome in Logistic Regression (4K)
มุมมอง 1.9K4 หลายเดือนก่อน
IF YOU WOULD LIKE TO SUPPORT, PLEASE JOIN THE CHANNEL: th-cam.com/channels/cGXGFClRdnrwXsPHi7P4ZA.htmljoin IF YOU WANT JUST RAW R CODE PLEASE JOIN THE CHANNEL AND ASK ME TO POST A CODE ON THE COMMUNITY SPACE: th-cam.com/channels/cGXGFClRdnrwXsPHi7P4ZA.htmljoin Enjoy! 🥳 Welcome to my VLOG! My name is Yury Zablotski & I love to use R for Data Science = "yuzaR Data Science" ;) This channel is dedi...
Mastering Logistic Regression with Categorical Predictors: Always Positive Odds Ratios (4K)
มุมมอง 2.4K4 หลายเดือนก่อน
IF YOU WOULD LIKE TO SUPPORT, PLEASE JOIN THE CHANNEL: th-cam.com/channels/cGXGFClRdnrwXsPHi7P4ZA.htmljoin IF YOU WANT JUST RAW R CODE PLEASE JOIN THE CHANNEL AND ASK ME TO POST A CODE ON THE COMMUNITY SPACE: th-cam.com/channels/cGXGFClRdnrwXsPHi7P4ZA.htmljoin Enjoy! 🥳 Welcome to my VLOG! My name is Yury Zablotski & I love to use R for Data Science = "yuzaR Data Science" ;) This channel is dedi...
Logistic Regression Basics Explained: Probabilities, Odds, Odds-Ratios and Log-Odds-Ratios (4K)
มุมมอง 2.6K5 หลายเดือนก่อน
IF YOU WOULD LIKE TO SUPPORT, PLEASE JOIN THE CHANNEL: th-cam.com/channels/cGXGFClRdnrwXsPHi7P4ZA.htmljoin IF YOU WANT JUST RAW R CODE PLEASE JOIN THE CHANNEL AND ASK ME TO POST A CODE ON THE COMMUNITY SPACE: th-cam.com/channels/cGXGFClRdnrwXsPHi7P4ZA.htmljoin Enjoy! 🥳 Welcome to my VLOG! My name is Yury Zablotski & I love to use R for Data Science = "yuzaR Data Science" ;) This channel is dedi...
Exact Binomial Test Explained! + Real-World Example: Counting Trash in the Baltic Sea 📊🌊🔬(4K)
มุมมอง 1.6K6 หลายเดือนก่อน
IF YOU WOULD LIKE TO SUPPORT ME, JOIN THE CHANNEL: th-cam.com/channels/cGXGFClRdnrwXsPHi7P4ZA.htmljoin The Exact Binomial Test is a simple yet powerful technique that every data scientist should have in their toolbox. In this video, we’ll explore why we need the Exact Binomial Test and examine a real-world application where I used it to publish a scientific paper on encounters of marine litter ...
Data Reveals | How to be Successful and Happy | How to avoid being Poor and Unhappy (4K)
มุมมอง 1.1K7 หลายเดือนก่อน
For more details and R code consider Joining the channel Enjoy! 🥳 Welcome to my VLOG! My name is Yury Zablotski & I love to use R for Data Science = "yuzaR Data Science" ;) This channel is dedicated to data analytics, data science, statistics, machine learning and computational science! Join me as I dive into the world of data analysis, programming & coding. Whether you're interested in busines...
Multivariable Linear Regression in R: Everything You Need to Know!
มุมมอง 7K8 หลายเดือนก่อน
The world is complex and messy because multiple factors constantly affect each other. That’s why univariable models fail to describe complex relationships. In this video, we’ll explore multivariable models, which provide a more accurate representation of reality. Expect to learn how to effectively visualize model results, how to extract the most knowledge out of multivariable models, how to int...
9 FLAWS of ‘Summary’ Function You DIDN’T Know About and How to Fix Them
มุมมอง 2.5K9 หลายเดือนก่อน
Exploring how one categorical predictor affects a numeric outcome is another way of saying - we’re comparing several groups. While ANOVA is a common approach, simple linear regression delivers more insights. Expect to learn how to maximize inference from your model, why famous “summary” function does’t provide a good summary and what are the best alternatives for it. The cartoon illustrations f...
Master Simple Linear Regression with Numeric Predictor in R
มุมมอง 2.2K9 หลายเดือนก่อน
Simple linear regression demonstrates how one numeric predictor affects a numeric outcome. For example, it can reveal whether age actually translates to higher paychecks. So, let’s learn (1) how to build a linear regression in R, (2) how to check ALL model assumptions with a ONE simple and intuitive command, (3) how to visualize and interpret the results, and much more. If you only want the cod...
Quantile Regression Reporting Made Easy: How to Create Stunning Plots and Tables in Minutes!
มุมมอง 3.9K11 หลายเดือนก่อน
In the previous episode, I presented four reasons why Quantile Regression (QR) is a better alternative to classic linear regression. However, I discovered that reporting QR results can be quite demanding. To make the process easier, I created better plots for model estimates and predictions, a comprehensive table of model results, including contrasts between groups and p-values. I found this co...
Make Multiplots Like a Pro with {patchwork} | R package reviews
มุมมอง 3.2K11 หลายเดือนก่อน
The Patchwork package makes it incredibly easy to combine separate plots into the same graphic by using the simplest mathematical operators, such as plus ( ), slash (/), parentheses and much more. If you only want the code (or want to support me), consider join the channel (join button below any of the videos), because I provide the code upon members requests. Enjoy! 🥳 Welcome to my VLOG! My na...
Master Box-Violin Plots in {ggplot2} and Discover 10 Reasons Why They Are Useful
มุมมอง 3.7Kปีที่แล้ว
Boxplots display a wealth of useful information about the dataset. In this video, we'll start with the most basic boxplot, build every part of this notched box-violin plot in {ggplot2} step by step, and understand why every detail matters 😉 If you only want the code (or want to support me), consider join the channel (join button below any of the videos), because I provide the code upon members ...
7 Reasons to Master Scatter Plots in {ggplot2} with World Happiness Data
มุมมอง 2.6Kปีที่แล้ว
In this video, we’ll explore happiness data and uncover seven compelling reasons why scatter plots are indispensable for data analysis. You’ll learn about (1) whether money can actually make you happy, (2) how wealth has changed in the USA, Germany, India, and Venezuela over the past 20 years, (3) whether happy people live longer, and much more. The results might surprise you 😉 If you only want...
Histograms and Density Plots with {ggplot2}
มุมมอง 4.3Kปีที่แล้ว
Histograms display the shape of the distribution of continuous numeric data. The distribution can be symmetrical, right-skewed, left-skewed, unimodal, or multimodal? Knowing the shape of the distribution helps us decide which statistical test is appropriate. For example, if the distribution is symmetrical, we could use a t-test or linear regression. However, if the distribution is skewed, we’d ...
Bar Charts with {ggplot2}
มุมมอง 7Kปีที่แล้ว
Bar Charts with {ggplot2}
Conditioning with {dplyr} Modify Your Data Quick
มุมมอง 1.7Kปีที่แล้ว
Conditioning with {dplyr} Modify Your Data Quick
Join Tables with {dplyr}
มุมมอง 1.3Kปีที่แล้ว
Join Tables with {dplyr}
Combine Tables with {dplyr}
มุมมอง 1.3Kปีที่แล้ว
Combine Tables with {dplyr}
Transform Your Data Like a Pro with {tidyr} and Say Goodbye to Messy Data!
มุมมอง 4.4Kปีที่แล้ว
Transform Your Data Like a Pro with {tidyr} and Say Goodbye to Messy Data!
Mastering {dplyr}: 50+ Data Wrangling Techniques!
มุมมอง 5Kปีที่แล้ว
Mastering {dplyr}: 50 Data Wrangling Techniques!
Top 10 Must-Know {dplyr} Commands for Data Wrangling in R!
มุมมอง 8Kปีที่แล้ว
Top 10 Must-Know {dplyr} Commands for Data Wrangling in R!
Don’t Ignore Interactions - Unleash the Full Power of Models with {emmeans} R-package
มุมมอง 9Kปีที่แล้ว
Don’t Ignore Interactions - Unleash the Full Power of Models with {emmeans} R-package
{emmeans} Game-Changing R-package Squeezes Hidden Knowledge out of Models!
มุมมอง 9Kปีที่แล้ว
{emmeans} Game-Changing R-package Squeezes Hidden Knowledge out of Models!
Quantile Regression as The Most Useful Alternative for Ordinary Linear Regression
มุมมอง 18K2 ปีที่แล้ว
Quantile Regression as The Most Useful Alternative for Ordinary Linear Regression
PERFECT TABLES IN #R ! 💪 {gtsummary}
มุมมอง 29K2 ปีที่แล้ว
PERFECT TABLES IN #R ! 💪 {gtsummary}
Effective Resampling for Machine Learning in Tidymodels {rsample} R package reviews
มุมมอง 5K2 ปีที่แล้ว
Effective Resampling for Machine Learning in Tidymodels {rsample} R package reviews
4 Reasons Non-Parametric Bootstrapped Regression (via tidymodels) is Better then Ordinary Regression
มุมมอง 10K2 ปีที่แล้ว
4 Reasons Non-Parametric Bootstrapped Regression (via tidymodels) is Better then Ordinary Regression
R demo | Many (Grouped / Nested) Models Simultaneously are Very Effective
มุมมอง 7K2 ปีที่แล้ว
R demo | Many (Grouped / Nested) Models Simultaneously are Very Effective
R demo | Robust Regression (don't depend on influential data)
มุมมอง 7K2 ปีที่แล้ว
R demo | Robust Regression (don't depend on influential data)

ความคิดเห็น

  • @Abdulaziz-yj1ns
    @Abdulaziz-yj1ns 7 ชั่วโมงที่ผ่านมา

    Thank you so much very informative

  • @MrThunderDataKit
    @MrThunderDataKit 22 ชั่วโมงที่ผ่านมา

    Excellent

  • @muhammadahmadkhalid364
    @muhammadahmadkhalid364 วันที่ผ่านมา

    Great to see 10k subs. Congragulations. Best of luck for 2025. Keep on making the content the way you do. Great job.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science วันที่ผ่านมา

      Thanks, Muhammad! I'll do my best to produce more videos in 2025 ;) Happy holidays and wish you also an amazing 2025!

    • @muhammadahmadkhalid364
      @muhammadahmadkhalid364 วันที่ผ่านมา

      @yuzaR-Data-Science what is the best export setting as png in ggsave, which can make things look best but also need not more making the lines bold or points thick. What will be good for a PowerPoint slide? Asking generally for any plot and for plots after patching together.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science วันที่ผ่านมา

      @@muhammadahmadkhalid364 it's more about ggplot, not ggsave settings. you build the plot first, I have actually video on every major type of plots in ggplot... and then you'll save it in the size of your choice

  • @bhagyamaduwanthi995
    @bhagyamaduwanthi995 2 วันที่ผ่านมา

    Wow, What a vedio! Concise, Simple and Perfect.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 2 วันที่ผ่านมา

      Glad you liked it! There are similar videos on other tests ;)

  • @emredunder9108
    @emredunder9108 3 วันที่ผ่านมา

    I have a short contibution: If any categorical variable exists, classical VIF values are not appropriate. Then, it would be the best to use generalize VIF values.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 2 วันที่ผ่านมา

      Nice contribution! Yeah, the gtsummary package uses GVIF be default: tbl_regression(model) |> add_vif()

  • @danilofreitas1648
    @danilofreitas1648 4 วันที่ผ่านมา

    The data need to be normal? As a linear regression?

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 3 วันที่ผ่านมา

      not really, because if you have outliers, it won't be normal. but the focus is on ourliers, not on normality of data distribution. similarly good model to handle both is quantile regression. I also have 2 videos on QR on my channel

  • @팬더-n4w
    @팬더-n4w 6 วันที่ผ่านมา

    Hi! I am eager to practice quantile regression and your script would be great value to me. Is it possible to receive the R code?

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 6 วันที่ผ่านมา

      Hello again, just answer your previous question. Sure. I just updated the link, which you'll find on the community tab. Just look for the first message on quantile regression (7 month ago) and click the wetransfer link. The message talkes only about the first video, but I uploaded PDFs for both videos on quantile regression. Enjoy and let me know whether download worked.

    • @팬더-n4w
      @팬더-n4w 5 วันที่ผ่านมา

      @@yuzaR-Data-Science Thank you for the reply. While I was studying the file, I just thought 'broom.helpers' package might answer some of your question...

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 5 วันที่ผ่านมา

      oh man, broom.helpers is amazing!!! Love it and use it inside of gtsummary::tbl_regression(). The only problem with this is - it was "experimental" a few month ago, now it's "stable", but still not "passing" like broom package for example. During the last two years broom.helpers changed a lot, so that my videos would not be evergreen anymore. But, as soon as the guys (larmarange etc.) stabilize it completely, I'll definitely make a video on broom.helpers! :)

  • @lorenzoquartaroli772
    @lorenzoquartaroli772 6 วันที่ผ่านมา

    nice video, how could i get the code ? i just started follow now

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 6 วันที่ผ่านมา

      thanks for the subscribe! I do send the PDFs with code and explanations to the members of the channel. It's the "join" button near the "subscribe" button. It asks you to support my channel with a small amount of money (starting with less than 1€ per month). But, please, don't feel obliged to do so. TH-cam is free and you can stop the video at any time and just type up the code. The "join" button is meant to show that you like my work and want me to produce more content. The PDF files is just symbolic - thanks for the folks who supports me, because, as I said, you can see the code for free at any time by pausing the video. Cheers

    • @lorenzoquartaroli772
      @lorenzoquartaroli772 5 วันที่ผ่านมา

      ​@@yuzaR-Data-Science perfect thank you so much. nice content.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 5 วันที่ผ่านมา

      you are welcome :)

  • @kristianvepsalainen1746
    @kristianvepsalainen1746 10 วันที่ผ่านมา

    Very interesting. Thanks.

  • @muhammadasadkhan9620
    @muhammadasadkhan9620 10 วันที่ผ่านมา

    yes, I have seen this video, but I don't know how to generate missing values by different mechanism like (MCAR, MAR & and MNAR) and how to test the performance of each imputation method by root mean square error and mean absolute error.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 10 วันที่ผ่านมา

      oh, yeah, I see what you mean. in this case I did not look at MCAR etc. yet, but I'll put it on the list for the future videos. until then have a look at this, and similar, articles: cran.r-project.org/web/packages/finalfit/vignettes/missing.html

  • @Abdulaziz-yj1ns
    @Abdulaziz-yj1ns 11 วันที่ผ่านมา

    Great video thanks

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 11 วันที่ผ่านมา

      Glad you enjoyed it! Thanks for watching!

  • @sa-rc8zc
    @sa-rc8zc 11 วันที่ผ่านมา

    thank you! this is great.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 11 วันที่ผ่านมา

      You are very welcome! Thanks for watching!

    • @sa-rc8zc
      @sa-rc8zc 11 วันที่ผ่านมา

      @@yuzaR-Data-Science I have a question about the video around the 8:10 mark. A line is drawn at misclassification_cost = 213, but why doesn't it go slightly lower? There are two more points below the red line where misclassification_cost can be confirmed. Could this be due to a default setting for tol_metric or something similar? Let me know if you'd like any refinements!

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 11 วันที่ผ่านมา

      deep question! I think that balancing sensitivity and specificity is also important. so that, whe the sum of sens-spec is literally maximazed (goes to those 2 points), either sensitivity or specificity will be smaller or bigger than the counterpart, while they should be very similar. again, floating cutpoint problem exists and the cutpoint could be a bit below or a bit above the "one optimal cutpoint". I never saw the issue about floating optimal cutoff raised in any paper. so, knowing this you'd have an advantage in inference ;)

    • @sa-rc8zc
      @sa-rc8zc 10 วันที่ผ่านมา

      @@yuzaR-Data-Science Thank you for your reply! In the code for cutpointr, the specified metric = misclassification_cost should take priority. This metric is calculated based on the number of False Positives (FP) and False Negatives (FN), so it seems unlikely that the logic would be influenced by considerations of the balance between sensitivity and specificity. Do you mean to suggest that the balance between sensitivity and specificity might be influencing the algorithm in a way that is independent of misclassification_cost? I would greatly appreciate your advice.

    • @sa-rc8zc
      @sa-rc8zc 10 วันที่ผ่านมา

      @@yuzaR-Data-Science After examining the data in detail, the results for the points where you drew the green line were as follows: misclassification_cost = 212 (fp = 59, fn = 153) misclassification_cost = 212 (fp = 60, fn = 152) misclassification_cost = 213 (fp = 60, fn = 153) misclassification_cost = 213 (fp = 68, fn = 145) misclassification_cost = 213 (fp = 69, fn = 144) Additionally, the optimal_cutpoint was the average of the predicted_glm values for the two points where misclassification_cost = 212. Given this, it seems logical that the result of res |> summary() should report misclassification_cost = 212 instead of 213. Wouldn't you agree?

  • @ayeshaiftikhar6450
    @ayeshaiftikhar6450 11 วันที่ผ่านมา

    Can we draw ROC for case and controls too? Just like male and female?

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 11 วันที่ผ่านมา

      sure. any groups you have in you dataset ;)

  • @ayeshaiftikhar6450
    @ayeshaiftikhar6450 11 วันที่ผ่านมา

    Kindly post RAW code

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 11 วันที่ผ่านมา

      I only provide code for the members. There is another button near "subscribe", it's name is "join". So, in this way you would support me, but you don't need to do that at all. Just stop the video and type up the code. it's even good for learning. if you decide to support me, you can have access to any code from any video I made.

  • @muhammadahmadkhalid364
    @muhammadahmadkhalid364 12 วันที่ผ่านมา

    Brother always leaves people with a touch of drama at the end of the video so we have the urge to definitely check the next episode (video) to see what happened. Great video. Also in need of a video on the basics of tidy models if you can sometime. Thank you very much.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 12 วันที่ผ่านมา

      bro, I tried to do tidymodels videos already, only got a few, because they are more for predictions, not for inference. and honestly, tidymodels did not provide enough interpretability while always made a way to the result in a special own not-completely intuitive way. one day I will definitely do videos on tidymodels, but they are less useful for inference at the moment and there are many other useful and interesting topics to cover. hope you stick along for some time until I make tidymodels content. cheers mate

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 12 วันที่ผ่านมา

      by the way, did the end work? :) I don't see it working in channels analytics, but I definetely create some suspense at the end to keep people binch-learning ;) thanks for noticing!

    • @muhammadahmadkhalid364
      @muhammadahmadkhalid364 11 วันที่ผ่านมา

      @@yuzaR-Data-Science Thank you brother. Will surely wait. But I totally agree that you cover the things that you are doing they are really more important for R users. Thank you again mate.

    • @muhammadahmadkhalid364
      @muhammadahmadkhalid364 11 วันที่ผ่านมา

      @@yuzaR-Data-Science Well, I would honestly tell, maximum people are just watching videos and go by because they don't like statistics and are just watching it for a project or assignment. Then there will be a small proportion who like statistics and but not mainly R. Then there will be even smaller of both statistics and R lovers and they wouod have time. So what you are doing is good enough. Just see the percentage of people who will be effected by it. And surely. For those who don't know what to see next but want another topic. It gets the click through for sure.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 11 วันที่ผ่านมา

      thanks mate!

  • @muhammadahmadkhalid364
    @muhammadahmadkhalid364 12 วันที่ผ่านมา

    Another great video, after watching it I realized there are many things that people get wrong as using the 1 cut-off they got from the result section and writing it into the paper. You have just super charged the analysis to a whole new level. I have just a bit difficulty in understanding the statistics theoretically because in my opinion without they one just gets lost is mis-using the tools and can make inferences that were not check or applied properly. Can you recommend a complete or enough resource that can teach about the regression, ROC and other healthcare related advanced multivariate statistics (that are in use now a days) to an extent from which we can then understand theory behind your videos and follow your channel. Any book, channel and/or video series about statistics which teaches use case properly. Not for R because we have got you.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 12 วันที่ผ่านมา

      oh man, thanks a lot for such a generous feedback! statistics is too wide of a field to have one single ressourse. There is classic stats, which I do here. There is stochastic and some folks follow it religiusly. And there is maths side of stats, which can be very beginner-unfriendly. So, what to do? If you are beginner, read many books about classic stats for non-statisticians. I am obviously biased, because I work in animal science, but my favorite is: "Statistics for veterinary and animal science". It's practical, not too wordy, but very clear and simple. Books for biologists of epidemiologist are also good for beginners. If you are advanced, go to ISLR book, it's free and online. This one is a good link between classic stats, math behind it and intro to ML and AI. If you are into stochastic and Bayesian framework - Statistical Rethinking. Main thing though: start analyzing data accepting that you will be wrong. My goal is to be less wrong every time I do something. If I don't know or doubt my analysis, it sucks in the beginning, but, as Richard Feynman said, it will show your gaps in knowledge and where you should invest more time. Ask 3 statisticians the same question and you'll often get 3 different answers. So, the answer to your question is - many ressources instead of one and don't try to be right, try to be less wrong :)

    • @muhammadahmadkhalid364
      @muhammadahmadkhalid364 11 วันที่ผ่านมา

      @@yuzaR-Data-Science Thank you, such a detailed and informative answer. Thank you for taking the time to write it. Yes indeed I wanted more of it related to health sciences. Will surely check all of these.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 11 วันที่ผ่านมา

      glad I could help. cheers

  • @jamiecorroon
    @jamiecorroon 12 วันที่ผ่านมา

    Hi, is the code for this video available?

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 12 วันที่ผ่านมา

      sure, I just renewed the link for "emmeans" post. Just go to community (members space), search for "emmeans" and klick on link. this link will be active for 3 days. there is both code and explanations for the code. let me know where you was able to get it. cheers

  • @muhammadasadkhan9620
    @muhammadasadkhan9620 12 วันที่ผ่านมา

    Hello, can you please make a video. How to generate the missing rates of data in a percentage form by different data mechanism( like MCAR, MAR and MNAR). also check check the performance by root mean square error (RMSE) and MAE. Thanks

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 12 วันที่ผ่านมา

      I already have a video an imputation of missing values. Check out this one: th-cam.com/video/Akb401i32Oc/w-d-xo.html

  • @kellycriterion1019
    @kellycriterion1019 13 วันที่ผ่านมา

    Is the ROC done for every data analysis?For example, everytime one wants to do some data analysis e.g. linear regression,logistic regression, survival analysis?

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 13 วันที่ผ่านมา

      ROC curves are primarily used to evaluate the performance of binary classification models (like logistic regression), not linear regression or survival analysis.

    • @kellycriterion1019
      @kellycriterion1019 13 วันที่ผ่านมา

      @yuzaR-Data-Science Alright,thank you.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 13 วันที่ผ่านมา

      You are welcome 🙏

  • @45tanviirahmed82
    @45tanviirahmed82 13 วันที่ผ่านมา

    Long waited video. Again, learned so much. I was using tidymodels workflow for prediction of both probability and class!! now this looks much easier. 😇

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 13 วันที่ผ่านมา

      Glad it's useful! Thank for your nice feedback and for watching! Hope the videos are not too long and not too boring. Let me know if you'd prefer shorter less dense ones

  • @marcoesteves4367
    @marcoesteves4367 14 วันที่ผ่านมา

    Excelent! This is realy next level stats!!!!

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 13 วันที่ผ่านมา

      Glad you think so! Thanks for cool feedback! :)

  • @cleandata_sk
    @cleandata_sk 14 วันที่ผ่านมา

    this video came in the best time it could. Thank you!

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 13 วันที่ผ่านมา

      Perfect! Glad it's useful and and I was lucky it was a good timing :)

  • @arkadiuszpajda5366
    @arkadiuszpajda5366 14 วันที่ผ่านมา

    Amazing video. 99/100. I only miss a detailed walkthrough of how to read each of the charts in the second half of the video. But then the video would have to be an hour long.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 13 วันที่ผ่านมา

      thanks for nice feedback! :) yes, you are right, the video would have become to long. what do you think if I would have made several separate videos, like one for multiple cutoffs, one for bootstrapping etc.? I don't to make videos for the sake of making videos ... but try to have a dense coherent high info story. but there is a trade off for everything :) so, what would you prefer?

    • @arkadiuszpajda5366
      @arkadiuszpajda5366 13 วันที่ผ่านมา

      @@yuzaR-Data-Science Several separate videos, sounds good! More videos on about model performance will always be useful. In general, the topics of model diagnostics, model performance, and variable importance are three topics I need to delve into the most right now. Your videos are full of enthusiasm and knowledge, so any material on these topics will be useful.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 13 วันที่ผ่านมา

      Have you seen my review on the performance package? That video is probably exactly what you want. It’s general though. Any kind of model has unique assumptions set. I have also several videos on few different models ready to watch. Hope they can help

  • @hikeaway1596
    @hikeaway1596 14 วันที่ผ่านมา

    I waited long time for this video. thanks!

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 14 วันที่ผ่านมา

      Hope you enjoyed it! Thanks for watching 🙏

  • @gyandeepsharma5516
    @gyandeepsharma5516 14 วันที่ผ่านมา

    It was awesome

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 14 วันที่ผ่านมา

      thanks mate! I hope you'll enjoy the second video on quantile regression too

    • @gyandeepsharma5516
      @gyandeepsharma5516 14 วันที่ผ่านมา

      @yuzaR-Data-Science please send the link of your 2nd video

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 14 วันที่ผ่านมา

      here we go: th-cam.com/video/4nJD2tpZFDs/w-d-xo.html

  • @shubhrachoudhary9450
    @shubhrachoudhary9450 15 วันที่ผ่านมา

    can robust regression - mm model be used for non normal data

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 14 วันที่ผ่านมา

      first of all - sure, it can, secondly, you don't need to because bootstrapping will work the non-normality out ;) that's the whole purpose of the method

  • @tenzinwangchuk3997
    @tenzinwangchuk3997 16 วันที่ผ่านมา

    amazing!! I have joined the channel...Please can you share the codes? it will be helpful while using the package

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 15 วันที่ผ่านมา

      thanks :) you probably only subscribed. I send the code to people, who joined. It's a different button, where people support me monthly with a small pay. but you don't need to do that at all, you just can pause the video and write down the code, it's even better for a learning purposes to tipp out the code yourself. kind regards

  • @alexiscanari8776
    @alexiscanari8776 17 วันที่ผ่านมา

    Thank you for sharing!! could you please specify if there are differences when we work with glmer for logistic mixed effects model? I was wondering particularly how to check this type of model. thank you!

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 17 วันที่ผ่านมา

      Not at all, you can use same visualization functions for both, mixed and not mixed models

  • @CROscarAbrahamJosePadillaSolis
    @CROscarAbrahamJosePadillaSolis 17 วันที่ผ่านมา

    Excellent video! I came here from the recommendation of the video on simple linear regression, and it's great. I have a question that I haven't been able to resolve. When using performance, I understand that categorical variables are analyzed by creating dummies, but I don't know how the VIF is calculated. Is there a formula, or how could we check multicollinearity for non-quantitative variables?

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 17 วันที่ผ่านมา

      sure, vif works for both numeric and categorical variables. how it's calculated - I don't know exactly, just superficial formula like 1/(1 - summary(model)$r.squared) - but I treat it like a car: I don't know how engine works, but I know how to drive. so, if your vif is below 5 or in some cases below 10, you can accept the results. when vif is above 10 you'll find some multicollinear variables (both numeric and categorical)

  • @kellycriterion1019
    @kellycriterion1019 18 วันที่ผ่านมา

    Great video❤ Please make some videos on Survival Analysis as well.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 18 วันที่ผ่านมา

      thanks you very much for the feedback! will do survival analysis with R similar to this one! I have two very old not very good and not R, but a bit theoretical videos on survival analysis on this channel. I don't think they are helpful, but you want, you could check them out.

  • @alexisdosis5524
    @alexisdosis5524 19 วันที่ผ่านมา

    Hi Sir, great video. How do we extract the invidual plots from the check_model(model) function from the patchwork package?

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 19 วันที่ผ่านมา

      good question, I have a video on performance package. check this out and you'll see the demo with answers your question :)

  • @muhammadahmadkhalid364
    @muhammadahmadkhalid364 22 วันที่ผ่านมา

    Another great video, great job Yury ❣. For medical science regression modelling. What do you think is the place of tidymodelling and their workflows (regression the tidy way using tidymodels) in medical science? Is it worth learning or it is more towards the prediction modelling side?

    • @yuzaR-Data-Science
      @yuzaR-Data-Science 22 วันที่ผ่านมา

      Good question mate! The role of tidymodels in med science is small but steadily growing. I also wanted to make more videos on tidymodels, but they don’t provide much inference. They predict but often don’t explain why and how. With images and videos in medicine they will gain more insight influence though. So it’s definitely useful to learn and I’ll cover them in the future videos too.