Data Garden
Data Garden
  • 31
  • 96 696
Multivariate Imputation for Missing Values in R - Part 2
This is a follow up video with more advanced ways of working with the MICE package for filling in missing values in your data.
If you haven't seen the first part, you can find it here:
th-cam.com/video/lFDgTr1RMho/w-d-xo.html
Also my video on for loops is over here:
th-cam.com/video/g2YwSSkGBSM/w-d-xo.html
Finally, if you want to follow this video, without having seen the last one, you can find the code here:
pastebin.com/byypkMKU
มุมมอง: 156

วีดีโอ

Multivariate Imputation for Missing Values in R
มุมมอง 226ปีที่แล้ว
This video is a follow up on the video about missing values. Here we will discuss how to get more accurate replacements for missing values. For the original video, you can go here: th-cam.com/video/jVtB16cOXus/w-d-xo.html Also, if you want to start with this tutorial without having done the previous one, you can find the code here: pastebin.com/p688SNHX If you liked this video, please consider ...
R-Studio: Dealing with Missing Values (NA) - Two Easy Ways
มุมมอง 4292 ปีที่แล้ว
In this #datascience #tutorial we will explore why missing values are a problem and take a look at two easy ways to deal with them. In this video, I make use of for loops. If you are not familiar with the concept, you can see my video on them here: th-cam.com/video/g2YwSSkGBSM/w-d-xo.html Also, I am using the pipe operator from dplyr. I made a video on it here: th-cam.com/video/McrRaix9-NM/w-d-...
Easily Style Your GGPlots with Complete Themes
มุมมอง 3362 ปีที่แล้ว
In in #RStudio #tutorial we will explore how to make your plots a bit more shiny using preconfigured themes from the ggthemes package. For more GGPlot related tutorials, check out this playlist of mine: th-cam.com/play/PLz70pddaPJ-Y250Ri9c1oXYfzXby2e_uh.html If you have any questions regarding this video or you have a suggestion for another video, please leave a comment and I will see what I ca...
Easily Select Rows in R with Dplyr Filter
มุมมอง 2362 ปีที่แล้ว
In this #rstudio #tutorial we will discover how to select rows in a data frame with an easier method than base R. If you are not familiar with the pipe operator used in the video, consider checking out this tutorial I made about it: th-cam.com/video/McrRaix9-NM/w-d-xo.html If you have any questions or suggestions, please leave them down below in the comments. Thanks for watching!
Tidy Your Code in R With This Simple Command (Pipe Operator)
มุมมอง 3002 ปีที่แล้ว
In this quick tutorial we are going to explore how to use the pipe operator from the package dplyr. This makes your data handling a bit easier and your code tidier. If you want more quick R tips, have a look at this playlist: th-cam.com/play/PLz70pddaPJ-ZvQrFFG3hVeUA0IzYNI7kR.html If you have any questions or suggestions than please leave them down below in the comments. Thanks for watching.
Easily Import Financial Data to R with Quantmod
มุมมอง 6K2 ปีที่แล้ว
In this quick #statistics #tutorial we will explore how to load financial #timeseries data into R without handling any csv or excel files using the quantmod package and yahoo finance as well as the FRED data base. Find more quick R tips here: th-cam.com/play/PLz70pddaPJ-ZvQrFFG3hVeUA0IzYNI7kR.html
Bring your GGPlots to Life with GGPlotly
มุมมอง 9762 ปีที่แล้ว
In this #RStudio #tutorial we will be using the command ggplotly from the plotly package to create an interactive htmlwidget out of any ggplot. If you are interested in more content regarding GGPlot, I have a playlist for you here: th-cam.com/play/PLz70pddaPJ-Y250Ri9c1oXYfzXby2e_uh.html As always, post your questions and comments below and I'll read all of them and try my best to follow up.
XGBoost in R Part 5 - Tuning XGBoost Linear in Caret
มุมมอง 1.2K2 ปีที่แล้ว
In this #machinelearning #tutorial we will use the caret package in R to optimise the XGBoost linear algorithm. This algorithm is particularly suited for regression problems and we will compare it to the XGBoost Tree algorithm from the previous video. You can find the video about the XGBoost Tree algorithm here: th-cam.com/video/lwv8GneTUOk/w-d-xo.html The code from the previous tutorial can be...
XGBoost in R Part 4 - Regression tuning with Caret - Machine Learning
มุมมอง 2.7K2 ปีที่แล้ว
In this #machinelearning #tutorial we are building on what we have learned in the last video. This time, we want to establish a more scientific approach to parameter tuning. We will also explore what the different parameters mean and how to use them to our advantage. You can find the last video here: th-cam.com/video/T0lcTbujsWw/w-d-xo.html You can find the code from the last video here: pasteb...
XGBoost in R Part 3 - Regression Native Implementation - Machine Learning
มุมมอง 6612 ปีที่แล้ว
In this #machinelearning #tutorial we are back with the XGBost algorithm. We are going to discover how to use the xgboost package to make predictions in a #regression problem and why it performs so much better than a linear model. If you are interested in XGBoost and want to learn more about it, make sure to subscribe to the channel, because the playlist will be extended soon. You can find the ...
XGBoost in R Part 2 - Classification Native Implementation - Machine Learning
มุมมอง 1K2 ปีที่แล้ว
In this #machinelearning #tutorial we are back with the XGBost algorithm. We are going to discover how to use the xgboost package to make predictions in a multi-class #classification problem. If you are interested in XGBoost and want to learn more about it, make sure to subscribe to the channel, because the playlist will be extended soon. Here is the complete playlist of my XGBoost tutorials: t...
XGBoost in R Part 1 - Introduction - Machine Learning
มุมมอง 8082 ปีที่แล้ว
In this video we will discover the XGBoost #machinelearning #algorithm and discuss the pros and cons of using it. I will also give an outlook on what topics regarding XGBoost will be covered in the following videos of the series. If you liked this video, you will not want to miss future videos in the series. Click subscribe above to get notified about future updates. If you have any questions r...
R-Studio Tutorial: Adding a custom legend to a plot with multiple elements
มุมมอง 8K3 ปีที่แล้ว
In this #tutorial we will discover together how to add a legend to a plot with multiple elements. For the original tutorial on how to create multiple lines in one plot go to this video: th-cam.com/video/ZY2TUTeZ-7o/w-d-xo.html For all videos related to GGPlot check out this playlist: th-cam.com/play/PLz70pddaPJ-Y250Ri9c1oXYfzXby2e_uh.html
R-Studio and GGPlot: Smoothing Functions for Line Plots
มุมมอง 3.2K3 ปีที่แล้ว
In this tutorial we will explore how to use loess and lowess smoothing functions in R and how to implement them in your line plots. You can find the previous video on how to create a line plot with two lines and dots for observations here: th-cam.com/video/ZY2TUTeZ-7o/w-d-xo.html If you haven't followed the previous tutorial, you can also find the code for it here: pastebin.com/weHsvaxC If you ...
R-Studio Tutorial: Multiple Lines in One Plot With GGPlot
มุมมอง 52K3 ปีที่แล้ว
R-Studio Tutorial: Multiple Lines in One Plot With GGPlot
R Quick Tips: Start Your Projects REALLY Fast! (Source Command)
มุมมอง 2.3K3 ปีที่แล้ว
R Quick Tips: Start Your Projects REALLY Fast! (Source Command)
R-Studio Basics 12: Binary Classification Algorithm
มุมมอง 5193 ปีที่แล้ว
R-Studio Basics 12: Binary Classification Algorithm
R-Studio Basics 11: Create your own functions - 2 easy examples!
มุมมอง 1083 ปีที่แล้ว
R-Studio Basics 11: Create your own functions - 2 easy examples!
R-Studio Basics 10: If, else and Ifelse
มุมมอง 1663 ปีที่แล้ว
R-Studio Basics 10: If, else and Ifelse
R-Studio Basics 09: Using For Loops
มุมมอง 523 ปีที่แล้ว
R-Studio Basics 09: Using For Loops
R-Studio Basics 08: Your First Regression Analysis
มุมมอง 864 ปีที่แล้ว
R-Studio Basics 08: Your First Regression Analysis
R-Studio Basics 07: Merging Data Frames
มุมมอง 954 ปีที่แล้ว
R-Studio Basics 07: Merging Data Frames
R-Studio Basics 06: The Aggregate Function - Creating Pivot-Style Analysis in R
มุมมอง 1954 ปีที่แล้ว
R-Studio Basics 06: The Aggregate Function - Creating Pivot-Style Analysis in R
R Studio Basic 05: Reordering and Selecting Rows Based on Multiple Criteria
มุมมอง 6034 ปีที่แล้ว
R Studio Basic 05: Reordering and Selecting Rows Based on Multiple Criteria
R Studio Basics 04: Rearranging Columns and Changing Column Names
มุมมอง 4014 ปีที่แล้ว
R Studio Basics 04: Rearranging Columns and Changing Column Names
R-Studio & GGPlot - Arranging Multiple Plots in a Layout
มุมมอง 9K4 ปีที่แล้ว
R-Studio & GGPlot - Arranging Multiple Plots in a Layout
R-Studio Basics 03: Vectors, Matrices, Dataframes & Lists
มุมมอง 1734 ปีที่แล้ว
R-Studio Basics 03: Vectors, Matrices, Dataframes & Lists
R-Studio Basics 02: Create Graphics with the Plot Function
มุมมอง 1914 ปีที่แล้ว
R-Studio Basics 02: Create Graphics with the Plot Function
R-Studio Basics 01: The interface, workflows & first analysis
มุมมอง 2564 ปีที่แล้ว
R-Studio Basics 01: The interface, workflows & first analysis

ความคิดเห็น

  • @aram5704
    @aram5704 หลายเดือนก่อน

    So informative videos. Where are you? We miss you

  • @samboususo6781
    @samboususo6781 3 หลายเดือนก่อน

    Thank you for making this video. You've not only shown us how to deal with missing values but you have shown multiple ways it can be done. Well done!

  • @user-yz2zx5hu5g
    @user-yz2zx5hu5g 4 หลายเดือนก่อน

    Well done. Is this comparision between XGB and Regression valid in academia (Just ask myself if this is the way to compare these different models in my thesis)?

  • @jcornish2362
    @jcornish2362 7 หลายเดือนก่อน

    This was a really useful video. Thanks for this. I wondered if you had any advice on how to connect two points with a line using the mean of different columns in a dataset? I have tried using different methods but I thought the following would work in R but it doesn't. ggplot()+ geom_point(data = wsd, mapping = aes(x = "Mud intact", y = mean(Mud_intact)), colour = "blue", size = 3)+ geom_point(data = wsd, mapping = aes(x = "Sand intact", y = mean(Sand_intact)), colour = "blue", size = 3)+ geom_line()+ geom_point(data = wsd, mapping = aes(x = "Mud hair cut", y = mean(Mud_hair.cut)), colour = "red", size = 3)+ geom_point(data = wsd, mapping = aes(x = "Sand hair cut", y = mean(Sand_hair.cut)), colour = "red", size = 3)+ geom_line()+ labs(title = "Comparison of mean worm speed with/without hairs on different substrates", x = "Worm condition and Surface type", y = "Mean Speed")+ theme_bw() Any advice greatly appreciated.

  • @michaelpappas3857
    @michaelpappas3857 7 หลายเดือนก่อน

    Awesome video! Been searching forever to find this specific information. Thank you!!

  • @user-px7fy8hl7f
    @user-px7fy8hl7f 9 หลายเดือนก่อน

    Du bist ein banger!! Vielen Dank!

  • @reshjnithin
    @reshjnithin 9 หลายเดือนก่อน

    Thanks a lot! I was breaking my head for the past 3 hours because I was getting so many errors. This saved my day and grades for my assignment.

  • @shahabikimiya
    @shahabikimiya 9 หลายเดือนก่อน

    Thanks a lot 🙏🏻

  • @user-hm8ot8ro6v
    @user-hm8ot8ro6v 10 หลายเดือนก่อน

    you are an excellent teacher! Thanks for the work~

  • @kylelauture
    @kylelauture 10 หลายเดือนก่อน

    you are my hero

  • @gregan18
    @gregan18 10 หลายเดือนก่อน

    Thanks, thats great! I want to also analyze corporate financial ratios, so I need to import financial statements from Yahoo, but i really dont know how to properly do that. Can you maybe do a short video or guide me to the right source?

    • @robertooll-vd4qe
      @robertooll-vd4qe 10 หลายเดือนก่อน

      Data garden - can you help him this question?

  • @aman201156
    @aman201156 11 หลายเดือนก่อน

    Excellent

  • @michaelogunmakin9143
    @michaelogunmakin9143 ปีที่แล้ว

    Thanks, really helped solve an issue I had

  • @remyxi
    @remyxi ปีที่แล้ว

    Thanks so much for this update! Its perfect and great that you optimize this, you rock mannn!

  • @nevinthomas5130
    @nevinthomas5130 ปีที่แล้ว

    Hi, Great Tutorial. Is there any way to the animate lines using ggplotly.

  • @Lara-zo8gn
    @Lara-zo8gn ปีที่แล้ว

    You've literally saved my butt in my statistics course at university, thank you! (:

    • @DataGarden
      @DataGarden ปีที่แล้ว

      Brilliant! Glad that it was helpful.

  • @ProfBoggs
    @ProfBoggs ปีที่แล้ว

    This is one of the few videos I have found showing how to combine ggplot and wide format data. Thank you.

  • @viantube_45526
    @viantube_45526 ปีที่แล้ว

    Thank you sir, you have saved my keister.

  • @CrispySmiths
    @CrispySmiths ปีที่แล้ว

    Thanks for the video! Do you know if there is a way to extract the model coefficients on the regression?

  • @bernardrobenson5071
    @bernardrobenson5071 ปีที่แล้ว

    Thanks but how to show the smooth lines without the points displayed

  • @Superboy_87
    @Superboy_87 ปีที่แล้ว

    Love your work! Can you make videos/playlist on How to Make DCF valuation model for a Company on Rstudio?

  • @thatgoanguy4427
    @thatgoanguy4427 ปีที่แล้ว

    Legend doesn't appear automatically

  • @matthewholmes2803
    @matthewholmes2803 ปีที่แล้ว

    How can I change the name of the legend from "colour" to "population"?

  • @ardykharabian3453
    @ardykharabian3453 ปีที่แล้ว

    I have a small R script below that creates a graph. This script only imports one .csv to .tsv file. However, I need to tweak it in a way that it can import multiple .tsv files and create overlaying graphs (transparent histogram and line) with different colours. Can you do it for me? # read in data df = read.csv("your_distribution.tsv", sep="\t") # filter Ks distribution (0.001 < Ks < 5) lower_bound = 0.001 upper_bound = 5 df = df[df$Ks < upper_bound,] df = df[df$Ks > lower_bound,] # perform node-averaging (redo when applying other filters) dff = aggregate(df$Ks, list(df$Family, df$Node), mean) # reflect the data around the lower Ks bound to account for boundary effects ks = c(dff$x, -dff$x + lower_bound) # plot a histogram and KDE on top hist(ks, prob=TRUE, xlim=c(0, upper_bound), n=50) lines(density(ks), xlim=c(0, upper_bound))

  • @wibulord926
    @wibulord926 ปีที่แล้ว

    can you link your code in the decription please

  • @sofialeitao9815
    @sofialeitao9815 ปีที่แล้ว

    I have a problem with my code. I used what you taught in this video and added this: +scale_color_discrete(name="Legenda", labels=c("Nº de gaivotas adultas","Nº de gaivotas juvenis", "Nº total de gaivotas")) to change the name of the tittle of the legend, but the colours I chose dissapeard and now it has the original automatic colours. How do I get the colours I want back??? Please help!

    • @DataGarden
      @DataGarden ปีที่แล้ว

      Hey Sofia, can you maybe post your complete complete code here in the comment? Then I can have a look. Otherwise I suggest that using the code from the video and adding "+ guides(color = guide_legend(title = "Legenda"))" might do what you are having in mind. Let me know if that works.

    • @sofialeitao9815
      @sofialeitao9815 ปีที่แล้ว

      @@DataGarden IT WORKED!!! Again, thank you so much :) Now how can I change the name of the labels (in the legend)? Here is my complete code: ggplot(data=X56195IolCle56163SofLei, mapping = aes(x=horas)) + geom_point (mapping =aes ( y=gadlt, color="gadlt")) + geom_point ( mapping = aes ( y=gjvnl, color="gjvnl"))+ geom_point ( mapping = aes( y=ngaiv, color="ngaiv"))+ labs ( title="Gráfico 1 - Variação de gaivotas ao longo do dia", x= "Tempo (min)", y= "Número de gaivotas")+ theme(plot.title = element_text(hjust = 0.5))+ guides( color = guide_legend ( title = "Legenda"))

    • @DataGarden
      @DataGarden ปีที่แล้ว

      @@sofialeitao9815 Hey Sofia, that's nice! I think you should be able to change the names of the legend items in the scale_color_manual command such as the following +scale_color_manual( values = c("gadlt"="blue","gjvnl"="orange","ngaiv"="red"), labels=c("your value 1 here", "your value 2 here","your value 2 here")) Let me know if that works.

    • @sofialeitao9815
      @sofialeitao9815 ปีที่แล้ว

      @@DataGarden It worked :) Thank you for your time and help, I was really desperate.

  • @sofialeitao9815
    @sofialeitao9815 ปีที่แล้ว

    Thank so soooo much, I spent hours looking for a way to create a legend using my data and everywhere I looked they never explained how to addapt my data so that the coding works. Thank you :)

  • @pradhandeepa
    @pradhandeepa ปีที่แล้ว

    Can we add filters in?

  • @juanrike5401
    @juanrike5401 ปีที่แล้ว

    I want insert legend, can you help me ?

    • @DataGarden
      @DataGarden ปีที่แล้ว

      I made a follow-up here: th-cam.com/video/0TzDjG8jwXE/w-d-xo.html

  • @RealismHD1
    @RealismHD1 ปีที่แล้ว

    good video

  • @ferosekhan23
    @ferosekhan23 ปีที่แล้ว

    Great Video. Thank you

  • @Alonso_Cid
    @Alonso_Cid ปีที่แล้ว

    Thanks, just what I was looking for. Do you know how to add a legend?

  • @rainstormandthundersounds4191
    @rainstormandthundersounds4191 ปีที่แล้ว

    That's great and working! and many thanks for that!! however - where is the legend? how can I know which group is the correct one?

  • @chionomys
    @chionomys 2 ปีที่แล้ว

    Many thanks

  • @DataGarden
    @DataGarden 2 ปีที่แล้ว

    I hope you liked the video and found it helpful. One thing I forgot to mention is that due to the randomness in the algorithm, your results might be different from the ones seen in the video.

    • @dominikbernard9939
      @dominikbernard9939 ปีที่แล้ว

      Do you have LinkedIn or other social media, where we can follow you? I really like your videos !

  • @alvarodel2950
    @alvarodel2950 2 ปีที่แล้ว

    Hello, and how would it be if instead of 2 graphs I wanted to make 3 or more on the same graph? Could you help me please? Thank you very much.

  • @widnaldvalcin6677
    @widnaldvalcin6677 2 ปีที่แล้ว

    🔥🔥🔥🔥

  • @mychudungco
    @mychudungco 2 ปีที่แล้ว

    Hi, I do the same but the result only shows the point and warning that geom_path: Each group consists of only one observation. Do you need to adjust the group aesthetic? It means that the line does not appear. Could you help me out? hear is my data : RCI = data.frame (year = c(2015, 2016, 2017, 2018, 2019, 2020), b1_AU = c(1.0975, 1.0998, 1.1341, 1.1386, 1.1141, 1.1598), b5_AU = c(0.9262, 0.9754, 0.9163, 0.9637, 0.9399, 0.9284), b10_CN = c(0.9201, 0.9679, 1.0682, 1.0217, 1.0690, 1.0687), b21_CH = c(1.1347, 1.1067, 1.0332, 0.9687, 0.9618, 0.9989), b43_KS = c(0.9532, 0.9278, 0.9561, 1.0208, 1.0095, 1.1224), b79_US = c(1.0332, 1.0311, 0.9370, 0.9749, 0.9929, 1.0182)) I transfer year into factor then, library(ggplot2) ggplot() + geom_point(data = RCI, aes(x=year, y = b1_AU))+ geom_line(data= RCI,aes(x=year, b1_AU))

    • @DataGarden
      @DataGarden 2 ปีที่แล้ว

      Hi, thanks for leaving a comment. I have reproduced your example and I think if you leave "year" as numeric instead of factor the code should work fine. At least it did for me. Let me know if that works.

    • @mychudungco
      @mychudungco 2 ปีที่แล้ว

      @@DataGarden Yes, you are right. It was my mistake to transform year to factor. Thanks for your well informative video and quick response. How can I contact you in case we have some small projects in the future

    • @DataGarden
      @DataGarden 2 ปีที่แล้ว

      @@mychudungco Great, happy I could help. You can contact me at data.garden@icloud.com with any follow up questions.

  • @maxiromero789
    @maxiromero789 2 ปีที่แล้ว

    Thanks! How do I get real time quotes from Google/Yahoo into R? Is there a (free) way?

    • @DataGarden
      @DataGarden 2 ปีที่แล้ว

      There is no way that I've tried and as far as I know there is no out of the box method for it. However, there are some web parsing tools for R where you can select yourself what you want to load from a website, but it would require some html knowledge so that you can analyse where on the website the data is that you need. The package "rvest" would be my first try here.

  • @simonchantack23
    @simonchantack23 2 ปีที่แล้ว

    Very cool !! Great video

  • @multitaskprueba1
    @multitaskprueba1 2 ปีที่แล้ว

    Fantastic information! You are a great professor! Thank you so much!

  • @nourailham2544
    @nourailham2544 2 ปีที่แล้ว

    Thank you for the informations What's the difference between data.frame and data_frame?!

  • @DataGarden
    @DataGarden 2 ปีที่แล้ว

    I have noticed that the audio quality in this video turned out to be a bit bad, and that the typing on the keyboard is quite loud. Apologies for that. I will try a different microphone setting next time.

  • @zahrasadatvaziri5481
    @zahrasadatvaziri5481 2 ปีที่แล้ว

    At a loss for words. It save me from a lot of headache. Much appreciated.

    • @DataGarden
      @DataGarden 2 ปีที่แล้ว

      Excellent, glad it was helpful. Thanks for leaving a comment!

  • @Stephen-3o3o
    @Stephen-3o3o 2 ปีที่แล้ว

    Thanks so much for the informative video!!!

  • @nyigambole1925
    @nyigambole1925 2 ปีที่แล้ว

    library(ggplot2) #set working directory mydir = "C:/Users/Nyigam/OneDrive/Academic/M.Tech_Project/Project_progressive_work/Project_work_development/Project Part-3-Nyigam/R-analysis" setwd(mydir) dir() #call data monthly_ppt <-read.csv("ppt_monthly_with_date.csv") monthly_ppt View(monthly_ppt) ggplot()+ geom_line(data = monthly_ppt, mapping = aes(x=time, y=CWC,IMD,CFSR,TRMM,APHRODITE,PERSIANN)) ******when I use the above code, I get an error like this****** > ggplot()+ + geom_line(data = monthly_ppt, mapping = aes(x=time, y=CWC,IMD,CFSR,TRMM,APHRODITE,PERSIANN)) Error in `[.data.frame`(df, setdiff(names(df), names(transformed))) : undefined columns selected In addition: Warning messages: 1: Duplicated aesthetics after name standardisation: 2: Ignoring unknown aesthetics:

  • @nl7247
    @nl7247 2 ปีที่แล้ว

    Thank you 😊

  • @Lewstag
    @Lewstag 2 ปีที่แล้ว

    This really helped me create some great boxplots for my PhD thesis. Many thanks for the effort you made to run me through it. Just wanted to feedback that you made a difference! Thank you

    • @DataGarden
      @DataGarden 2 ปีที่แล้ว

      That's great to hear. Thanks for your comment and good luck with your PhD thesis!

  • @dasrotrad
    @dasrotrad 2 ปีที่แล้ว

    Thank you for the helpful videos. Mail I suggest to zoom in on the panes you are modifying. You gave a lot of dead space on the screen where nothing is happening. It is impossible to see what you are writing when viewing your presentation on a cell phone screen. Thanks.

  • @gmm552
    @gmm552 2 ปีที่แล้ว

    @Data Garden: Plz let me know what are few other hyperparameters and what they imply