Dplyr Advanced Guide: data cleaning, reshaping, and merging with lubridate, stringr, tidyr, ggplot2

  • Published on 24 Jul 2024
    Timeline
    0:00 Intro
    1:30 Cleaning dates
    3:15 String cleaning & extraction
    7:30 Reshaping data
    10:57 Merging data frames
    14:05 Graphing
    Code: github.com/abhimotgi/dataslic...
    Datasets:
    Billboard Hot 100: www.kaggle.com/dhruvildave/bi...
    MTV 10,000 artists: gist.github.com/mbejda/9912f7...
    Cheat sheets:
    Lubridate: evoldyn.gitlab.io/evomics-201...
    Stringr: evoldyn.gitlab.io/evomics-201...
    Tidyr: community.rstudio.com/t/tidyr...
    Ggplot2: www.maths.usyd.edu.au/u/UG/SM...

Comments • 29

  • @willykitheka7618 3 years ago +12

    Your teaching style has really resonated with me. If you could produce more tutorials on data cleaning and building data pipelines (basically data engineering), that would have a substantial impact: many content creators in this space focus on model creation, yet we know that 80% of the time is spent on data cleaning. My two cents.

  • @krzysadamczyk7944 3 years ago +1

    Fantastic video as always. Keep up the good work!

  • @ajpalec 2 years ago +3

    Trying to pick up R for my job (from a non-DS background) and getting so much out of these videos. Many lessons are packed in, and I pick up more tips on each rewatch as I learn more. Thank you for all the work you put into these! 🙏🏼
    Echoing the other commenters from this and other vids on your video and tutorial quality. Definitely my favorite tutorials on R that I’ve found!

  • @mkklindhardt 2 years ago +1

    Awesome insights! Thanks for sharing these videos ;)

  • @nth.education 1 year ago

    Really fortunate to have stumbled on your channel, nicely explained :)

  • @marcinterlecki3024 2 years ago +1

    The way you teach this subject is very accessible!

  • @jonaskolb1941 2 years ago +1

    Thank you. Very informative and easy to understand.

  • @janaileonce1270 2 years ago +1

    One of the best R videos I have ever watched. Thanks.

  • @mursidgurbuz2935 1 year ago

    Very clear explanation! Thank you so much!

  • @fourlokody 1 year ago

    bro, these videos are really helpful. please keep uploading

  • @LogicApparatus 2 months ago

    Good stuff dude! Very helpful video, you know your stuff!

  • @eduardobustamante1797 3 years ago +2

    Thanks for the videos, they really help. You're the best at explaining. Greetings from Mexico :)

  • @Pitipapeta 3 years ago +3

    I'm an applied statistics student and these videos are AMAZING. In 17 minutes you summarised multiple classes' worth of dplyr knowledge. Hell, I even learned some new stuff, even though I've been using this package for almost 2 years now. I've watched most of your videos, and I'm going to redo some of my old assignments using what I've learned from your channel to assimilate the new knowledge. Thanks a lot for your work!!!

    • @dataslice 3 years ago

      Thanks, I’m glad I could help!

  • @75Miah 3 years ago +1

    Really nice video! I had used lubridate before but not the way you showed off in the video. Gonna have to keep this series bookmarked to go back to for ref.

  • @naterush5019 3 years ago +1

    Great vid!

  • @alexdaudeinv5270 1 year ago

    clear, simple and useful, just thanks :)

  • @djangoworldwide7925 3 years ago +6

    This tutorial can probably cover 3-4 classes

  • @ChrisMao_708 2 years ago +4

    Hey, great video. BIG THANKS for all the work!! By the way, the [name] column in the MTV dataset seems to have leading and trailing spaces, which may affect the inner join. I used strip.white = TRUE when reading the CSV file, and it seems to work. Again, I really learned a lot from this fantastic tutorial video. Really appreciate it : )

    • @justinfoo2905 1 year ago +2

      This. How did everyone else miss this??
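
The whitespace issue raised in this thread can be sketched in R as follows. This is only a minimal illustration, not the video's code: the inline CSV text, the `billboard` data frame, and all variable names are made-up stand-ins for the real MTV and Billboard datasets. It assumes dplyr is installed and shows how `strip.white = TRUE` in `read.csv()` changes the result of `inner_join()`.

```r
library(dplyr)

# Hypothetical stand-in for the MTV artists CSV:
# the name values carry leading and trailing spaces.
artists_csv <- "name,genre
 Adele ,Pop
 Drake ,Hip-Hop"

# Hypothetical stand-in for the Billboard data, with clean artist names.
billboard <- data.frame(
  name = c("Adele", "Drake"),
  song = c("Hello", "One Dance")
)

# Default read: surrounding whitespace stays in the name column.
artists_raw <- read.csv(text = artists_csv)

# strip.white = TRUE trims leading and trailing whitespace
# from unquoted character fields while reading.
artists_clean <- read.csv(text = artists_csv, strip.white = TRUE)

# Join keys must match exactly, so the untrimmed names find no partners.
joined_raw <- inner_join(billboard, artists_raw, by = "name")
joined_clean <- inner_join(billboard, artists_clean, by = "name")

nrow(joined_raw)    # 0 rows: " Adele " is not equal to "Adele"
nrow(joined_clean)  # 2 rows: both artists match after trimming
```

If the data is already loaded, trimming the key column with base R's `trimws()` or stringr's `str_trim()` before joining should have the same effect.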

  • @khanhhiendinh 2 years ago +1

    Thank you

  • @leeslater8962 3 years ago +1

    Excellent video, very helpful and clear. Planning the next video?

    • @dataslice 3 years ago

      Yep, planning the next video! Not sure yet what the topic will be, though.

  • @rvhsshvr 2 years ago +1

    I love you

  • @grainofsalt2113 3 years ago +2

    Awesome, thanks man. Can you do some Python tutorials as well, on predictive modeling?

    • @dataslice 3 years ago +1

      Yeah definitely trying to do some Python tutorials soon

    • @grainofsalt2113 3 years ago +1

      @dataslice Awesome, I'm looking forward to it! Your vids have helped me improve in my job

  • @gritmanish 1 year ago

    The way you have explained it is very good, but please go slower.