This video is literally PERFECT for ggplot2 beginners! In only one hour, you'll learn the basics of ggplot2 R coding and you'll end up falling in love with ggplot2 (I thought this language was weird and not intuitive at first, but after this video I think it's very useful and practical to visualize more accurate data plots!)
This was amazing - thanks. It was, literally, the first time I've ever coded anything in my life, I've wanted to learn about data vis stuff for sports analytics for a long time and this video was the perfect introduction. If anyone is in a similar position to myself, and has zero ! previous knowledge of R or programming, and wants to learn about data vis, I'd just start with this video. The only thing I needed to hit google for was to learn how to import the dataset into RStudio (yes, really). Thank you!
Glad to hear that, Jaime. It's still not too late to start with data science. Take a look at our Bootcamp, which might be a great way for you to start: datasciencedojo.com/data-science-bootcamp/
Thanks Dave... I think the way you present the code and interpreting the result is awesome. even I have just new to ggplot2, the presentation bring me as if we have so familiar with the code. You make R is not that difficult.
Currently a college student pursuing a degree in Economics. I'm taking Intro to Economic Data Analysis, and we have the choice of using R or Excel. Our first Homework Project directly coincides with using ggplot2. I haven't even finished half of the video but can already say I have learned so much about R. Sweet vid!
I think if you don't convert them into factors (which are basically categorical variables), then R will think them just as a string of words rather than recurring categories.
Good lecture on ggplot and its functionalities. I liked the examples. I would have liked to see it go a little deeper into examples coders and analysts can use directly in their analysis and data science problems. Thank you.
Thanks for the tutorial! Small caveats on the density plot and the histogram towards the end. The axes are mislabeled. Y should be probability density or counts, while X should be age
I loaded the dataset in both SPSS and R and did all of the plots - to me SPSS was more easy to use, but the plots actually look better in R. Great video.
46:08 May be instead of using copy and paste, we could use, for example: ggplt = ggplot2(titanic,aes...) and add layers to that ggplt + theme_bw().+ labs()....
Hi! How can I change the graphic color? I created the graphic but I don't want it to have only the basic colors of the ggplot2, like this pink and blue colors. I did a little research on google, but I only find how I can change the color on graphics that has continous variables and my variables are discrets. Can you help me?
Hello..the titanic.csv dataset is stored in code.datasciencedojo..May I please know where or how else can I retrieve the file as I would like to go through these class? Thank you.
Very helpful. I think at the end, the density plot vs histograms issue is, the layered density plots show two different distributions of age and the histograms show one distribution of age and bi-color that distribution by survival. Two different things.
i guess im asking randomly but does anybody know a trick to log back into an instagram account?? I was dumb forgot the password. I would appreciate any assistance you can give me
@Chaim Vance I really appreciate your reply. I found the site on google and I'm trying it out now. Seems to take a while so I will reply here later when my account password hopefully is recovered.
Thanks for great lecture, and I'm wondering in the lecture around 1:05:58, the code indicates that aes(x is Age), but the label says it is Survived. Shouldn't it be flipped? labs(y = Survived, x = Age ...)
What would be fantastic is if you could please create 10-15min or less summary videos of your lessons just to provide a snap shot of the different codes. That way it would make it extremely easy to revise your information without needing to sit through the repetition of the more indepth explanations we have already heard.
Hi I have started working with ggplot2 recently install.packages("ggplot2")is working fine but while using library(ggplot2) I am having below and can't come out. library(ggplot2) Error: package or namespace load failed for ‘ggplot2’: object ‘enexprs’ is not exported by 'namespace:rlang' In addition: Warning message: package ‘ggplot2’ was built under R version 3.4.4 Your help is highly needed
Sir very good video...I just had a doubt ...if we have 8-10 categories instead of 3 for pclass is there an option to select and show the top 5 pclass from the 8-10 categories and plot them.using ggplot...because I work in the aerospace industry and have multiple categories for each variable...thanks Amod Shirke
I wanted to practice with the file while watching this video. But I haven't been able to download the file that he mentions as 'easily downloadable'. I wish he made the file available for downloading on github.
ggplot(titanic, aes(x = Age)) + theme_bw()+ geom_histogram(binwidth = 5) + labs(y="Passanger Count", x="Age (binwidth=5)", title = "Titanic Age Distribution") This doesnt work getting an error Error: StatBin requires a continuous x variable: the x variable is discrete. Perhaps you want stat="count"? >
Please how do I display equation of the line and r^2 on my plots in R? In excel it is very easy to do this. I am buying into R because of R markdown. Please help out as I need my equation displayed just the way I use to in excel
Liberty Mgbanyi you can use anotation function and use Paste0 to show the r^2 and equation. If you have r^2 and equation for every facet, you would need to make more variable / column.
Show really starts after 25min. You should have discussed passengerid and name when you discussed the variables. Is ggplot smart to use factors for visualization? In a few days i will be desperate to remember that i need to factor to get certain visu. It should be the programmer to have contol not the program. No? Is it really sooo complicated to put % in the plot? Not good publicity for such a great package as ggplot.
You can find our supplemental material here: code.datasciencedojo.com/datasciencedojo/tutorials/tree/master/Introduction%20to%20Data%20Visualization%20with%20R%20and%20ggplot2
I would like to use "ggplot2" to create a graph showing whether life satisfaction returns to the value it had before the unemployment event occurred. Unemployment I have coded with 0 = not unemployed and 1 = unemployed. General life satisfaction is coded 0 - 10. I have already created the data set df_emp with a subset command, which contains all persons who were unemployed at least once. It also contains all years of observation of these persons. So all years before, during and after unemployment, as well as the corresponding values for life satisfaction The years before, during and after the event should now be entered on the x-axis. Where 0 is the event unemployment. The values -1 -2 and 1 2 etc. show values for the years before and after unemployment. On the y-axis the values for life satisfaction should then be deducted (centered). Unfortunately, I can't manage to model this graphic in R and would therefore be very pleased if somebody could help me or give me some tips on how to proceed. Best regards ps. I got the graphic from Lucas et al. (2004) - Unemployment alters the set point for life satisfaction
what a load of unnecessary talking.... 25 minutes of not needed introduction. 5 minutes to tell why passengerID and name are not relevant.... get to the point already...
I would suggest everyone beginning with ggplot2 to go through this 1hr vedio, it will save you a lot of time understanding the basics.
@Kushagra Mishra - You are too kind, glad you liked the video!
Dave
Kushagra Mishra uwl
This video is literally PERFECT for ggplot2 beginners! In only one hour, you'll learn the basics of ggplot2 R coding and you'll end up falling in love with ggplot2 (I thought this language was weird and not intuitive at first, but after this video I think it's very useful and practical to visualize more accurate data plots!)
THIS IS ABSOLUETELY ONE OF THE BEST TUTORIALS ON CODING THAT I'VE EVER SEEN !!!! THANK YOUUU !! UP UP UP
This was amazing - thanks. It was, literally, the first time I've ever coded anything in my life, I've wanted to learn about data vis stuff for sports analytics for a long time and this video was the perfect introduction.
If anyone is in a similar position to myself, and has zero ! previous knowledge of R or programming, and wants to learn about data vis, I'd just start with this video. The only thing I needed to hit google for was to learn how to import the dataset into RStudio (yes, really).
Thank you!
Simply exceptional. Thank you. I'm hooked. And I'm not even a Data Scientist. If I was 20 years younger, I'd get into this field.
Glad to hear that, Jaime. It's still not too late to start with data science. Take a look at our Bootcamp, which might be a great way for you to start: datasciencedojo.com/data-science-bootcamp/
Thanks Dave... I think the way you present the code and interpreting the result is awesome. even I have just new to ggplot2, the presentation bring me as if we have so familiar with the code. You make R is not that difficult.
glad to help you out, keep following us for more content!
26:30 - Actual video
thnks bruh
Thank you!
I thought this was an exaggeration, but no - thanks for the head's up!
Thank you
Thanks man
The seventh question, I believe the labs should be as:
labs(x = "Age", y = "Density"), and
labs(x = "Age", y = "Survived Count")
Legit the perfect video for a beginner. Thanks a ton man
Keep following us for more crash courses!
Great presentation! Thanks for making ggplot2 easier to be understood.
Currently a college student pursuing a degree in Economics. I'm taking Intro to Economic Data Analysis, and we have the choice of using R or Excel. Our first Homework Project directly coincides with using ggplot2. I haven't even finished half of the video but can already say I have learned so much about R. Sweet vid!
Wow, the Best video on ggplot2. Love you Data Science Dojo. So very much helpful and really got me excited.
Thank you for this video, it was indeed helpful. Didn't have sufficient knowledge in ggplots but now i do. Thanks a lot!
Why the column names are converted to factors in 32:38 ?
I tried using them without converting to factors.
Some were executed but some gave errors.
I think if you don't convert them into factors (which are basically categorical variables), then R will think them just as a string of words rather than recurring categories.
Definitely one of the best intros to ggplot2
Super helpful and crystal clear intro. Thank you very much!
Clear and pertinent!
Keep following us for more content!
Such a wonderful video!!!So simple and easy way to make it understand
Great intro to ggplot2. Made the basics very clear.
An excellent video! Thanks a lot!
Glad you liked it!
So clear and nice lecture. Thank you so much.
Glad you liked it, stay tuned for more lectures!
Good lecture on ggplot and its functionalities. I liked the examples. I would have liked to see it go a little deeper into examples coders and analysts can use directly in their analysis and data science problems. Thank you.
Thanks for the tutorial! Small caveats on the density plot and the histogram towards the end. The axes are mislabeled. Y should be probability density or counts, while X should be age
I loaded the dataset in both SPSS and R and did all of the plots - to me SPSS was more easy to use, but the plots actually look better in R. Great video.
Wonderful, this is was so useful and one hour full of knowledge and hand on practice.
Thanks alot guys !
Belo
Thank you for making this great tutorial! It's easy and simple to follow! I've learned a ton from it; keep making more, please!
46:08 May be instead of using copy and paste, we could use, for example:
ggplt = ggplot2(titanic,aes...) and add layers to that
ggplt +
theme_bw().+
labs()....
Hi! How can I change the graphic color? I created the graphic but I don't want it to have only the basic colors of the ggplot2, like this pink and blue colors. I did a little research on google, but I only find how I can change the color on graphics that has continous variables and my variables are discrets. Can you help me?
Isn't there a mistake in the axis description of the last histogram?
oh man I can't thank enough, you are so good I lost my mind in understanding u hold my back, thanks
Hello..the titanic.csv dataset is stored in code.datasciencedojo..May I please know where or how else can I retrieve the file as I would like to go through these class? Thank you.
Very helpful and appreciated, thanks for uploading
Very helpful. I think at the end, the density plot vs histograms issue is, the layered density plots show two different distributions of age and the histograms show one distribution of age and bi-color that distribution by survival. Two different things.
i guess im asking randomly but does anybody know a trick to log back into an instagram account??
I was dumb forgot the password. I would appreciate any assistance you can give me
@Brantley Grey Instablaster :)
@Chaim Vance I really appreciate your reply. I found the site on google and I'm trying it out now.
Seems to take a while so I will reply here later when my account password hopefully is recovered.
@Chaim Vance it worked and I now got access to my account again. Im so happy:D
Thanks so much you saved my ass :D
@Brantley Grey Happy to help :D
worth watching 1 hr..Really helpful. Thanks a lot
This is so awesome. Thank you so much.
Keep following us for more content!
Hi, I tried running the code for 2nd question w.r.t Sex but still getting the grey bars. The color for survived is not coming. Please help.
excellent video! Thank you very much Dave
Thanks for great lecture, and I'm wondering in the lecture around 1:05:58, the code indicates that aes(x is Age), but the label says it is Survived. Shouldn't it be flipped? labs(y = Survived, x = Age ...)
Thank you very much for your explanation.
thanks for such good vedio. Loved it,
What is the use of factorise here? I thought factorising some variable was going to be used later in exercise.
What would be fantastic is if you could please create 10-15min or less summary videos of your lessons just to provide a snap shot of the different codes. That way it would make it extremely easy to revise your information without needing to sit through the repetition of the more indepth explanations we have already heard.
Found this super helpful! Thanks so much
Superb presentation
can i do you one cuestion? is about a graphic that i can't resolve
Hi I have started working with ggplot2 recently
install.packages("ggplot2")is working fine but while using library(ggplot2) I am having below and can't come out.
library(ggplot2)
Error: package or namespace load failed for ‘ggplot2’:
object ‘enexprs’ is not exported by 'namespace:rlang'
In addition: Warning message:
package ‘ggplot2’ was built under R version 3.4.4
Your help is highly needed
Nice video to get you hook with ggplot2
Sir very good video...I just had a doubt ...if we have 8-10 categories instead of 3 for pclass is there an option to select and show the top 5 pclass from the 8-10 categories and plot them.using ggplot...because I work in the aerospace industry and have multiple categories for each variable...thanks
Amod Shirke
Great tutorial, thank you!
for people that are familiar with R skip to 33:33
I just started loving R
I wanted to practice with the file while watching this video. But I haven't been able to download the file that he mentions as 'easily downloadable'. I wish he made the file available for downloading on github.
Very good!
ggplot(titanic, aes(x = Age)) +
theme_bw()+
geom_histogram(binwidth = 5) +
labs(y="Passanger Count",
x="Age (binwidth=5)",
title = "Titanic Age Distribution")
This doesnt work getting an error
Error: StatBin requires a continuous x variable: the x variable is discrete. Perhaps you want stat="count"?
>
very nice explanation with the dataset. Thank You.
very useful video... thank you
in 42:30min, you have a color on you bars. but with same code, my bars are having the same color. Why? Please give me a solution. Thanks in advance
You likely forgot to set up the factors. Go back to around 32:00 in the video for the explanation.
Yes. I have done after making comment. Thanks for replying.
please create more content on prediction and fitting function
Mistakes in your code 162, 163, 171 and 172. The x axis should be "Age" and y axis should be "Survived". By the way, great tutorial!
I need some help with my R language biostatistics and I'm glad to pay the affordable R tutorial through Zoom or any other platform.
Hello Joseph, do check out our free course on R: online.datasciencedojo.com/course/R-Programming
I don´t read the CSV file.....please help..
thank you very much, very helpful for me.
Very nice sir,
Please make a videos for 3d visualization .
Excellent!
thanks very informative
DATA SCIENCE DOJO Piliz share the R code , my not running properly
Please how do I display equation of the line and r^2 on my plots in R? In excel it is very easy to do this. I am buying into R because of R markdown. Please help out as I need my equation displayed just the way I use to in excel
Liberty Mgbanyi you can use anotation function and use Paste0 to show the r^2 and equation. If you have r^2 and equation for every facet, you would need to make more variable / column.
start from @24:00
Thank you very much. You are far too kind
Incredible, telling a story with data!
Show really starts after 25min. You should have discussed passengerid and name when you discussed the variables. Is ggplot smart to use factors for visualization? In a few days i will be desperate to remember that i need to factor to get certain visu. It should be the programmer to have contol not the program. No? Is it really sooo complicated to put % in the plot? Not good publicity for such a great package as ggplot.
you should do geom_boxplot(notch=T) so folks understand the concept of visually comparing medians; also read_csv preferred over read.csv
This is a great tur]torial, good job
Thanks a ton david ...
Thank you so much..
I think in last two graphs both density and histogram are wrongly labeled
Would you elaborate on why this is?
I was thinking the same. It doesn't make sense to label the y axis as survived, but rather frequency? while the x axis should just be age.
I also believe it is wrong. It should be age on the x-axis.
The age is on the x-axis and survived is on the y-axis.
Infinite SNR - Thanks!
really helpfull
Really thanks
I need the excel or csv file to practice
You can find our supplemental material here: code.datasciencedojo.com/datasciencedojo/tutorials/tree/master/Introduction%20to%20Data%20Visualization%20with%20R%20and%20ggplot2
I would like to use "ggplot2" to create a graph showing whether life satisfaction returns to the value it had before the unemployment event occurred.
Unemployment I have coded with 0 = not unemployed and 1 = unemployed. General life satisfaction is coded 0 - 10.
I have already created the data set df_emp with a subset command, which contains all persons who were unemployed at least once. It also contains all years of observation of these persons. So all years before, during and after unemployment, as well as the corresponding values for life satisfaction
The years before, during and after the event should now be entered on the x-axis. Where 0 is the event unemployment. The values -1 -2 and 1 2 etc. show values for the years before and after unemployment.
On the y-axis the values for life satisfaction should then be deducted (centered).
Unfortunately, I can't manage to model this graphic in R and would therefore be very pleased if somebody could help me or give me some tips on how to proceed.
Best regards
ps. I got the graphic from Lucas et al. (2004) - Unemployment alters the set point for life satisfaction
thanks a lot !!!
ggplot2 best package in data science for visulaization
Wow. Thanks
Pretty cool
Wait a minute, I am gonna update my linked In bio to R expert
Thanks
Keep following us for more tutorials.
That video was fantastic. I now know i should be a girl if i want to go on a cruise
Histogram 54:44
Install equiser package
tq dojo
what a load of unnecessary talking.... 25 minutes of not needed introduction. 5 minutes to tell why passengerID and name are not relevant.... get to the point already...