you are so amazing and articulated. I mean you OBVIOUSLY have the experience and knowledge in DS but unlike some other videos I watched on youtube you do 10000% sound like someone has a job in DS and have to share your findings to stakeholder. you are so nice. I hope you'd update your channel after some time, anything would be great.
An excellent introduction to time-series forecasting and FB prophet, very well explained and well writen code. I will be watching many more of your videos :)
Having started my degree in astrophysics and then deciding about halfway through i wanted to do data science, your channel has been a gold mine! I graduate next spring with my degree in Data Science. You have been a large part of my learning and i thank you!!
For me, your timing is absolutely spot on - I am sitting for Microsoft DP-100 in three weeks and starting a machine learning class the week after that, so THANK YOU! I can't wait for the next installment :)
I majored in Mathematical Economics in undergrad and graduated in fall of 2020. I'm currently going to grad school for Data Science in healthcare analytics. This channel really helps!!
The timing on this is absolute gold, this is literally the type of project I'm building as a Django app for my work right now! It's an analytics dashboard to monitor sales activity by channel and since we're dealing with a lot of seasonality there, Prophet seems like a spot-on fit for incorporating forecasts. Thank you for doing this, excited for more of the series!
Fantastic video.. exactly what I've been looking for.. Will be in corporating this into or forecasting workflow.. thank you 😊.. subscribed and will be watching the full 30 days 😊
This is great! Just started the video, cool to see another time series forecasting model. I have primarily used the Nixtla forecasting libraries like Neuralforecast and Statsforecast. Excited to see another approach! Keep up the good work!
Just finished the video - great work! I really enjoyed how you walked through all the aspects of the code and even re-ran some cells to really help explain what is going on. Excited to see the rest of this series, keep it up!
Thanks for watching! That’s awesome, I’ll have to check those libraries out! I’ve primarily used Prophet because it felt so easy to use and also explain to stakeholders haha. Appreciate you keeping up with the series!
This is so awesome! Your patience, attention to detail and communication style makes it feel like I am chatting with a colleague across the room to learn more about time-series. I'm now subscribed and will probably go watch all the other videos :)
As a newbie to FB prophet these 2 tutorials rock! Very easy to follow along and digest. Are you planning on releasing the 3rd part of the series any time soon? Excited to watch!
I know but get hEr ( side note ) she told everyone to pleAse engage. Did everyone obliGe??? When the youth find her it’s a wrap. Ok I will show everyone how exciting she and this channel is. Y’all have no idea how you’re about to love learning again! Let’s gooooOoOo!
Would you consider this logic valuable also for LTV calculation, where instead of categories (such as automotive, babycare, beauty ...) we have cohort months (such as Jan-23, Feb-23 ...)?
@TheAlmostAstrophysicist - Thanks for this. Currently working on a forecasting model and this video came in right on time. Looking forward to the next videos. I also studied Physics, by the way 😄
Great series, thanks. Just on high volume you refer to largest sales by day, but is transaction volume not more important than total daily sales? High transaction volume will give better signal than low transaction volume.
Great question! Absolutely - I think what you define as transaction volume is what I'm referring to when I say total daily sales. i.e. the higher the transactions are daily/the higher the volume, the better signal we get. In the video, I use the "np.mean" function across the columns to see what the average daily sales (i.e. avg. transaction volume) is. In general, the lower the volume, (under $1000 usually) leads to higher errors since it's hard to get signal. So I use >=$1000 as a cut-off. Does this make sense? I think we mean the same thing haha
Can you please do a video on predicting discrete yes or no events in a time series using only categorical data?? That would be immensely helpful. I’m approaching feature selection with mutual information classification, but I’d like to know how you’d pipeline it!
this is nicely in-depth, thank you! in terms of scaling, would it be best to run this as a Python script instead of notebook and automate it using something like airflow?
Thanks for watching! Absolutely! So you'd want to fully automate it as a pipeline, my second video (coming out Saturday this week) is a second full pipeline notebook and you'd want sometime like that pipeline either automated as a script OR you can use a service like Databricks/something similar to schedule regular notebook runs/jobs. :)
HI Priya, I love the video series idea. I'm currently in sales and looking to propose a sales forecasting pipeline at work as an audition to transition to a full time position. I love the video and still trying to get head around the coding itself. Keep up the good work and I look forward to more in your series
I have a question related to this video series and perhaps a request. As mentioned, I'm trying to make a forecasting proposal for my workplace and would like to cover all the basis that would be applicable from a data science perspective I built an RFM and CLTV customer segmentation Kmeans model based off e-commerce data from Kaggle and wanted to use these clustering to make forecasting prediction based off leads received and classified into the identified clusters. I will be forecasting total sales for the month using regression and wanted to know if this is something you would be doing on a day to day as a data scientist in a sales environment or am I missing a step?
If you're given daily sales data and the agency that makes the order per day, what would you do to predict the sales per agency for a particular day? The data I have contains over 1000 agencies.
Still new to ds, but will your videos. I dont really understand eda, its purpose in the end and how to use your findings in eda for the followng processes in ds cycle. If you could make a video on it, in this series with an simple example case study, I would appreciate it.
Is there a way to deal with having lots of 0s in the time series? I'm currently working on a procurement forecast model. Therefore there are lots of days where procurement doesn't happen, making the y value 0 for most days. This is really affecting the model performance.
I am starting out in DS (around a year into it) and I am really inspired by your content. Never used prophet but will make sure to run your notebook and accompany the series! Just curious, how long does it take you to make something like this notebook? I am struggling to execute faster and was wondering if you have any tips on that? Great content as always!
Awesome! To make the notebook, took about I'd say 20-30 minutes since I've worked with prophet before! The hardest part was honestly finding good open source data lol. And the whole notebook takes about 20ish minutes to run if you go through the whole hypertuning cross-validation for every category of products! I have that notebook pipeline for video 2 finished!
Hmm that's weird. I download the "train.csv" from www.kaggle.com/c/favorita-grocery-sales-forecasting and I renamed it on my desktop to "store_data.csv" Maybe that's the issue if you can't read in the data?
USE df_cv = cross_validation(m, initial='365 days', period='30 days', horizon = '30 days', parallel='threads') INSTEAD OF df_cv = cross_validation(m, initial='365 days', period='30 days', horizon = '30 days', parallel='processes') IF YOU HAVE A OLD COMPUTER.
An astrophysicist into data science? Well it does seem like a natural transitition, you can use pdes or matrix algebra you've learnt from physics for data science.
I just found your channel, loved this video and subscribed. But looks like you stoped making content. Please come back, I like the way you give a background on the items discussed, like the Fourier math etc. 🫶
you are so amazing and articulated. I mean you OBVIOUSLY have the experience and knowledge in DS but unlike some other videos I watched on youtube you do 10000% sound like someone has a job in DS and have to share your findings to stakeholder. you are so nice.
I hope you'd update your channel after some time, anything would be great.
An excellent introduction to time-series forecasting and FB prophet, very well explained and well writen code. I will be watching many more of your videos :)
Having started my degree in astrophysics and then deciding about halfway through i wanted to do data science, your channel has been a gold mine! I graduate next spring with my degree in Data Science. You have been a large part of my learning and i thank you!!
So glad I can be a part of your journey! Sounds so similar to mine haha
For me, your timing is absolutely spot on - I am sitting for Microsoft DP-100 in three weeks and starting a machine learning class the week after that, so THANK YOU! I can't wait for the next installment :)
Oh awesome! I'm glad you can follow along with classes! 😀
first of all thank u for giving the information of data science and take out us to the real world data science word
course
I majored in Mathematical Economics in undergrad and graduated in fall of 2020. I'm currently going to grad school for Data Science in healthcare analytics. This channel really helps!!
That's awesome - good luck on your journey and glad the channel can be a part of it!
The timing on this is absolute gold, this is literally the type of project I'm building as a Django app for my work right now! It's an analytics dashboard to monitor sales activity by channel and since we're dealing with a lot of seasonality there, Prophet seems like a spot-on fit for incorporating forecasts. Thank you for doing this, excited for more of the series!
Ahhhh this makes me so happy! Incorporating DS to solve business problems for the win!
Fantastic video.. exactly what I've been looking for.. Will be in corporating this into or forecasting workflow.. thank you 😊.. subscribed and will be watching the full 30 days 😊
This is great! Just started the video, cool to see another time series forecasting model. I have primarily used the Nixtla forecasting libraries like Neuralforecast and Statsforecast. Excited to see another approach! Keep up the good work!
Just finished the video - great work! I really enjoyed how you walked through all the aspects of the code and even re-ran some cells to really help explain what is going on. Excited to see the rest of this series, keep it up!
Thanks for watching! That’s awesome, I’ll have to check those libraries out! I’ve primarily used Prophet because it felt so easy to use and also explain to stakeholders haha. Appreciate you keeping up with the series!
Your videos have made me seriously look into Graduate programs in Data Science. Thank you!
Thank you so much for this amazing video, it's so pretty useful. Not enought words to thank you
This is so awesome! Your patience, attention to detail and communication style makes it feel like I am chatting with a colleague across the room to learn more about time-series. I'm now subscribed and will probably go watch all the other videos :)
this series will be FIRE 🔥🔥🔥
Appreciate it!!!
As a newbie to FB prophet these 2 tutorials rock! Very easy to follow along and digest. Are you planning on releasing the 3rd part of the series any time soon? Excited to watch!
best video i see for prophet, thanks
Looking forward to all your videos!
Thank you so much for sharing this knowledge with us for free! The video was amazing!
Been waiting for this series from you! Thank you!!
Of course! More to come with the series, thanks for following along! 😀
Middle of doing my Stats 5301 hw.. can’t wait to finish up and get into this vid!
Means a lot that you're following along, thank you!! Hope this helps 😀
Hello Priya! I am a new follower of yours here and I new fan as well! Congratulations! This explanation is beautiful!
Fantastic density of the content.
Many thanks for the super great video!
I would like to know why you have loaded holidays, but they are not (or cannot be) used by Prophet later?
Thank you for this productive video! Learnt a lot!!
Great to hear, thanks for watching!
Super excited to start this!
Hope you enjoy it!!
You stopped uploading??? Nooooo
I know but get hEr ( side note ) she told everyone to pleAse engage. Did everyone obliGe???
When the youth find her it’s a wrap.
Ok I will show everyone how exciting she and this channel is. Y’all have no idea how you’re about to love learning again! Let’s gooooOoOo!
Would you consider this logic valuable also for LTV calculation, where instead of categories (such as automotive, babycare, beauty ...) we have cohort months (such as Jan-23, Feb-23 ...)?
I would like to think so.. Did you end up utilizing Prophet for this use case?
@TheAlmostAstrophysicist - Thanks for this. Currently working on a forecasting model and this video came in right on time.
Looking forward to the next videos.
I also studied Physics, by the way 😄
That’s awesome! Glad the video can help, thanks for following along! also so fun that you did physics too!
Thank you for making these
Great series, thanks. Just on high volume you refer to largest sales by day, but is transaction volume not more important than total daily sales? High transaction volume will give better signal than low transaction volume.
Great question! Absolutely - I think what you define as transaction volume is what I'm referring to when I say total daily sales. i.e. the higher the transactions are daily/the higher the volume, the better signal we get.
In the video, I use the "np.mean" function across the columns to see what the average daily sales (i.e. avg. transaction volume) is. In general, the lower the volume, (under $1000 usually) leads to higher errors since it's hard to get signal. So I use >=$1000 as a cut-off.
Does this make sense? I think we mean the same thing haha
Good One!
Quality content as always!
thank you!!!
Can you please do a video on predicting discrete yes or no events in a time series using only categorical data?? That would be immensely helpful. I’m approaching feature selection with mutual information classification, but I’d like to know how you’d pipeline it!
this is nicely in-depth, thank you! in terms of scaling, would it be best to run this as a Python script instead of notebook and automate it using something like airflow?
Thanks for watching! Absolutely! So you'd want to fully automate it as a pipeline, my second video (coming out Saturday this week) is a second full pipeline notebook and you'd want sometime like that pipeline either automated as a script OR you can use a service like Databricks/something similar to schedule regular notebook runs/jobs. :)
Hi Priya - When can we see next video? These are par excellence.... KumR
thank you for sharing. This is very informative!
HI Priya, I love the video series idea. I'm currently in sales and looking to propose a sales forecasting pipeline at work as an audition to transition to a full time position. I love the video and still trying to get head around the coding itself.
Keep up the good work and I look forward to more in your series
I have a question related to this video series and perhaps a request. As mentioned, I'm trying to make a forecasting proposal for my workplace and would like to cover all the basis that would be applicable from a data science perspective
I built an RFM and CLTV customer segmentation Kmeans model based off e-commerce data from Kaggle and wanted to use these clustering to make forecasting prediction based off leads received and classified into the identified clusters. I will be forecasting total sales for the month using regression and wanted to know if this is something you would be doing on a day to day as a data scientist in a sales environment or am I missing a step?
Woow, amazing class, thank you (from Brazil)
If you're given daily sales data and the agency that makes the order per day, what would you do to predict the sales per agency for a particular day? The data I have contains over 1000 agencies.
what data do you have exactly? Is it:
agency | sales
So just 2 columns?
Still new to ds, but will your videos. I dont really understand eda, its purpose in the end and how to use your findings in eda for the followng processes in ds cycle. If you could make a video on it, in this series with an simple example case study, I would appreciate it.
Is there a way to deal with having lots of 0s in the time series? I'm currently working on a procurement forecast model. Therefore there are lots of days where procurement doesn't happen, making the y value 0 for most days. This is really affecting the model performance.
Useful information
Thanks so much Tata! 😄😄
bruh your the goat
It is an amazing video!
I am starting out in DS (around a year into it) and I am really inspired by your content. Never used prophet but will make sure to run your notebook and accompany the series! Just curious, how long does it take you to make something like this notebook? I am struggling to execute faster and was wondering if you have any tips on that?
Great content as always!
Awesome! To make the notebook, took about I'd say 20-30 minutes since I've worked with prophet before! The hardest part was honestly finding good open source data lol. And the whole notebook takes about 20ish minutes to run if you go through the whole hypertuning cross-validation for every category of products! I have that notebook pipeline for video 2 finished!
Thank you for this series, when i downloaded the dataset from kaggel it didn't downloaded right
Hmm that's weird. I download the "train.csv" from www.kaggle.com/c/favorita-grocery-sales-forecasting and I renamed it on my desktop to "store_data.csv" Maybe that's the issue if you can't read in the data?
is the Prophet still working and if is let me know how to download , i am not able to import or install that algo
Thanks a lot from France 👌
Why 1-MAPE as the accuracy metric?
This is great content, thank you.
You’re amazing
USE
df_cv = cross_validation(m, initial='365 days', period='30 days', horizon = '30 days', parallel='threads')
INSTEAD OF
df_cv = cross_validation(m, initial='365 days', period='30 days', horizon = '30 days', parallel='processes')
IF YOU HAVE A OLD COMPUTER.
Please make more content ❤
Waiting the 3rd video about prophet =)
part 2 ????
Please come back online
Great video. Is it ok just to leave some troll comments/questions?
i luv you
An astrophysicist into data science? Well it does seem like a natural transitition, you can use pdes or matrix algebra you've learnt from physics for data science.
me struggling to change my major from IS to CS🤣🤣🤣🤣🤣
good girl
I just found your channel, loved this video and subscribed. But looks like you stoped making content. Please come back, I like the way you give a background on the items discussed, like the Fourier math etc. 🫶