Python Pandas Tutorial (Part 10): Working with Dates and Time Series Data

แชร์
ฝัง
  • เผยแพร่เมื่อ 22 ก.ค. 2024
  • In this video, we will be learning how to work with DateTime and Time Series data in Pandas.
    This video is sponsored by Brilliant. Go to brilliant.org/cms to sign up for free. Be one of the first 200 people to sign up with this link and get 20% off your premium subscription.
    In this Python Programming video, we will be learning several different concepts about working with DateTimes and Time Series data in Pandas. We will learn how to convert values to datetimes, how to filter by dates, how to resample our dates to do some more in-depth analysis, and more. Let's get started...
    The code for this video can be found at:
    bit.ly/Pandas-10
    StackOverflow Survey Download Page - bit.ly/SO-Survey-Download
    Datetime Formatting Codes - bit.ly/python-dt-fmt
    Pandas Date Offset Codes - bit.ly/pandas-dt-fmt
    ✅ Support My Channel Through Patreon:
    / coreyms
    ✅ Become a Channel Member:
    / @coreyms
    ✅ One-Time Contribution Through PayPal:
    goo.gl/649HFY
    ✅ Cryptocurrency Donations:
    Bitcoin Wallet - 3MPH8oY2EAgbLVy7RBMinwcBntggi7qeG3
    Ethereum Wallet - 0x151649418616068fB46C3598083817101d3bCD33
    Litecoin Wallet - MPvEBY5fxGkmPQgocfJbxP6EmTo5UUXMot
    ✅ Corey's Public Amazon Wishlist
    a.co/inIyro1
    ✅ Equipment I Use and Books I Recommend:
    www.amazon.com/shop/coreyschafer
    ▶️ You Can Find Me On:
    My Website - coreyms.com/
    My Second Channel - / coreymschafer
    Facebook - / coreymschafer
    Twitter - / coreymschafer
    Instagram - / coreymschafer
    #Python #Pandas

ความคิดเห็น • 413

  • @coreyms
    @coreyms  4 ปีที่แล้ว +101

    Hope you all are all staying safe! In this video we'll be learning a lot about working with dates and time-series data in Pandas, and also look at doing some basic plotting. In the next video, we'll be learning how to load in data to Pandas from different (Excel, JSON, SQL, etc). Let me know if there is anything else you'd like me to cover in the Pandas series. I will likely be taking a break from this series after the next two videos are released just so I can focus on some different topics.

    • @kaaaaasraa49563
      @kaaaaasraa49563 4 ปีที่แล้ว +11

      Hey Corey
      Will you be able to cover some machine learning libraries like scikit learn,tensorflow,...?
      Because matplotlib and pandas that you have covered so far is related to machine learning and we can put this libraries into practical

    • @coreyms
      @coreyms  4 ปีที่แล้ว +49

      @@kaaaaasraa49563 I'll do my best. It's a very large topic to cover, so I've been a bit intimidated to tackle such a large course, but I know many people are wanting that, so I'll do my best to start working on something soon

    • @zahash1045
      @zahash1045 4 ปีที่แล้ว +2

      Hey guys checkout dfcleaner package on pypi

    • @kulpreetsingh9064
      @kulpreetsingh9064 4 ปีที่แล้ว +8

      Hey Corey, i was wondering if you could help do a complete EDA on one dataset ,kaggle or from anywhere in which you could cover the entire operations that we've learnt in one complete tutorial video with like a real world dataset. That could be like the end of this series video or something.

    • @loves_running5860
      @loves_running5860 4 ปีที่แล้ว +2

      Please help with numpy

  • @anubhavtomar1384
    @anubhavtomar1384 4 ปีที่แล้ว +90

    3:50 convert to datetime using to_datetime
    7:45 parse dates while loading data from csv
    11:20 using day_name on whole series using dt class
    12:20 create column of dayname
    13:20 min and max methods on datetime series
    15:00 filtering by dates as string
    17:20 filter by to_datetime
    18:40 set date as index
    19:21 filter data by just passing the date in brackets
    20:05 using a slice to get specific dates data
    21:00 calculating average of a slice(timeframe)
    22:05 getting max value of a column on a given day
    23:50 resampling(breaking down by days) a whole column into a new variable
    27:00 plotting with matplotlib in pandas
    28:28 resampling df with multiple columns
    30:55 using agg to apply different function on different columns while resampling

    • @mas7850
      @mas7850 4 ปีที่แล้ว +1

      Would it be possible for you to tell me how can I download ETH_1h.csv file? I found just row data on bit.ly/Pandas-10. Thanks in advance.

    • @lakshitakamboj198
      @lakshitakamboj198 2 ปีที่แล้ว

      😊

    • @gersonbronstein4664
      @gersonbronstein4664 5 หลายเดือนก่อน +1

      Wow! Thk you very much! I appreciated a lot!

  • @larc99
    @larc99 4 ปีที่แล้ว +150

    For all those getting the following error: *FutureWarning: The pandas.datetime class is deprecated and will be removed from pandas in a future version. Import from datetime module instead.*
    Just do this:
    *from datetime import datetime*
    and change the *pd.datetime.strptime()* to:
    *datetime.srtptime()*
    and it should work the same.

    • @mrrohitnayak1
      @mrrohitnayak1 3 ปีที่แล้ว +6

      I used the following
      import datetime as dparser
      d_parser = lambda fn : dparser.datetime.strptime(fn, '%Y-%m-%d %I-%p')
      this also worked. is there any advantage of your method over mine ?

    • @alenjose3903
      @alenjose3903 3 ปีที่แล้ว +2

      @@mrrohitnayak1 its the same thing, u just named datetime as dparser, His code would call datetime.datetime.strptime() and yours call dparser.datetime.strptime()

    • @mrrohitnayak1
      @mrrohitnayak1 3 ปีที่แล้ว +1

      @@alenjose3903 any advantage in using his method over mine or vice versa

    • @alenjose3903
      @alenjose3903 3 ปีที่แล้ว

      @@mrrohitnayak1 no its the same, u can name it anything you want.

    • @Iris-fp9vo
      @Iris-fp9vo 3 ปีที่แล้ว +3

      Consider changing 'datetime.srtptime()' to 'datetime.strptime()'

  • @amazing-graceolutomilayo5041
    @amazing-graceolutomilayo5041 4 ปีที่แล้ว +90

    I bless the day I came across your channel

  • @lingen2193
    @lingen2193 4 ปีที่แล้ว +11

    While there are already other excellent pandas tutorials on TH-cam your videos IMHO stand out for their carefully crafted examples, clarity of explanation, information density and relevance to practical applications.
    I'd really love this series to be continued and cover some of the more advanced topics.

  • @ashishdeora8522
    @ashishdeora8522 3 ปีที่แล้ว +3

    The concept of resample is so damn crazy!!! I struggle so much in other systems with this kind of aggregation and it's easy like nuts in pandas...
    Corey, you are great!

  • @next848484
    @next848484 4 ปีที่แล้ว +26

    Hi Corey, thank you so much for taking the time and putting this series together. This is one of the most structured series on panda on the web. I feel confident in pandas after watching the series. We would love to see something similar on the NumPy. Please, Corey...put together a Numpy series for us.

  • @zio.eza25
    @zio.eza25 ปีที่แล้ว +1

    I learn through this video in 2023 as a new entry-level data analyst and non-native English. It's still relevant and lite but packed, especially for the `.resample()` method. It saves my time compared to using `.groupby()`. In addition, we need to keep in mind indice filtering. It's better to use `.loc[]` to avoid the deprecation issue. Thank you, Corey Schafer. Your channel is such a gem!

  • @ste9240
    @ste9240 4 ปีที่แล้ว +4

    Thanks a lot for this whole series Corey!! I've watched 3/4 other tutorials between youtube and other platforms and yours were the best by far, especially due to the fact that I'm from Italy so it was easier to follow your explanations step by step
    I can't wait for your plotting with Pandas tutorials too, those are gonna be pretty useful! Have a nice day a stay safe ;)

  • @piyushkonher8405
    @piyushkonher8405 4 ปีที่แล้ว +5

    this whole series is so beautifully explained that if one is little bit aware of python and its syntax whole series can be watched without volume .

  • @hermancoutinho
    @hermancoutinho 2 ปีที่แล้ว

    Thank you Corey for the wonderful tutorials. No paid courses can come close to the way you present the depth of the concept. Thank you for making it free and a big thank you for keeping up the good work.

  • @ZsoltPal23092011
    @ZsoltPal23092011 ปีที่แล้ว +2

    These are the most developed, best explained videos on how to use pandas - thank you Corey for putting these together!

  • @artygecko7429
    @artygecko7429 10 หลายเดือนก่อน +3

    To anyone getting the message that 'date_parser' is deprecated and will be removed in a future version...
    You no longer need to pass in a function, just the date string formats in the argument date_format='%Y-%m-%d %I-%p'
    example:
    df = pd.read_csv('ETH_1h.csv', parse_dates=['Date'], date_format='%Y-%m-%d %I-%p')

  • @mahmoudgamal9835
    @mahmoudgamal9835 4 ปีที่แล้ว +13

    Yesterday i was searching for any tutorial that explains dates and times in pandas and i couldn't find a good material
    Thanks man

  • @LegendaryGamer-px9dp
    @LegendaryGamer-px9dp 5 หลายเดือนก่อน +3

    For anyone getting error in the first part of the video, this is what worked for me
    d_parser = lambda x: pd.to_datetime(x, format='%Y-%m-%d %I-%p')
    df = pd.read_csv('data/ETH_1h.csv', parse_dates=['Date'], date_parser=d_parser)

    • @rafabws
      @rafabws 4 หลายเดือนก่อน +1

      or pass the date format directly
      df = pd.read_csv("data/ETH_1h.csv",parse_dates=['Date'],date_format = "%Y-%m-%d %I-%p")

    • @user-testo21347
      @user-testo21347 3 หลายเดือนก่อน

      @@rafabws for dfx.loc['2020-01':'2020-02'] , I get KeyError: 'Value based partial slicing on non-monotonic DatetimeIndexes with non-existing keys is not allowed.; What to do with it? plz help

    • @barissucuoglu
      @barissucuoglu หลายเดือนก่อน

      ​@@user-testo21347 I was getting same error too and I believe I found the solution. For some reason our indexes are not in order. For this reason, if we sort our indexes by writing the code dfx.sort_index(inplace=True) before writing the code dfx.loc['2020-01':'2020-02'], we get rid of the error.

    • @barissucuoglu
      @barissucuoglu หลายเดือนก่อน

      @@user-testo21347 I was getting same error too and I believe I solved it. For some reason our indexes are not in order. For this reason, if we sort our indexes by writing the code dfx.sort_index(inplace=True) before writing the code dfx.loc['2020-01':'2020-02'], we get rid of the error.

  • @MrSubWright
    @MrSubWright 9 หลายเดือนก่อน +10

    Also to get the mean for resampling, the default for the mean function was changed. So to get the weekly resampling at around 30:05 of the video:
    df.resample('W').mean() will give an error
    do this instead:
    df.resample('W').mean(numeric_only=True)

    • @sahilmehra290
      @sahilmehra290 9 หลายเดือนก่อน

      where to download this time series data please

  • @nikhil1303
    @nikhil1303 4 ปีที่แล้ว +2

    Hi Corey..I have referred to various resources to learn Pandas properly on TH-cam. And your tutorials are the best I could find. And looking forward to learn more from your matplotlib tutorials. Thanks for creating top class content!!

  • @Davidkiania
    @Davidkiania 4 ปีที่แล้ว +3

    This is the only channel I can’t wait for the new video to drop. Two weeks and waiting but who’s counting. ☺️

  • @aatt3209
    @aatt3209 4 ปีที่แล้ว +1

    I really appreciate your detailed explanation on the errors, which help us to select the right corrective action to take. Love your videos.

  • @benjamindeporte3806
    @benjamindeporte3806 4 ปีที่แล้ว +4

    Very, very, very good. The accurate and straightforward explanations are supported by great examples. Big thanks.

  • @electromorphous9567
    @electromorphous9567 4 ปีที่แล้ว +5

    Right now I'm learning flask from your playlist and it's freaking amazing!

  • @DavidChigbo
    @DavidChigbo 2 หลายเดือนก่อน +2

    You taught me pandas completely free of charge.
    Thank you, sir.

  • @kanimozhipanneerselvam3017
    @kanimozhipanneerselvam3017 4 ปีที่แล้ว +1

    I'm happy that I found a Awesome channel to learn Python for Data Science Easily!!!! Very Nice and Crystal Clear Explanation!!👌🙂

  • @dentupul
    @dentupul 4 ปีที่แล้ว +3

    Gave a thumbs up before even started watching the tutorial..and I'm not disappointed at all. Keep up the good work Corey!!!
    BTW you are the only TH-camr getting likes from me.

  • @mByyurt
    @mByyurt 4 ปีที่แล้ว +2

    I want to point out something. I like your videos. There is nothing boring about them. But I do not have much time so I watch them at 2x. Your normal talking speed is not slow but very clear so I can easily understand you at 2x too.

  • @void_invoker
    @void_invoker 4 ปีที่แล้ว +3

    My man Corey on youtube dropping bangers.
    Thanks man. Amazing content.

  • @siddharthmodi5908
    @siddharthmodi5908 3 ปีที่แล้ว

    Your every video is soo nice and perfect. The explanation is very friendly and in a proper sequence. Your channel works for me as a one-stop-shop. Thank you soo much man.❤️❤️
    Love from India 🇮🇳

  • @spicer41282
    @spicer41282 4 ปีที่แล้ว +9

    Please show more *errors and its corrections* for future vids.
    The fixes are really beneficial!
    Thanks for recording, editing, uploading and sharing! 👍

  • @puffziggy7330
    @puffziggy7330 4 ปีที่แล้ว +3

    Since I stumbled on Corey Schafer videos, it has become my favourite TH-cam Channel

  • @robcz3926
    @robcz3926 4 ปีที่แล้ว +4

    great job Corey, I was going to go through Wes McKinney's book but looking at the pandas section I realized that I already know most of it by now form your series. Pls do the data plotting soon! ;)

  • @gordonmorrow6945
    @gordonmorrow6945 4 ปีที่แล้ว

    Fantastic series! Thank you for providing this awesome content. I have been using Pandas quite a bit lately to process data, one thing I am struggling with and would really find helpful would be a video on taking these Pandas notebooks and functions and converting them into actual Python apps that can be reused on other projects. Pandas can really be like magic sometimes and being able to pass data into a Python app that could apply these functions would be extremely powerful, I have yet to find any good tutorials or reading on this subject, one from the master would be greatly appreciated!!

  • @dnyaneshwar53
    @dnyaneshwar53 2 ปีที่แล้ว

    All I can say you are awesome. The way you explained 😍✌️✌️✌️✌️. You have the best play list for pandas on web. Your efforts are truly appreciated 👍👍👍👍👍👍

  • @gwanghyeongim768
    @gwanghyeongim768 4 ปีที่แล้ว +2

    I learnt so much from this single video. Thank you Corey

  • @michaelmitchell155
    @michaelmitchell155 9 หลายเดือนก่อน

    I used this video to backtrack over some material that I wasn't sure of. This was a great video as is your other instructional videos. Thanks so much.

  • @RK_97
    @RK_97 4 ปีที่แล้ว

    Fantastic series on Pandas. I have no words to express how i am thankful to you for this series. You are a great teacher. I am yet to check on the 11th video on this series and up next to the Matplotlib series (definetely). You are awesome. A request, could you please plan to do Numpy series as well.

  • @Dara-lj8rk
    @Dara-lj8rk 4 ปีที่แล้ว +2

    Well done! I didn't know that I can pass in a dict in agg. Always passed in a list and then selected the column-aggregation tuples that I needed, which is quite resource intensive and hard to read. Also I did not know I could make datetime comparisons to a string. Thank you.

  • @djuka8121
    @djuka8121 4 ปีที่แล้ว +7

    I always give you a thumbs up before i even watch the video!
    Will you introduce us to machine learning in the near future?

  • @PrateekTrivedi6
    @PrateekTrivedi6 2 ปีที่แล้ว +1

    You are a great teacher, many thanks for putting in all the effort and time in creating such a great series :)

  • @tassoskat8623
    @tassoskat8623 3 ปีที่แล้ว

    This was an absolute masterpiece!! Thank you so much. I hope you are good and safe

  • @Terence818
    @Terence818 3 ปีที่แล้ว

    pandas plotting will be useful and intesting! looking forward to it! Thanks Corey!

  • @shreyashkashyap
    @shreyashkashyap ปีที่แล้ว +1

    Extremely useful. Thanks for the video Corey! You have my gratitude.

  • @pookpratch
    @pookpratch 4 ปีที่แล้ว

    I really like how you explain things so clearly. Super helpful for my project. Thank you so much!

  • @latenines
    @latenines 2 ปีที่แล้ว

    This is the best episode from the series, well done

  • @vishaalranjan1783
    @vishaalranjan1783 4 ปีที่แล้ว +7

    Hi Corey! Your series on pandas is really informative and helpful. Can you make a similar series for numpy? I believe that's a topic that will be of interest to many of your viewers.

  • @enrichomsrivera3637
    @enrichomsrivera3637 2 ปีที่แล้ว

    That's great Corey !!
    The best lesson about Time series, ever.

  • @drhastings
    @drhastings 4 ปีที่แล้ว

    Thank you Sir for sacrificing your precious time to teach the World.
    We appreciate you .
    l Appreciate You.

  • @rockeyvalley
    @rockeyvalley 4 ปีที่แล้ว +1

    Another great video Corey. Keep up the good work!!!

  • @wasirizvi2437
    @wasirizvi2437 4 ปีที่แล้ว +1

    As usual your videos are great ! You explain things very clearly. Please start videos on Machine Learning and Artificial Intelligence soon. Earnest request !

  • @kohyuze
    @kohyuze 4 ปีที่แล้ว +2

    There should be a mega-like button for this. Thank you so much!

  • @ScriptsNerd
    @ScriptsNerd 3 ปีที่แล้ว

    OMG!!!
    You are awesome Corey.
    I learned a lot from your videos!

  • @lukeshepherd2342
    @lukeshepherd2342 9 หลายเดือนก่อน +3

    I got an error with the lambda function trying to parse the dates while loading from csv. The error stated pandas no longer supports pd.datetime. After checking the comments I tried @larc99's suggestion but still got an error as it was expecting a string rather than a function for the format. What gave the same output and I think is much simpler (without the lambda function) turned out to be:
    df = pd.read_csv('data/ETH_1h.csv', parse_dates=['Date'], date_format='%Y-%m-%d %I-%p')

  • @HabibAlSaki
    @HabibAlSaki 4 ปีที่แล้ว +3

    Very well-explained. Thanks for this. I am requesting you to make a video on multi index in Pandas.

  • @AbdAlkareemAlBadawi
    @AbdAlkareemAlBadawi 14 วันที่ผ่านมา

    You're my savior I was searching for video as this thank you.

  • @dennisamaya9892
    @dennisamaya9892 3 ปีที่แล้ว

    This Video was sent by the Data Science Gods. Thank you so much

  • @Rocklee46v
    @Rocklee46v 4 ปีที่แล้ว +2

    I watched this video when the likes for this video are 999 and I hit like right away to make it 1K, because I believe Corey's content!

  • @tinylittleanj2
    @tinylittleanj2 3 ปีที่แล้ว +1

    Corey this series is fantastic, thank you!

  • @Prasanna_Rahavendra
    @Prasanna_Rahavendra 4 ปีที่แล้ว

    Hey Corey! All your teachings are really good. I just need to know where you refer for your tutorials. Apart from the documentation, can you please tell me some other references that I could use?

  • @Soulfulreader786
    @Soulfulreader786 2 ปีที่แล้ว

    Pandas is so easy to work on after seeing this vid.
    Wish my coaching teacher taught me this simpler way

  • @aaronglenn7764
    @aaronglenn7764 4 ปีที่แล้ว

    Hi Corey, Thank you so much for the videos! I was wondering when you are going to come out with the Pandas plotting series you mention? Thanks again!

  • @Dark16Force
    @Dark16Force 4 ปีที่แล้ว +1

    Oh Corey, you legend! Thanks mate, this is some great stuff.

  • @user-vd1rj7hb1o
    @user-vd1rj7hb1o 6 หลายเดือนก่อน +1

    I loved the point Corey made about how much easier it is to use date as your index, however while following along I got a warning this warning:
    FutureWarning: Indexing a DataFrame with a datetimelike index using a single string to slice the rows, like `frame[string]`, is deprecated and will be removed in a future version. Use `frame.loc[string]` instead.

  • @timstark6744
    @timstark6744 4 ปีที่แล้ว +3

    Great job Corey! Thank you!

  • @gersonbronstein4664
    @gersonbronstein4664 5 หลายเดือนก่อน

    Hy, Corey. First of all, I would like to thank you for the videos. They are very clear and comprehensive. I already became a member to help you. I have one question regarding the week agregation (around 32'.) The open and close prices for a week should be the prices for the first and last hours of the week respectively. How do I do that?? Should I create a custom function for this?? Thks, GB

  • @ketanbutte3497
    @ketanbutte3497 4 ปีที่แล้ว

    just 11 minutes in the video, i hit the like button as hard as i can !!

  • @Jota_VA
    @Jota_VA 2 ปีที่แล้ว

    Your tutorials are the bests on the web. Before I can do the index filter (19:35 in the video), I had to sort the dataframe by index(df = df.sort_index()) I don't know why but it was given me an error (AssertionError: ).
    Thanks for sharing.

  • @kaifahmad4131
    @kaifahmad4131 3 หลายเดือนก่อน +1

    Best tutorial series for pandas ever❤❤❤

  • @dakshaudawatta8219
    @dakshaudawatta8219 4 ปีที่แล้ว +1

    Thanks a lot for this pandas series Corey ♥️.

  • @17avskadoo
    @17avskadoo 3 ปีที่แล้ว

    again and again awesome information

  • @DerMaikNichJa
    @DerMaikNichJa 4 ปีที่แล้ว

    Cool stuff. Could you show how to do this if you have more than one "index column"? For this particular Domain, how do I aggregate over intervals by grouping variable like separate stocks

  • @misseghe3239
    @misseghe3239 2 ปีที่แล้ว

    Thank you thank you thank you, Finally found the help that i needed!

  • @2w2e-channel
    @2w2e-channel 4 ปีที่แล้ว +2

    Corey, Thank you for the nice vide series. Could you please provide a video on multiple indexes? Also, do you have any plans for other Python libraries? such as Numpy, Sklearn?
    Regardless, Thank you so much for nice videos.

  • @ioannisnikolaospappas6703
    @ioannisnikolaospappas6703 ปีที่แล้ว

    Thanks Corey!
    Your videos are always lit!
    😁✌️

  • @mapa5000
    @mapa5000 ปีที่แล้ว

    What a detailed video !! Thank you man !!! 🤝

  • @RishiSaikia
    @RishiSaikia 3 ปีที่แล้ว +11

    At 20.08, calling df['2019'] didn't work for me. Gave an 'Assertion Error'. df.loc['2019'] worked though.

  • @stayshikshit
    @stayshikshit 3 ปีที่แล้ว

    @CoreySchafer, you're awesome. Loved your quality video content. Your work is fabulous.
    I'd really like to see DevOps related content if it comes under your scope. Thanks 👍

  • @tassoskat8623
    @tassoskat8623 3 ปีที่แล้ว

    Hello, this was a great!
    I was wondering wouldn't it make more sense to take the close of the last day of the week rather than mean (in the resampling part).
    Any answer would be appreciated!
    Thanks!

  • @kavehyarohi2886
    @kavehyarohi2886 4 ปีที่แล้ว +1

    corey man, i feel obligated to say thank you , aaaaaaaaaaaaaaaaaand a big well done sir.

  • @kmohan1010
    @kmohan1010 4 ปีที่แล้ว +1

    How to extract the Close value for each Day? Is there an Agg function for the last value similar to min/max/mean?

  • @juanestebanmartinez2806
    @juanestebanmartinez2806 4 ปีที่แล้ว

    Hi Corey, first I`ll like to thank you for this excellent material, it has been a great series. But I do have a question, i`m currently working in a project where I need to set the value of a variable equal to the value of a cell in excel. For example, var1 = Value in cell F12, so i can use the value of this variable to operate over a data frame.
    The problem has been that the cells to be collected as values are not in a data frame format, just spread across different sheets.
    Thanks

  • @P3ace1oveUn1ty
    @P3ace1oveUn1ty 2 ปีที่แล้ว

    Loving this series and have learned a lot. The prices of ETH in this video, wow!

    • @RAJASEKARCMR
      @RAJASEKARCMR 9 หลายเดือนก่อน

      டேmn true. $200 vs $4900 at peak and $1700 now

  • @finnmccool8671
    @finnmccool8671 4 ปีที่แล้ว

    Thanks, Corey. A great series of video lessons!
    I've been using .agg along the lines of:
    dfGroup1 = dfTest.groupby(
    ["area", "instrument"]).agg(
    Sales=('quantity', 'sum')
    )
    This allows you to rename columns. Is the dictionary method the preferred way?

  • @RajatSharma-ct6ie
    @RajatSharma-ct6ie 4 ปีที่แล้ว +7

    Hey Corey, Shall we wait for NumPy series as well after this ?

  • @swarnachoppella388
    @swarnachoppella388 ปีที่แล้ว +1

    Thank you for the detailed explanation

  • @jubayerhossain8812
    @jubayerhossain8812 4 ปีที่แล้ว +1

    Great! I love your lectures.

  • @charlesmiele2135
    @charlesmiele2135 2 ปีที่แล้ว

    This is so in depth I love it thank you!

  • @obi666
    @obi666 4 ปีที่แล้ว

    After using d_pars = lambda x: datetime.strptime(x, '%Y-%m-%d %I-%p') i got this warning: FutureWarning: The pandas.datetime class is deprecated and will be removed from pandas in a future version. Import from datetime module instead. I mean, i still get correct output , but if u wanna remove this warning just do: from datetime import datetime then d_pars = lambda x: datetime.strptime(x, '%Y-%m-%d %I-%p').
    Treat it like curiosity or sth
    Btw, great videos, sir!

  • @questscape
    @questscape 3 ปีที่แล้ว

    Values the time of viewers. Not sure if you really speak so fast or just increase speed in the first and last few mins of video.
    I just like the way it goes. Thanks for the great tutorial.

  • @disko.kommando
    @disko.kommando 4 ปีที่แล้ว +3

    Thank you again Corey!!

  • @mercyjhansibai8302
    @mercyjhansibai8302 2 ปีที่แล้ว

    This is great. Can you also make a video on how to work with timeseries data where I also want index by another categorical column and how to work on grouping them and doing forward filling

  • @tomasvallotton9264
    @tomasvallotton9264 4 ปีที่แล้ว +1

    Unrelated question: what theme do you use in you videos when using sublime text? I'm dying to know.

  • @tumul1474
    @tumul1474 3 ปีที่แล้ว +1

    Amazing tutorials man ! thank you so much

  • @tejasjoshi3724
    @tejasjoshi3724 4 ปีที่แล้ว

    Hi Corey,
    I am facing the following error while loading the csv in the dataframe variable df.
    ParserError: Expected 1 fields in line 278, saw 2
    Can you please provide some guidance for the same?? Thank you

  • @shreyashkarlalsahu9248
    @shreyashkarlalsahu9248 4 ปีที่แล้ว

    On executing the part of d_parser while reading the CSV file, the interpreter shows "FutureWarning: The pandas.datetime class is deprecated and will be removed from pandas in a future version. Import from datetime module instead."
    How to prevent from getting any error in the future version of pandas?

  • @antman7673
    @antman7673 ปีที่แล้ว

    This will be the video I am looking for. I am pretty sure. LET IT BE!
    Pretty mad how much I failed at plotting panda dataframe in matplotlib sorted by month.
    We have a german saying, „taking a detour increases the knowledge of your surroundings“.
    From 0% to 5% domain knowledge in multiple hours:/

  • @shurooqfehaid218
    @shurooqfehaid218 2 ปีที่แล้ว

    Thank god I find ur channel it’s been helpful, but I have a question.. If I had 2 columns in a data frame and I want to calculate the end_date and start_date how that can be done ?

  • @joshsmart9178
    @joshsmart9178 3 ปีที่แล้ว

    Hi what if I have a dataset with the same date in like 6 rows but a different hour attached to each row and I want to condense the 6 date rows into just two rows of the same date and use that to group my other non-date columns into things like .agg{ revenue: 'sum', orders: 'count"}? In your dataset for example, what if I want to aggregate the dates for every day into two rows of 12 hours each? How do I do that?

  • @zhongyuqin3274
    @zhongyuqin3274 2 ปีที่แล้ว +2

    Hi Corey, thanks for the video. I do have one question. When I call by df ['2019'], it will give me error: AssertionError: . I have checked the data type, the index is DateTimeIndex. Can you help to this? Thank you!

  • @puffziggy7330
    @puffziggy7330 4 ปีที่แล้ว +1

    Awesome stuff as always

  • @Reeemt
    @Reeemt ปีที่แล้ว

    Thank you for the great videos!
    Is it possible to resample textbody like comments? and what would be the argument for that in the resample function ?

  • @moushumitamanna
    @moushumitamanna 4 ปีที่แล้ว

    You are the best teacher. period