Anomaly detection in time series with Python | Data Science with Marco

แชร์
ฝัง
  • เผยแพร่เมื่อ 4 ก.ค. 2024
  • A hands-on lesson on detecting outliers in time series data using Python.
    Full source code: github.com/marcopeix/youtube_...
    Dataset can be found here: github.com/numenta/NAB/blob/m...
    Labels can be found here: github.com/numenta/NAB/blob/m...
    Chapters:
    Introduction - 0:00
    Get the data - 4:11
    Robust Z-score method - 9:08
    Robust Z-score method (code) - 13:12
    Isolation forest - 20:48
    Isolation forest (code) - 22:33
    Local outlier factor - 27:16
    Local outlier factor (code) - 31:21
    Thank you - 34:01
  • วิทยาศาสตร์และเทคโนโลยี

ความคิดเห็น • 27

  • @user-ms4et7nv9p
    @user-ms4et7nv9p 5 หลายเดือนก่อน

    Hi Marco!! Thank you so much for making great videos on "Anomaly detection". Great Great work! Please keep sharing! 🙏🙏🙏🙏

  • @pabloarriagadaojeda6452
    @pabloarriagadaojeda6452 9 หลายเดือนก่อน +2

    hey Marco!! This is the first time I've watched one of your videos, and after 5 minutes of starting the video, I quickly went through your entire channel, looking at your content. It's AMAZING! Thank you for all your efforts to share your knowledge with the community. A hug from Chile!!

    • @datasciencewithmarco
      @datasciencewithmarco  9 หลายเดือนก่อน

      Thanks Pablo for the kind words! Really appreciate it!

    • @belkmouf
      @belkmouf 6 หลายเดือนก่อน

      You rock bro🎩 off to you.

  • @EngMAli-vk3nz
    @EngMAli-vk3nz 11 หลายเดือนก่อน +4

    Thanks for this
    We Hope to make Some One For MultiVariate Time Series Anomaly Detection

  • @nathaliegrand4018
    @nathaliegrand4018 9 หลายเดือนก่อน

    Excellent presentation. Very clear explanation. Would be great to have more info on the impact of the context and wich one of the methods is expected to work best in wich context.

  • @joaovict007
    @joaovict007 3 หลายเดือนก่อน

    Very interesting content, thank you!

  • @eladiomendez8226
    @eladiomendez8226 ปีที่แล้ว +1

    Great video

  • @pinoyguitartv
    @pinoyguitartv ปีที่แล้ว

    Thanks for this 🤘

  • @salafghaniedsa669
    @salafghaniedsa669 ปีที่แล้ว

    awesome!

  • @littlepigywigy
    @littlepigywigy 4 หลายเดือนก่อน

    nice and clear

  • @hoanhvong
    @hoanhvong 5 หลายเดือนก่อน

    🎉 thank you a lot

  • @user-ou5yb7uk3g
    @user-ou5yb7uk3g 4 หลายเดือนก่อน

    Hello Marco, thank you so much for such a great video. Can you please make a video on anomaly detection for time series data using pycaret.

  • @jannoona123
    @jannoona123 8 หลายเดือนก่อน

    Hi Marco! I'm working on a project and this has a lot of components I need. I noticed the specification of the data said that it was being recorded every 5 minutes, could you create a tutorial on how to retrieve a stream of live data and pass it to the algorithm in a somewhat real-time fashion? I hope this is similar to what I understood from your data collection in the video

    • @jiretkatharpi1099
      @jiretkatharpi1099 7 หลายเดือนก่อน

      Hi I wanted to work on the same thing, did you get anything?

  • @isabelahorta3063
    @isabelahorta3063 4 หลายเดือนก่อน

    Hi! Do you recomend any video for pattern-wise anomaly detection?

    • @datasciencewithmarco
      @datasciencewithmarco  4 หลายเดือนก่อน +1

      I don't know any, but you can look at the library TOAD for anonaly detection in time series. They do pattern-wise detection if I remember well

  • @harrishvar7677
    @harrishvar7677 8 หลายเดือนก่อน

    Hey !, Is it possible to identify and flag anomalies within a continuous numerical attribute?

    • @datasciencewithmarco
      @datasciencewithmarco  8 หลายเดือนก่อน

      If by continuous, you mean at a very high frequency, then yes, I don't see why not!

    • @harrishvar7677
      @harrishvar7677 8 หลายเดือนก่อน

      Thanks !, If possible, can you make a video on that, it would be really helpful !@@datasciencewithmarco

  • @code2compass
    @code2compass หลายเดือนก่อน

    Hello!! quick question, why is the threshold 3.5 any reason please?

  • @user-qz3nx4xy8c
    @user-qz3nx4xy8c 3 หลายเดือนก่อน

    how about random cut forest ?

  • @ananya_kathak
    @ananya_kathak 10 หลายเดือนก่อน +1

    What is the accuracy?

    • @datasciencewithmarco
      @datasciencewithmarco  10 หลายเดือนก่อน +1

      Here, accuracy is really not a good idea, because there are so few anomalies. A simple baseline could achieve 99% accuracy, even though there is no "learning". That's why we use the confusion matrix here to see if we can actually identify anomalies.

  • @gouthamkarakavalasa4267
    @gouthamkarakavalasa4267 7 หลายเดือนก่อน +1

    Anomaly detection is unsupervised, how did you get to if a point is anomaly or not, even before training the model ?

    • @datasciencewithmarco
      @datasciencewithmarco  7 หลายเดือนก่อน +2

      The dataset is labeled. That way, we can measure the performance of each anomaly detection methods.

    • @devanshtalapa1416
      @devanshtalapa1416 3 หลายเดือนก่อน

      We got a few positive labels in cross validation