Why do we split data into train, test, and validation sets?

  • Published on 21 Feb 2023
  • To train machine learning models, we need to provide the model with a training set and a testing set, and sometimes also a validation set. These terms are often used interchangeably, which causes confusion. So once and for all, let's learn what each of these data splits does and how it contributes to model development.
    👋 Keep in touch?
    ==========================
    🐥 Twitter - / misraturp
    🔗 LinkedIn - / misraturp
    📹 TH-cam - / @misraturp
    🌎 Website - misraturp.com/
    Courses & resources
    ============================
    📙 Fundamentals of Deep Learning in 25 pages
    misraturp.gumroad.com/l/fdl
    👩‍💻 Hands-on Data Science: Complete your first portfolio project
    www.misraturp.com/hods
    📥 Streamlit template
    misraturp.gumroad.com/l/stemp
    🤖 Deep Learning 101 with Python and Keras (FREE)
    • 50 Days of Deep Learning
    🏃‍♀️ Data Science Kick-starter mini-course (FREE)
    misraturp.gumroad.com/l/kick-...
    🐼 Pandas cheat sheet (FREE)
    misraturp.gumroad.com/l/pandascs
    📝 NNs hyperparameters cheat sheet (FREE)
    misraturp.gumroad.com/l/hcs
  • Science & Technology
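
A minimal sketch of the three-way split the description refers to, using only the Python standard library. The function name `train_val_test_split`, the 70/15/15 proportions, and the fixed seed are illustrative assumptions and are not taken from the video. In the usual convention, the training set is used to fit the model, the validation set to compare settings such as hyperparameters, and the test set is held back for one final evaluation.

```python
import random


def train_val_test_split(rows, val_frac=0.15, test_frac=0.15, seed=0):
    """Shuffle rows and cut them into three disjoint lists.

    Hypothetical helper: the fractions and the seed are assumptions
    chosen for illustration, not values recommended by the video.
    """
    if val_frac < 0 or test_frac < 0 or val_frac + test_frac >= 1:
        raise ValueError("val_frac and test_frac must be >= 0 and sum to < 1")

    shuffled = list(rows)                  # copy so the caller's data is untouched
    random.Random(seed).shuffle(shuffled)  # seeded shuffle for reproducibility

    n_test = round(len(shuffled) * test_frac)
    n_val = round(len(shuffled) * val_frac)

    test = shuffled[:n_test]                 # held back for the final evaluation
    val = shuffled[n_test:n_test + n_val]    # used to compare model settings
    train = shuffled[n_test + n_val:]        # used to fit the model
    return train, val, test


rows = list(range(100))
train, val, test = train_val_test_split(rows)
print(len(train), len(val), len(test))  # 70 15 15
```

Shuffling before cutting matters when the rows are stored in some order (for example, sorted by label or by date of collection); for time-ordered data a chronological cut may be more appropriate than a random one.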

Comments • 33

  • @iaboodws11
    @iaboodws11 1 year ago +12

    Just what I was looking for, your video is so simple and easy to understand, and straight to the point!!!

    • @misraturp
      @misraturp  1 year ago +1

      Great to hear, thank you :)

  • @jamalnuman
    @jamalnuman 4 months ago

    Thanks for making a distinction between testing and validation

  • @syahwiza
    @syahwiza 1 year ago

    Thanks for your explanation, it is very useful for me.

  • @sapnilpatel1645
    @sapnilpatel1645 1 year ago

    The video is very useful. Your channel is so underrated.

  • @toyl6727
    @toyl6727 1 year ago

    Brilliant and clear!

  • @facundostratocaster356
    @facundostratocaster356 3 months ago

    Simple and good explanation, thank you so much ☺️

  • @bay-bicerdover
    @bay-bicerdover 1 year ago

    Good one!

  • @Randomsi_10
    @Randomsi_10 6 days ago

    This is very helpful on my ongoing thesis🥹

  • @ArifMuhammad-qd6vf
    @ArifMuhammad-qd6vf 1 year ago

    Superb

  • @volodyslove
    @volodyslove 1 year ago

    You are the best, thank you!😊

  • @misha4915
    @misha4915 1 year ago +6

    After finding the best hyperparameters for a model using validation data, should we retrain the model using both the training and validation data before using it on the test data?

  • @SocialAviation
    @SocialAviation 1 year ago +1

    I love your content. Every time I split my data into train and valid, either using the trainsplit function or manually, my val loss does not decrease below 1. The only way to get my val loss lower and lower is to use part of my train data as validation data 😢

    • @misraturp
      @misraturp  1 year ago +1

      Your model might be overfitting in the first case :/

  • @aowowow-no1xe
    @aowowow-no1xe 12 days ago

    What camera do you use?

  • @jamesadeke9873
    @jamesadeke9873 7 months ago

    Good day ma, please can you help me out? I have been trying to figure this out for a long time but could not. I want to know the best evaluation plots for machine learning models, specifically for classification problems. How can one best visualize performance? With deep learning models you can use train and test curves, but how can we best visualize performance with machine learning models? Have you made any video about that? I have been checking your playlists but can't find one. Kindly help us out. Thanks

  • @ozgurartok9488
    @ozgurartok9488 10 months ago

    Thank you.

  • @097_suryakantdhote9
    @097_suryakantdhote9 11 months ago

    Please make a video on logistic regression.

  • @jamalnuman
    @jamalnuman 4 months ago

    What is the need for testing data if the hyperparameters don't need to be optimized?

  • @patiklimikrofon
    @patiklimikrofon 2 months ago

    Thank you!

    • @misraturp
      @misraturp  2 months ago

      You're welcome!

  • @jameshopkins3541
    @jameshopkins3541 5 months ago

    Can you explain something about it?????
    For example, the meaning and usefulness of each one.

  • @jsingh190
    @jsingh190 1 year ago

    Can you make a project that uses some ML application?

  • @bay-bicerdover
    @bay-bicerdover 1 year ago

    The blop effect at 0:50 scared the life out of me

  • @jameelabduljalil25
    @jameelabduljalil25 1 year ago

    No demo?

    • @misraturp
      @misraturp  1 year ago

      Not for this one :)

  • @SyamKishoreNaidu
    @SyamKishoreNaidu 1 year ago

    Can we expect more pandas-related videos?

    • @misraturp
      @misraturp  1 year ago

      For sure! More to come.

  • @sumitranjan7858
    @sumitranjan7858 9 months ago

    You are soo cute❤❤

  • @babaabba9348
    @babaabba9348 9 months ago

    Ah, if you were living in France, I would have married you immediately, I would have taken you to some fancy restaurant everyday and during the night, you would have done my assignments within data science.

  • @jameshopkins3541
    @jameshopkins3541 5 months ago

    NOLIKE for UN useful!!!!!