Python Feature Scaling in SciKit-Learn (Normalization vs Standardization)

แชร์
ฝัง
  • เผยแพร่เมื่อ 5 ส.ค. 2023
  • Today we take a look at how we can apply feature scaling to data sets within scikit-learn in python. This is useful when applying Normalization or standardization to data which allows for machine learning models to perform better.
    Dataset is available on my Github
    Interested in discussing a Data or AI project? Feel free to reach out via email or simply complete the contact form on my website.
    📧 Email: ryannolandata@gmail.com
    🌐 Website & Blog: ryannolandata.com/
    🍿 WATCH NEXT
    Scikit-Learn and Machine Learning Playlist: • Scikit-Learn Tutorials...
    Cross Validation: • A Comprehensive Guide ...
    Train Test Split: • Train Test Split with ...
    Logistic Regression: • Hands-On Machine Learn...
    MY OTHER SOCIALS:
    👨‍💻 LinkedIn: / ryan-p-nolan
    🐦 Twitter: / ryannolan_
    ⚙️ GitHub: github.com/RyanNolanData
    🖥️ Discord: / discord
    📚 *Practice SQL & Python Interview Questions: stratascratch.com/?via=ryan
    WHO AM I?
    As a full-time data analyst/scientist at a fintech company specializing in combating fraud within underwriting and risk, I've transitioned from my background in Electrical Engineering to pursue my true passion: data. In this dynamic field, I've discovered a profound interest in leveraging data analytics to address complex challenges in the financial sector.
    This TH-cam channel serves as both a platform for sharing knowledge and a personal journey of continuous learning. With a commitment to growth, I aim to expand my skill set by publishing 2 to 3 new videos each week, delving into various aspects of data analytics/science and Artificial Intelligence. Join me on this exciting journey as we explore the endless possibilities of data together.
    *This is an affiliate program. I may receive a small portion of the final sale at no extra cost to you.
  • วิทยาศาสตร์และเทคโนโลยี

ความคิดเห็น • 13

  • @v.jananayagan3284
    @v.jananayagan3284 8 วันที่ผ่านมา

    you teach very well than other channels but i don't know why pepoles are not spend time on your channel really helpfull man

    • @RyanNolanData
      @RyanNolanData  8 วันที่ผ่านมา +1

      Thanks it’ll happen over time

  • @user-rw3vn5om3t
    @user-rw3vn5om3t 2 หลายเดือนก่อน +4

    underrated channel great video

  • @sara-sx7gm
    @sara-sx7gm 10 วันที่ผ่านมา

    Helpful . Thank you so much

  • @Welcomereddy
    @Welcomereddy 9 หลายเดือนก่อน +1

    Excellent brother !

  • @lancerkind
    @lancerkind 3 หลายเดือนก่อน

    Very good video! I learned a lot. If I was to ask for more, it would be to fill in WHY normalize or standardized. You mention some about “getting your numbers in order.” Add to that there are reasons for visualization tools, comparison analysis, and whatever else. I have some ideas why, but I’m guessing as a Pandas user you have encountered many more.
    Thank you for sharing.

    • @RyanNolanData
      @RyanNolanData  3 หลายเดือนก่อน

      No problem and I may make a statistics course video in the future. Just waiting on my job to apply more skills

  • @onurbltc
    @onurbltc 10 หลายเดือนก่อน +1

    Great video!

  • @lafo1639
    @lafo1639 6 หลายเดือนก่อน

    Could you also explain how the choice of feature_range affects the output processing please? Trying to understand in which case it should be (0,5) and when it should be (0,10), and how you then interpret the output, for example? Also, I am wondering: you are applying scalers to the whole dataset, but what if you have a regression type task (predicting an actual number)? If you apply scalers to all columns then your targets also change