Why do we use "e" in the Sigmoid?

  • Published Jan 10, 2025

Comments • 42

  • @halibrahim · 1 year ago · +15

    Thank you for making us love math even more.

    • @ritvikmath · 1 year ago · +1

      Glad you enjoy it!

  • @pawanbhatt314 · 1 year ago · +3

    Thank you for this intuitive video; I just had this thought yesterday.
    Please keep uploading videos like this. It makes my intuition stronger and brings me closer to statistics.

  • @haojiang4882 · 1 month ago

    Holy shit! I wish I could have watched this video 6 years ago when I was just getting into machine learning. You did a great job! Thank you so much!

    • @ritvikmath · 29 days ago

      Thanks so much! Glad it was helpful!

  • @ramirolopezvazquez4636 · 1 year ago · +2

    Thanks! I really appreciate these bits of useful, subtle, and insightful ideas about common objects in data science.

  • @SuperMtheory · 1 year ago · +4

    Makes sense: dy/dx = y(1 - y) if k = e. Great video!
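    The clean derivative identity mentioned above is easy to check numerically; a minimal Python sketch (the function names here are my own, not from the video):

    ```python
    import math

    def sigmoid(x, k=math.e):
        # Generalized sigmoid with base k: 1 / (1 + k^(-x))
        return 1.0 / (1.0 + k ** (-x))

    def numeric_derivative(f, x, h=1e-6):
        # Central finite difference approximation of f'(x)
        return (f(x + h) - f(x - h)) / (2 * h)

    x = 0.5
    y = sigmoid(x)  # base e: derivative is exactly y * (1 - y)
    assert abs(numeric_derivative(sigmoid, x) - y * (1 - y)) < 1e-6

    # With any other base k, an extra factor ln(k) appears:
    # dy/dx = ln(k) * y * (1 - y)
    y2 = sigmoid(x, k=2)
    d2 = numeric_derivative(lambda t: sigmoid(t, k=2), x)
    assert abs(d2 - math.log(2) * y2 * (1 - y2)) < 1e-6
    ```

    Choosing k = e makes ln(k) = 1 and the factor disappear, which is exactly the convenience the comment points at.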

  • @sashayakubov6924 · 7 months ago

    Love this nonchalant explanation :)

  • @JeremiahLam-s6d · 1 year ago · +2

    Huh, so I guess this is like a tradeoff of annoyances, where using e upfront is just less annoying than discovering ln(k) much later.

  • @apoorvatiwari8287 · 1 year ago

    Nice explanation. Clarifies everything.

    • @ritvikmath · 1 year ago

      Glad it was helpful!

  • @JoeBurnett · 4 months ago

    Thank you for this explanation!

  • @ChocolateMilkCultLeader · 1 year ago · +1

    The best in the game for this kind of content.

  • @Justin-zw1hx · 1 year ago

    always high quality content

  • @uusserrrreesssuuu · 1 year ago

    Are operations with e more expensive than with 2 or 3?
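    For what it's worth, this question can be measured directly. A rough Python micro-benchmark sketch (timings will vary by machine and interpreter; math libraries typically compute k^x as e^(x·ln k) anyway, so the base itself makes little difference):

    ```python
    import timeit

    # Per-call cost of e^x vs 2^x vs 3^x, measured over a million calls.
    setup = "import math; x = 0.7"
    for expr in ("math.exp(x)", "2 ** x", "3 ** x", "math.pow(2, x)"):
        t = timeit.timeit(expr, setup=setup, number=1_000_000)
        print(f"{expr:15s} {t:.3f} s per million calls")
    ```

    In practice the choice of e is about mathematical convenience (the clean derivative), not runtime cost.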

  • @giorda77 · 1 year ago

    Really good explanation. Keep it up :)

  • @jasdeepsinghgrover2470 · 1 year ago · +1

    Actually, there is a better reason, though I am still not sure about it... The sigmoid is derived by running linear regression on the log-odds of the two classes, so mx + c = ln(p/(1-p)), which gives p = 1/(1 + e^-(mx+c)).
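    The log-odds derivation in this comment can be verified with a quick round-trip check (the slope, intercept, and input below are made-up values for illustration):

    ```python
    import math

    m, c, x = 2.0, -1.0, 0.8        # hypothetical slope, intercept, input
    z = m * x + c                    # linear model of the log-odds
    p = 1.0 / (1.0 + math.exp(-z))   # sigmoid recovers the probability

    # Round-trip: the log-odds of p equal the linear predictor z,
    # confirming that the sigmoid is the inverse of ln(p / (1 - p)).
    assert abs(math.log(p / (1 - p)) - z) < 1e-12
    ```

    This is also why the natural log (and hence e) appears: the logit link ln(p/(1-p)) inverts exactly to the base-e sigmoid.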

  • @JMBalaguer · 1 year ago

    Thanks for the explanation!

  • @masster_yoda · 1 year ago

    This is a nice explanation, but one question is left open for me: we interpret the result of the sigmoid as a probability, so sigmoid(x) gives the probability of something being classified into some category. Assume the standard sigmoid(x) yields 0.7. If I change the sigmoid to use some other number k instead of e, this probability would change - say to 0.9 instead of 0.7. That seems semantically completely different from 0.7. So I would conclude that, with respect to the interpretation as a probability, the choice between e and some other number k is not arbitrary.

    • @MalTimeTV · 1 year ago · +1

      We use the sigmoid function because it maps the real number space to the [0, 1] space, so in practice regression values can be mapped to probabilities. Like you say, some regression value x might map to 0.7. But what you are generally interested in is not the 0.7 itself, but the value of the sigmoid for a given data point relative to other data points. A concrete example might help to clarify:
      Say we have a bunch of predictors (from a linear regression, say) - e.g. weather data for temperature and pressure at some location - and we combine them via a linear relationship y = b0 + b1 * temp + b2 * pressure. We now want to map y (a real value) to a probability of rain, so we use the sigmoid, and we might get 0.7 for a given observation of temperature and pressure. Does that mean we have a model which predicts a 70% chance of rain? Not necessarily - and probably not even close. In practice, you use the 70% relative to the values of the other observations. You might use a threshold of 50% and classify all values above it as "expecting rain" and all values below as "not expecting rain". But then you might find that the 50% threshold does not really hold up when you apply your model to historical data with known outcomes. However, if you tune the threshold (exploring other possible values, e.g. 20%, 21%, ..., 69%, 70%), you might find that a 30% threshold yields very high accuracy (even against data which you set aside and with which you didn't train your model).
      So, in other words, in practice you rarely take the sigmoid function as a literal mapping from the real line to the probability line. You just allow it to perform that mapping because it helps to define, in some sense, a classification rule, and once you have that rule you can fine-tune the threshold to optimise your model. A long answer, I know, but I figured I would share this since I had wondered the same thing for quite a long time - until I saw how things worked in practice.
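      The threshold-tuning procedure described above can be sketched in a few lines of Python (the probabilities and labels below are made-up toy data, not from any real model):

      ```python
      # Sigmoid outputs for held-out observations, and the true 0/1 outcomes.
      probs  = [0.15, 0.25, 0.35, 0.45, 0.55, 0.65, 0.75, 0.85]
      labels = [0,    0,    1,    1,    0,    1,    1,    1   ]

      def accuracy(threshold):
          # Classify as 1 ("expecting rain") when the probability clears the threshold.
          preds = [1 if p >= threshold else 0 for p in probs]
          return sum(p == y for p, y in zip(preds, labels)) / len(labels)

      # Scan candidate thresholds from 20% to 70% rather than assuming 50% is best.
      best = max((t / 100 for t in range(20, 71)), key=accuracy)
      print(best, accuracy(best))
      ```

      On this toy data a threshold near 30% beats the default 50%, which mirrors the point made above: the sigmoid output is used relatively, via a tuned cutoff, not as a literal probability.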

    • @thankforyourvideo · 1 year ago

      @@MalTimeTV thanks a lot for your answer
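    On the original question of replacing e with another base k: algebraically, k^(-x) = e^(-x·ln k), so a base-k sigmoid is just the base-e sigmoid with a rescaled input, and a fitted model would absorb the ln(k) factor into its learned weights. A quick check (function name is my own):

    ```python
    import math

    def sigmoid_base(x, k):
        # Generalized sigmoid: 1 / (1 + k^(-x))
        return 1.0 / (1.0 + k ** (-x))

    # Changing the base only rescales the input:
    # sigmoid with base k at x == sigmoid with base e at x * ln(k).
    x, k = 1.3, 2.0
    lhs = sigmoid_base(x, k)
    rhs = sigmoid_base(x * math.log(k), math.e)
    assert abs(lhs - rhs) < 1e-12
    ```

    So the two model families produce identical probabilities after fitting; the base choice changes the weights, not the achievable outputs, which is why e (with its clean derivative) wins by convenience.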

  • @reekdas9219 · 9 months ago

    thanks bro, interesting video!

    • @ritvikmath · 9 months ago

      Glad you liked it!

  • @marcfruchtman9473 · 1 year ago

    Great info. Thank you.

    • @ritvikmath · 1 year ago

      Glad it was helpful!

  • @jakobpirs1392 · 1 year ago · +1

    Very similar to the logit function.

  • @gamuchiraindawana2827 · 1 year ago

    It looks like the logistic function.

  • @r0cketRacoon · 5 months ago

    amazing

  • @16876 · 1 year ago

    very nice 😎

  • @pratyush7987 · 2 months ago

    nice

  • @luispericchi9347 · 1 year ago

    i like my curves like that

  • @nononnomonohjghdgdshrsrhsjgd · 8 months ago

    sorry, you didn't explain anything.
