Predicting Absenteeism at Work - Data Every Day

Gabriel Atkin

2,067 views


Published on Feb 3, 2025

Comments • 5

@subrahmanyamhsb1756 a year ago
Very High Quality Content! You deserve a lot more Subs!
@davidda5599 several months ago
Hi, the distribution of absence hours in most samples is highly right-skewed. Would it help to first log-transform y in your example? Or could you train a Poisson or gamma regression model, which is likely to fit the data better?
@nitigyahanda3516 2 years ago
Hi there, could you please tell me how the "workload" variable has been coded? For example, what does workload/day = 253 actually mean?
@aroonkumar6202 3 years ago
Is there a way to ensure that we scale only the numerical columns and not the dummy variables? I feel that leaving the dummies as they are might increase the R-squared.
@gcdatkin 3 years ago ⁺¹
In my experience, scaling the dummies does nothing to inhibit the performance of the model. What is important is that the relative differences between the values are maintained, not the actual values themselves.
However, if you want to scale only the numerics, just split the DataFrame into two sections (numeric and categorical):
numeric_columns = []
numeric_data = df.loc[:, numeric_columns].copy()
categorical_data = df.drop(numeric_columns, axis=1).copy()
Then apply the scaler's fit and transform functions to only the numeric data:
scaler = ()
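# fit_transform returns a NumPy array, so wrap it back into a DataFrame before concatenating
numeric_data = pd.DataFrame(scaler.fit_transform(numeric_data), columns=numeric_columns, index=df.index)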
Finally, concatenate the DataFrames back together:
df = pd.concat([numeric_data, categorical_data], axis=1)
Hope this helps! :)
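
The steps in the reply above can be put together as one runnable sketch. This is only an illustration under stated assumptions: the toy DataFrame, the column names in numeric_columns, and the choice of scikit-learn's StandardScaler are all hypothetical, since the reply does not name the columns or the scaler class.

```python
import pandas as pd
from sklearn.preprocessing import StandardScaler

# Hypothetical toy data: two numeric columns and two one-hot dummy columns.
df = pd.DataFrame({
    "age": [25, 32, 47, 51],
    "workload": [240.0, 253.0, 270.0, 310.0],
    "season_2": [0, 1, 0, 0],
    "season_3": [0, 0, 1, 0],
})

# Assumed column names; replace with the numeric columns of your own data.
numeric_columns = ["age", "workload"]

# Split the DataFrame into a numeric part and a categorical (dummy) part.
numeric_data = df.loc[:, numeric_columns].copy()
categorical_data = df.drop(numeric_columns, axis=1).copy()

# StandardScaler is an assumption; any scikit-learn scaler is used the same way.
scaler = StandardScaler()

# fit_transform returns a NumPy array, so rebuild a DataFrame that keeps
# the original column names and index; pd.concat needs pandas objects.
numeric_data = pd.DataFrame(
    scaler.fit_transform(numeric_data),
    columns=numeric_columns,
    index=df.index,
)

# Join the scaled numeric columns with the untouched dummy columns.
scaled_df = pd.concat([numeric_data, categorical_data], axis=1)
print(scaled_df)
```

With a train/test split, the scaler would normally be fit on the training rows only and then applied to the test rows with transform, so that test-set statistics do not leak into training.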
