Linear Regression Python Sklearn [FROM SCRATCH]

Python Marathon

มุมมอง 85 965

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 27 ธ.ค. 2024

ความคิดเห็น • 68

@joseordonez7738 2 ปีที่แล้ว
Mi loco, no se si entiendas; pero tu video salvo mi ser, eres grande
@shrishsharma8333 4 ปีที่แล้ว ⁺⁶
An awesome video and great explanation. Why it ain't got any views i wonder!!!! Thanks a lot!!
@paarthmadan1315 4 ปีที่แล้ว ⁺⁷
At 3.32, what was the reshaping criteria: why reshaped to (-1,1) and not anything else? I didn't understand that part.
@dome8116 5 ปีที่แล้ว ⁺¹²
Considering your question I guess for a linear regression model is it pretty okay. Much higher accuracy is probably not possible with LR. Other ml models would have to be taken into consideration
@bradwang3648 3 ปีที่แล้ว ⁺²
very helpful, thank you!!! I can finally do my HW after watching this video!
@gnanashrishetty1465 3 ปีที่แล้ว ⁺²
can anyone explain 3:40? I couldnt get the output[7], my output was just LinearRegression() and due to that I couldn't further use the .predict either
@Continentalky 3 ปีที่แล้ว ⁺¹
@@pythonmaraton I am still having the same problem. I used the LR = LinearRegression code, but still just returning LinearRegression() when I run the next line of code.
@theh1ve 2 ปีที่แล้ว
@@Continentalky Did you find a solution I am having the exact same issue?
@edson_winner 3 ปีที่แล้ว ⁺¹
Thanks bro. I already got subscribed and no doubt I will watch all your videos as you are a great teacher. God bless you!
@akshatbhadani1710 ปีที่แล้ว
i think it was a great model and u are aa great person tysm for making this vid
@michaelcstorm3808 3 ปีที่แล้ว ⁺¹
thanks. this helps me to do a data science assignment
@abhishekambawale7456 ปีที่แล้ว
Great Tutorial. Thanks
@jaywiji 4 ปีที่แล้ว ⁺¹¹
Very clear video thanks a lot.
One questions I have is why do we need to reshape the data ? And why do we need to use .values? Wouldn't it work if we just used X_train, Y_train instead of X_train.values ?
@wobblyjelly345 3 ปีที่แล้ว ⁺¹
That's what I want to know too
@amikhalsa3173 3 ปีที่แล้ว ⁺⁹
The reason is essentially because of the datatype. It needs to be in nd array form and needs to be a 2d array. you will get an error if you try to just use X_train because at this point it is a series datatype. You can convert it to a numpy.ndarray by using X_train = X_train.to_numpy() and then reshape to (-1, 1) OR you can just take the values stored in the series and reshape the values directly to (-1, 1). I think this is because the lr.fit() function takes only 2d arrays, not series. Hope that helps!
@jongcheulkim7284 3 ปีที่แล้ว ⁺¹
Thank you so much. I learned a lot.
@jeandy4495 ปีที่แล้ว ⁺¹
The accuracy of this particular model over this data is pretty good (~40%). The linear model is pretty good at catching the general (linear) trend of the datapoints. But it will be difficult to improve the accuracy with this model, as the datapoints are distributed with a wide variance around the linear model. Other regressors could be more accurate.
4 ปีที่แล้ว ⁺³
Is the score the R2?
@hudsontorrent6672 2 ปีที่แล้ว
Thanks for the video. Just to make a contribution, there is an outlier with high leverage in the training set (the observation with coordinates around (100, 35)). This is affecting the estimation of the slope coefficient, making its estimated value smaller than it should be. As a result, the estimated line does not fit the testing set well. There are no outliers in the testing set. Thanks again. Cheers.
@mridulagarwal5881 4 ปีที่แล้ว
Nice video! Short and crisp.
@muhammadhamza7369 2 ปีที่แล้ว
Love it 🥰🔥
@deekshantchoudhary8454 4 ปีที่แล้ว ⁺¹
Great video, you've explained it nicely. Thanks!!
@gongjiaji2489 4 ปีที่แล้ว
is the fit() function did all the training job? why is so quick?
@ricardosalas7048 4 ปีที่แล้ว ⁺³
gracias por existir
@hannesvideo ปีที่แล้ว
@Python Marathoón: can you explain the reshape? Is it just the selection of 2 features from possibly more features? Why -1?
@pythonmaraton ปีที่แล้ว ⁺¹
Hi, thanks for the question. Sklearn wants the arrays to be vertical. The -1,1 is just a shortcut to flip it vertical. It’s like saying reshape to size N,1 (N rows and 1 column). Likewise if you reshape to (1,-1) it would reshape to size 1,N (1 row and N columns)
@hannes672 ปีที่แล้ว
@@pythonmaraton thanks, that explains it. Great video!
@toihirhalim 4 ปีที่แล้ว
thanks Rylan you are awesome dude !
@aravindkramesh 2 ปีที่แล้ว
*Thank you, man. I understood.*
@alokeveer 4 ปีที่แล้ว ⁺⁴
hey, I just love to work in a dark background. How did you make your background dark... ??
@alokeveer 4 ปีที่แล้ว
@@pythonmaraton Exactly man.... Would u mind telling me the name of the chrome plugin or if possible sending the link of the chrome plugin!!?? Thanks for reply by the way..
@alokeveer 4 ปีที่แล้ว
@@pythonmaraton Thank you so much man. Your tutorial was also awesome!!
@ishikakesarwani6278 3 ปีที่แล้ว
At 4:00 what if we don't reshape?
@ishikakesarwani6278 3 ปีที่แล้ว
@@pythonmaraton 👍
@sachindilhan7504 3 ปีที่แล้ว ⁺¹
how to find dataset
@spitfirelast8761 3 ปีที่แล้ว ⁺¹
How do you make their values appear normal again after running the model?
Like for example:
I had a value of 3070.55 then after processing the data, the machine made the value from 3070.55 to 7.189879, then after running the model i get 0.46598782 on mean square error and
0.47839596 for cross validation score.
How do i return the value of 7.189879 to original 3070.55 so that i can output the value to original amount?
@ced4030 4 ปีที่แล้ว ⁺²
looks good and it helped me a lot. I did this for a class project a few months ago but it was a great refresher. a question, if i wanted to plug my predictions back into the actual data - to for example tie the prediction to a womans name if it existed in the original data set; how would we do that?
@Huy-G-Le 3 ปีที่แล้ว ⁺⁴
ModuleNotFoundError: No module named 'pydataset'
@bellatrix625 2 ปีที่แล้ว
pip install pydataset
@junknewera 2 หลายเดือนก่อน
Bruh
@skhdukes1888 4 ปีที่แล้ว ⁺³
Great video. To answer your question, since the model scored under 70% wouldn't it be considered poor performance?
@farelferdinand1089 3 ปีที่แล้ว ⁺¹
I think it depends because if you are going to predict the percentage of people surviving after an operation, 70 might be a low number.
@Olivander-u9k 3 ปีที่แล้ว ⁺¹
is there a way to predict "x" using a specific "y" value?
@aadhuu ปีที่แล้ว
I mean just feed y instead of x into the model
@jeffgalef121 4 ปีที่แล้ว
That was great. Thank you!
@ibrahimnadeem1064 ปีที่แล้ว
how to get this dataset?
@sanketchore657 4 ปีที่แล้ว ⁺¹
Thanks bro
@imtiazsajwani4239 2 ปีที่แล้ว
Ryan, thanks for the great video.
Do you happen to know why am I getting the fit error
ValueError: Input contains NaN, infinity or a value too large for dtype('float64').
@jaymanhire 2 ปีที่แล้ว
My model score is very low, but the predictions are very close. Interesting.
@estebanduarte1792 2 ปีที่แล้ว
Extreme outliers?
@oddnumber8149 3 ปีที่แล้ว
how to import pydataset in jupyter notebook?
@prateekyadav7679 4 ปีที่แล้ว
i am working on an excel file but I get key error for 'height' which is the first column in my data.
@raminlakin7888 2 ปีที่แล้ว
The notebook can we have the code?
@afifkhaja 2 ปีที่แล้ว
I tried installing then importing sklearn but Python didn't recognize it. I had to install skicit-learn instead.
# Go to File -> Settings -> Python Interpeter and install pydataset and scikit-learn packages
# scikit-learn is called sklearn when using the import statement
from sklearn.linear_model import LinearRegression # For linear regression
from sklearn.model_selection import train_test_split # To split data into train and test
@yeahjustlikethat 3 ปีที่แล้ว
Sometimes a less accurate but simpler model is better to get others "buy in". I guess that one can need some help though.
@mandalamtarun5414 4 ปีที่แล้ว
I am getting an error: fit() missing 1 required positional argument: 'y'
Any suggestions on removing this?
@zackmorey 4 ปีที่แล้ว
@@pythonmaraton Could you help me understand why the reshape is important and what it's doing?
@crazycxr 3 ปีที่แล้ว
yo, good succint video. thanks
@prafuldhakde3601 10 หลายเดือนก่อน
Bhai mera to nhi ho rha... mene code type kiya jaisa aapne likha vese copy paste lekin vo error dera
@vasudhashrikhandey2194 4 ปีที่แล้ว
My score is coming 0.0348.Am I still correct?Since I have done all the steps same
@devilzwishbone 3 ปีที่แล้ว
No, not a good model as its 39% accurate, ideally you want it in the 3/4 mark or more (75% accuracy) for it to be an okish model and 90% or more for it to be brilliant
@prafuldhakde3601 10 หลายเดือนก่อน
Plz aapka koi contact hoto mujhe dede
@ihsanayyach3696 2 ปีที่แล้ว
at least do something to improve ur model 0.3 R is very low

ต่อไป

เล่นอัตโนมัติ

Linear Regression From Scratch in Python (Mathematical)