Data Cleaning after Identifying Data Problems in Pandas
ฝัง
- เผยแพร่เมื่อ 10 ม.ค. 2025
- Hello friends,
This is my course Hands-on Data Science which was released back in 2020 to help aspiring data scientists learn how the end-to-end process of a data science project works. It has been a paid course since it was published but now I believe it is time for it to be publicly accessible.
Here are the links you need for the course.👇
📙 Course Notion page where you can find the questions and assignments: fishy-dessert-...
👩💻 Course repository: github.com/mis...
If you have any questions, feel free to leave comments. I will try to answer them as much as I can, but look through the comments and help others as much as you can! Let's make this a safe learning environment.
Previous video of the course: • Data Exploration: Iden...
👋 Keep in touch?
==========================
🐥 Twitter - / misraturp
🔗 LinkedIn - / misraturp
📹 TH-cam - / @misraturp
🌎 Website - misraturp.com/
Courses & resources
============================
📙 Fundamentals of Deep Learning in 25 pages
misraturp.gumr...
📥 Streamlit template
misraturp.gumr...
🤖 Deep Learning 101 with Python and Keras (FREE)
• 50 Days of Deep Learning
🏃♀️ Data Science Kick-starter mini-course (FREE)
misraturp.gumr...
🐼 Pandas cheat sheet (FREE)
misraturp.gumr...
📝 NNs hyperparameters cheat sheet (FREE)
misraturp.gumr...
00:00 Shallow copy vs Deep Copy
04:13 Correcting data types
i find your tutorials very easy and understandable however i have one simple question. can you please explain how statistics is important in data science like how it is useful in anyway possible. can you please make a video on it and explain it using dataset
Regarding the Payment_type attribute As mentioned in the the data dictionary the payment code options are
A numeric code signifying how the passenger paid for the trip.
1= Credit card
2= Cash
3= No charge
4= Dispute
5= Unknown
6= Voided trip
but the data in the paraquet files have payment codes as [1, 2, 0, 4, 3 ] (even seen in the the old files now updated to paraquet format)
could not understand what the payment code 0 refers to ?
I am doing analysis for yellow_taxi_jan_2024 (january 2024) data
could observe that after payement_type 1 (credit card ) and 2 (cash)
could see high ocurrence of payment_type 0 (around 140K)
is this something to think about or the relevant meta data is not updated in the website ?
if anyone is working on this data kindly share your thoughts regarding this matter,
would appreciate a response
thanks :-)