Data Cleaning after Identifying Data Problems in Pandas

  • Published on Jan 10, 2025
  • Hello friends,
    This is my course Hands-on Data Science which was released back in 2020 to help aspiring data scientists learn how the end-to-end process of a data science project works. It has been a paid course since it was published but now I believe it is time for it to be publicly accessible.
    Here are the links you need for the course.👇
    📙 Course Notion page where you can find the questions and assignments: fishy-dessert-...
    👩‍💻 Course repository: github.com/mis...
    If you have any questions, feel free to leave comments. I will try to answer them as much as I can, but look through the comments and help others as much as you can! Let's make this a safe learning environment.
    Previous video of the course: • Data Exploration: Iden...
    👋 Keep in touch?
    ==========================
    🐥 Twitter - / misraturp
    🔗 LinkedIn - / misraturp
    📹 YouTube - / @misraturp
    🌎 Website - misraturp.com/
    Courses & resources
    ============================
    📙 Fundamentals of Deep Learning in 25 pages
    misraturp.gumr...
    📥 Streamlit template
    misraturp.gumr...
    🤖 Deep Learning 101 with Python and Keras (FREE)
    • 50 Days of Deep Learning
    🏃‍♀️ Data Science Kick-starter mini-course (FREE)
    misraturp.gumr...
    🐼 Pandas cheat sheet (FREE)
    misraturp.gumr...
    📝 NNs hyperparameters cheat sheet (FREE)
    misraturp.gumr...
    00:00 Shallow copy vs deep copy
    04:13 Correcting data types

Comments • 2

  • @abdullahhanif4541
    5 months ago

    I find your tutorials very easy to follow and understand. However, I have one simple question: can you please explain why statistics is important in data science and how it is useful in practice? Could you make a video on it and explain it using a dataset?

  • @ajithdevadiga9939
    8 months ago +1

    Regarding the Payment_type attribute: as mentioned in the data dictionary, the payment code options are:
    A numeric code signifying how the passenger paid for the trip.
    1= Credit card
    2= Cash
    3= No charge
    4= Dispute
    5= Unknown
    6= Voided trip
    But the data in the parquet files has payment codes [1, 2, 0, 4, 3] (this is also the case in the old files that have since been converted to parquet format).
    I could not understand what payment code 0 refers to.
    I am analyzing the yellow_taxi_jan_2024 (January 2024) data.
    After payment_type 1 (credit card) and 2 (cash),
    payment_type 0 has the next highest occurrence (around 140K).
    Is this something to think about, or has the relevant metadata simply not been updated on the website?
    If anyone is working on this data, kindly share your thoughts on this matter.
    I would appreciate a response.
    thanks :-)
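
The check described in this comment can be sketched in pandas as follows. This is only a minimal illustration, not the course's own code: the helper name `summarize_payment_types` is hypothetical, the column name `payment_type` is assumed from the comment, and the small sample frame is made up so the snippet runs on its own. With the real data you would load the parquet file (for example with `pd.read_parquet`) in place of the sample. The sketch flags codes that are missing from the data dictionary; it does not say what code 0 means.

```python
import pandas as pd

# Codes listed in the data dictionary quoted in the comment above.
PAYMENT_TYPES = {
    1: "Credit card",
    2: "Cash",
    3: "No charge",
    4: "Dispute",
    5: "Unknown",
    6: "Voided trip",
}


def summarize_payment_types(df, column="payment_type"):
    """Count trips per payment code and flag codes absent from the dictionary.

    Hypothetical helper for illustration; `column` is an assumed column name.
    """
    counts = df[column].value_counts().rename("trips").to_frame()
    # Attach the documented label, or a marker when the code is not listed.
    counts["label"] = [PAYMENT_TYPES.get(code, "undocumented") for code in counts.index]
    counts["documented"] = [code in PAYMENT_TYPES for code in counts.index]
    return counts


# Made-up sample values, NOT real taxi data.
sample = pd.DataFrame({"payment_type": [1, 1, 1, 2, 2, 0, 0, 4, 3]})

summary = summarize_payment_types(sample)
print(summary)

# Codes that appear in the data but not in the data dictionary.
undocumented = sorted(summary.index[~summary["documented"]])
print("Undocumented codes:", undocumented)
```

Rows flagged as undocumented are worth inspecting separately (for instance, by looking at the other columns for those trips) before deciding whether to keep, recode, or drop them.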