How to remove outliers in Python? | For multiple columns | Step by step ♥

  • Published on 6 Feb 2025

Comments • 61

  • @jingyiwang5113
    @jingyiwang5113 2 years ago +3

    I am really grateful for this video. I am doing research with my professor. And this is really an essential skill for me to conduct research with him. Thank you so much! I do appreciate your wisdom!

  • @amandacorreia2625
    @amandacorreia2625 3 years ago +12

    Your voice, the music and the explanation: everything is amazing! Thanks a lot ♥

  • @rajendranayak8018
    @rajendranayak8018 3 years ago +7

    Dear Eigen B, Please upload videos on machine learning & higher stats. I found this video, which helps me a lot. Your way of teaching is good.

  • @yohanneskebede1573
    @yohanneskebede1573 6 months ago

    clear and simple! well explained with no error! Bravo!

  • @Christopher-xr9kq
    @Christopher-xr9kq 2 years ago +1

    Wow. Watched entire video. So peaceful. good job!!!!

  • @jayanthimallela8842
    @jayanthimallela8842 1 year ago

    This video really helped me a lot with outliers. I'm thankful to you for the very clean and decent explanation. Please do more videos on machine learning. Thanks a lot.

  • @Mandelbrot567
    @Mandelbrot567 2 years ago

    This video is excellent. I tried the method on another dataset and it worked a treat.

  • @robertaraujo347
    @robertaraujo347 3 years ago +1

    I loved watching this video! It gets to the main point, your explanation was very clear, and you took your time so that no detail was left out. At the beginning I wondered whether I should watch your video, because it lasts 13 minutes and I don't like videos longer than 5 minutes xd, but I leave happy because I understood this topic and will now be able to apply it in future data cleaning.

  • @jamesjulius7726
    @jamesjulius7726 2 years ago

    Excellent explanation and pace! So calm. I will never forget this part. #removing outliers

  • @123arskas
    @123arskas 2 years ago

    Nice work. Liked the simplicity and the soothing voice + music.

  • @eugenevlaxos92
    @eugenevlaxos92 3 years ago +1

    Thank you so much, you saved my data mining project.

  • @selsabillekkaf5724
    @selsabillekkaf5724 1 year ago

    Everything is amazing! More than helpful. Thank you.

  • @priyanshugupta2104
    @priyanshugupta2104 2 years ago

    You taught this very well, sister.

  • @jesusparra9840
    @jesusparra9840 2 years ago

    Excellent video. I had been searching for quite a while, and you explained everything really well.

  • @chandrasm009
    @chandrasm009 3 years ago +1

    Thanks a lot, Eigen B. It's really helpful.

  • @nurulfadillah1248
    @nurulfadillah1248 1 year ago

    this really helps me, thank you so much!

  • @josiahadesola
    @josiahadesola 3 years ago

    Awesome... Thanks! I love the teaching method and the background music.

  • @christopherfreyre744
    @christopherfreyre744 2 years ago

    This is amazing. Thanks for sharing, and for such a lovely explanation.

  • @manojnaik8720
    @manojnaik8720 2 years ago

    Sweet voice....Nicely explained.... Thanks

  • @hizokadarkwolf
    @hizokadarkwolf 2 years ago

    I was doing something similar, with no results... Guess what: I used & instead of | when finding the lower and upper bounds. Thanks a lot for making this video!
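
The `&` versus `|` mix-up described above can be reproduced in a minimal sketch. It assumes the usual 1.5 × IQR rule for the lower and upper bounds (the thread does not show the video's exact code), and the sample values and variable names are hypothetical.

```python
import pandas as pd

# Hypothetical sample column; 100 is the obvious outlier.
s = pd.Series([10, 12, 11, 13, 12, 100], name="feature1")

q1, q3 = s.quantile(0.25), s.quantile(0.75)
iqr = q3 - q1
lower_bound = q1 - 1.5 * iqr
upper_bound = q3 + 1.5 * iqr

# A value is an outlier if it is below the lower bound OR above the upper bound.
outlier_or = (s < lower_bound) | (s > upper_bound)

# With "&" a value would have to be below the lower bound AND above the
# upper bound at the same time, which is impossible, so nothing is flagged.
outlier_and = (s < lower_bound) & (s > upper_bound)

print(s[outlier_or].tolist())
print(int(outlier_and.sum()))
```

Using `&` is only correct for the opposite question, that is, selecting the rows that lie inside both bounds.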

  • @ft_smile
    @ft_smile 3 years ago

    I wish I could show you how thankful I am
    🙏🙏🙏🙏🙏🙏🙏🙏🙏🙏🙏🙏

  • @bobozaimee
    @bobozaimee 2 years ago

    Thank you! Your video was really helpful for me :)

  • @مريم-مناف-عدنان
    @مريم-مناف-عدنان 2 years ago

    Thanks, great video. I have a question: I have about 446 features. How can I deal with them as in your example? I tried storing the features in a variable X and then using your code, but it did not work. Any help, please?
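
For a frame with hundreds of features, one possible approach is to let pandas pick out the numeric columns instead of listing them by hand. This is only a sketch: `outlier_index` is a hypothetical helper that assumes the 1.5 × IQR rule, and the small frame stands in for the real data.

```python
import pandas as pd

def outlier_index(df, ft):
    """Hypothetical helper: index labels of rows where column `ft` is an IQR outlier."""
    q1 = df[ft].quantile(0.25)
    q3 = df[ft].quantile(0.75)
    iqr = q3 - q1
    mask = (df[ft] < q1 - 1.5 * iqr) | (df[ft] > q3 + 1.5 * iqr)
    return df.index[mask]

# Hypothetical frame: two numeric features plus one text column.
df = pd.DataFrame({
    "f1": [1.0, 2.0, 3.0, 4.0, 5.0, 500.0],
    "f2": [10.0, 11.0, -900.0, 12.0, 13.0, 14.0],
    "label": list("abcdef"),
})

# Select every numeric column; non-numeric columns are skipped.
numeric_cols = df.select_dtypes(include="number").columns

bad_rows = set()
for ft in numeric_cols:
    bad_rows.update(outlier_index(df, ft))

# Drop each flagged row once, whichever feature flagged it.
df_clean = df.drop(index=sorted(bad_rows))
print(df_clean)
```

With many features, a large share of rows may be flagged by at least one column, so it is worth checking `len(bad_rows)` before dropping anything.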

  • @oipseismic7621
    @oipseismic7621 3 years ago +2

    I tried this code and it doesn't work. It shows: "Can only compare identically-labeled Series objects".
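
The thread does not show the code that produced this message, so the cause can only be guessed. One possible cause is comparing a column with a quantile that is itself a Series rather than a single number, which happens when the column is selected with double brackets. The frame below is hypothetical.

```python
import pandas as pd

df = pd.DataFrame({"f1": [1.0, 2.0, 3.0, 4.0, 5.0, 500.0]})

# Double brackets select a DataFrame, so quantile() returns a Series
# labelled by column name instead of a single number.
q3_series = df[["f1"]].quantile(0.75)

error_message = None
try:
    df["f1"] > q3_series  # Series compared with a differently labelled Series
except ValueError as err:
    error_message = str(err)
print(error_message)

# Single brackets select a Series, so quantile() returns a scalar
# and the comparison works element by element.
q3 = df["f1"].quantile(0.75)
mask = df["f1"] > q3
print(int(mask.sum()))
```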

  • @rokeyasiddiqua9375
    @rokeyasiddiqua9375 2 years ago

    Great tutorial

  • @mihirthakkar6902
    @mihirthakkar6902 3 years ago

    Very nicely explained. great work. Thanks.

  • @MrYnitram
    @MrYnitram 2 years ago +1

    great video! One question though: what if you only wanted to drop the outlier values and not the whole row in which the outlier is found?

    • @prashantshrivastava01
      @prashantshrivastava01 2 years ago

      Not possible, but you can replace the outliers with NaN. Then again, there is no point in doing that.

    • @jayanthimallela8842
      @jayanthimallela8842 1 year ago

      It doesn't work like that; we can't remove only the outlier value, we can only remove the entire row.
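
The NaN replacement mentioned in the replies can be sketched as follows. A table cannot have a missing cell, but blanking an outlier to NaN keeps the row's valid values in the other columns, and pandas reductions such as `mean()` skip NaN by default. The frame and the 1.5 × IQR rule are assumptions for illustration.

```python
import pandas as pd

# Hypothetical frame with one outlier in each column, on different rows.
df = pd.DataFrame({
    "f1": [1.0, 2.0, 3.0, 4.0, 5.0, 500.0],
    "f2": [10.0, 11.0, -900.0, 12.0, 13.0, 14.0],
})

# Per-column quartiles and bounds (each is a Series indexed by column name).
q1 = df.quantile(0.25)
q3 = df.quantile(0.75)
iqr = q3 - q1
lower = q1 - 1.5 * iqr
upper = q3 + 1.5 * iqr

# Compare every cell with the bounds of its own column.
is_outlier = df.lt(lower, axis="columns") | df.gt(upper, axis="columns")

# Blank out only the offending cells; every row is kept.
df_masked = df.mask(is_outlier)
print(df_masked)
```

Whether this beats dropping rows depends on what comes next, since some models and plots do not accept NaN.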

  • @surajsalunkhe2348
    @surajsalunkhe2348 2 years ago

    Thanks for the help

  • @peopleonemillion4283
    @peopleonemillion4283 2 years ago

    Thank you!!!! you are amazing

  • @chiranjeebmahanta1215
    @chiranjeebmahanta1215 2 years ago

    Thanks a lot!

  • @zarynooi5669
    @zarynooi5669 3 years ago

    Thank You! Very helpful !

  • @PulkitKumar-fd8rb
    @PulkitKumar-fd8rb 2 years ago

    Instead of removing, how can we impute median values ?
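
Median imputation could look roughly like this. The sketch assumes the 1.5 × IQR rule and takes the median of the non-outlier values, which is a design choice rather than anything stated in the video; the sample column is hypothetical.

```python
import pandas as pd

# Hypothetical sample column; 100.0 is the outlier.
s = pd.Series([10.0, 12.0, 11.0, 13.0, 12.0, 100.0], name="feature1")

q1, q3 = s.quantile(0.25), s.quantile(0.75)
iqr = q3 - q1
is_outlier = (s < q1 - 1.5 * iqr) | (s > q3 + 1.5 * iqr)

# Median of the values that are not outliers.
median_value = s[~is_outlier].median()

# Replace only the flagged values; everything else is left untouched.
s_imputed = s.mask(is_outlier, median_value)
print(s_imputed.tolist())
```

Swapping `.median()` for `.mean()` gives mean imputation instead.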

  • @t.farias9336
    @t.farias9336 3 years ago

    thanks, you helped me a lot!

  • @manish17788
    @manish17788 3 years ago

    What if the data has no outliers? In that case, will we lose a small amount of data? How do we know whether outlier removal is needed at all in a big dataset?

  • @souravsinghbhandari9699
    @souravsinghbhandari9699 2 years ago

    I used the same technique on my dataset, but outliers still persist. Any suggestions on what to do?
    I tried rerunning the loop. It removed some more outliers, but that further reduced the original dataset I was working on.
    Does anyone have any better suggestions?
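
One likely explanation is that removing rows changes the quartiles, so bounds recomputed on the trimmed data are narrower and flag points that were inside the original bounds. The sketch below assumes the 1.5 × IQR rule; `iqr_outliers` and the sample values are hypothetical.

```python
import pandas as pd

def iqr_outliers(s):
    """Hypothetical helper: the values of `s` outside the 1.5 * IQR bounds."""
    q1, q3 = s.quantile(0.25), s.quantile(0.75)
    iqr = q3 - q1
    return s[(s < q1 - 1.5 * iqr) | (s > q3 + 1.5 * iqr)]

s = pd.Series([1, 2, 3, 4, 5, 6, 7, 8, 14, 100])

# First pass: only the extreme value is outside the bounds.
first_pass = iqr_outliers(s)

# Second pass on the trimmed data: the bounds are recomputed and have
# tightened, so a value that was acceptable before is now flagged.
trimmed = s.drop(first_pass.index)
second_pass = iqr_outliers(trimmed)

print(first_pass.tolist())
print(second_pass.tolist())
```

Repeating the loop therefore keeps shrinking the data. A common compromise is to compute the bounds once on the original data and accept that a box plot of the cleaned data may still show a few points beyond its whiskers.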

  • @IngenieriaEstructural7
    @IngenieriaEstructural7 2 years ago

    You're a genius, you helped me a lot.

  • @stephanie_ong
    @stephanie_ong 3 years ago

    Thank you so much!

  • @cse048harshkumawat6
    @cse048harshkumawat6 3 years ago

    Is there any way to replace the outlier values with upper_bound or lower_bound? Please help.
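
Replacing outliers with the nearest bound is often called capping, and pandas offers `Series.clip` for it. A minimal sketch, assuming the 1.5 × IQR bounds and a hypothetical sample column:

```python
import pandas as pd

# Hypothetical sample column; 100.0 is the outlier.
s = pd.Series([10.0, 12.0, 11.0, 13.0, 12.0, 100.0], name="feature1")

q1, q3 = s.quantile(0.25), s.quantile(0.75)
iqr = q3 - q1
lower_bound = q1 - 1.5 * iqr
upper_bound = q3 + 1.5 * iqr

# Values below lower_bound become lower_bound, values above upper_bound
# become upper_bound, and everything in between is unchanged.
s_capped = s.clip(lower=lower_bound, upper=upper_bound)
print(s_capped.tolist())
```

No rows are lost this way, at the cost of piling the capped values up at the bounds.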

  • @gebremedhnmehari8451
    @gebremedhnmehari8451 2 years ago

    How can we determine the value of the quantile?

  • @KP-oi4ee
    @KP-oi4ee 2 years ago +2

    index_list = []
    for feature in ['feature1', 'feature2']:
        index_list.extend(outliers(data, feature))
    index_list = []
    -----> For this I am getting an error: "Boolean array expected for the condition, not float64".
    How can I fix it?

    • @aartiahluwalia4104
      @aartiahluwalia4104 2 years ago

      index_list = []
      for feature in ['feature1', 'feature2']:
          index_list.extend(outliers(data, feature))
      index_list = []  --> You seem to have created index_list twice, so change this line to
      index_list
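
A runnable version of the loop discussed above might look like the sketch below. The body of `outliers` is a hypothetical stand-in that assumes the 1.5 × IQR rule, since the video's own helper is not shown in the thread; `data` is made-up sample data.

```python
import pandas as pd

def outliers(df, ft):
    """Hypothetical stand-in: index labels of the IQR outliers in column `ft`."""
    q1 = df[ft].quantile(0.25)
    q3 = df[ft].quantile(0.75)
    iqr = q3 - q1
    mask = (df[ft] < q1 - 1.5 * iqr) | (df[ft] > q3 + 1.5 * iqr)
    return df.index[mask]

# Hypothetical data: row 5 is an outlier in both features.
data = pd.DataFrame({
    "feature1": [1.0, 2.0, 3.0, 4.0, 5.0, 500.0],
    "feature2": [10.0, 11.0, -900.0, 12.0, 13.0, 900.0],
})

index_list = []
for feature in ["feature1", "feature2"]:
    index_list.extend(outliers(data, feature))

# Do not reassign index_list = [] here; that would throw the results away.
# A row flagged by several features appears several times, so de-duplicate.
index_list = sorted(set(index_list))

data_clean = data.drop(index=index_list)
print(index_list)
print(data_clean)
```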

  • @divina.glitch
    @divina.glitch 2 years ago

    Thanks!

  • @sudhanshusingh5594
    @sudhanshusingh5594 3 years ago

    Thank you so much... really, thank you.

  • @kyleroach2581
    @kyleroach2581 10 months ago

    This should be titled Pandas ASMR

  • @AhmedDaoud2
    @AhmedDaoud2 2 years ago

    Thanks, can I get the test.csv file?

  • @shahadewadh606
    @shahadewadh606 2 years ago

    ❤❤❤❤

  • @VishwasPatki
    @VishwasPatki 2 years ago

    Error: TypeError: Cannot perform 'ror_' with a dtyped [float64] array and scalar of type [bool]
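
A likely cause of this TypeError is missing parentheses around the two comparisons. In Python, `|` binds more tightly than `<` and `>`, so pandas ends up applying `|` to a float bound and a float64 column. The bounds below are hypothetical numbers chosen for illustration.

```python
import pandas as pd

s = pd.Series([10.0, 12.0, 11.0, 13.0, 12.0, 100.0], name="feature1")
lower_bound, upper_bound = 9.0, 15.0  # hypothetical bounds for illustration

# Without parentheses,
#     s < lower_bound | s > upper_bound
# is parsed as
#     s < (lower_bound | s) > upper_bound
# so "|" is applied to a float and a float64 Series, which fails.

# Parenthesise each comparison so both sides of "|" are boolean Series.
mask = (s < lower_bound) | (s > upper_bound)
print(s[mask].tolist())
```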

  • @jenirex1944
    @jenirex1944 2 years ago

    What will the output of In[8] be? Can anyone explain?

  • @trangdtt30
    @trangdtt30 3 years ago

    Hi. I get one error, "Name 'dt' is not defined", when I run cell [9]. Can you help me?

  • @aravinthanseenu1237
    @aravinthanseenu1237 1 year ago

    Dear Eigen B,
    Instead of removing the outliers, could you kindly show how to replace them with the mean value of the respective column?

  • @jorgeeg2668
    @jorgeeg2668 2 years ago

    I don't understand English, but I understood the video :D

  • @9881847751
    @9881847751 2 years ago

    What is ft here?

  • @AlAhlyLy
    @AlAhlyLy 9 months ago

    Hello, I wrote your code and nothing happened. Thank you for the video anyway.

  • @jamesjayanth7926
    @jamesjayanth7926 1 year ago

    I'm getting an error about outliers not being defined.

  • @Master_of_Chess_Shorts
    @Master_of_Chess_Shorts 2 years ago

    Great coding, but the operation should be column-wise, not row-wise. By using the index you are removing possibly valid values in the adjacent columns. Imagine a large dataset with 500 columns...

  • @modhua4497
    @modhua4497 3 years ago

    Could you share your code? Thanks

  • @WEMELONs
    @WEMELONs 1 year ago

    where vids mazafaka