Handling skewness

แชร์
ฝัง
  • เผยแพร่เมื่อ 13 ก.ย. 2024

ความคิดเห็น • 52

  • @bitanbanerjee2090
    @bitanbanerjee2090 3 ปีที่แล้ว +2

    Wow I was confused about this and you explained it so well!!!! Thank you

  • @teja2683
    @teja2683 2 ปีที่แล้ว +1

    Today I learned so many things from you bro

  • @visvashattri
    @visvashattri 3 ปีที่แล้ว +2

    Thank you.. it was indeed helpful :)

  • @freeprivatetutor
    @freeprivatetutor 8 หลายเดือนก่อน +1

    ❤ and hug. ❤❤❤❤.

  • @alok94singh
    @alok94singh 2 ปีที่แล้ว

    nice explanation sir, keep posted these kind of informative videos

  • @Sallu0123
    @Sallu0123 3 ปีที่แล้ว +2

    Thanks for the videos, where is the descriptive stats videos

    • @sukamal_das
      @sukamal_das  3 ปีที่แล้ว +1

      statquest and khan academy videos are good.

  • @CoopmanGreg
    @CoopmanGreg ปีที่แล้ว

    😀There was no skewness in the opinion of your watches. They were unanimous in that it was a GREAT video. I definitely agree. Thanks.

  • @HariMithra-iz7oq
    @HariMithra-iz7oq 4 หลายเดือนก่อน

    Hi. Can you please take me a paid tutorial for a case study of australian vehicle prices from kaggle? Just one session will be enough.

  • @mkeremyucedag
    @mkeremyucedag 3 ปีที่แล้ว +4

    Hello from Turkey! That's great content. I'd like to ask you what if our target (y) is skewed what to do then ? For example let's say I made a car price prediction model and variables are horsepower of car and color of car and car price is skewed in dataset. I did sqrt or log to car price and trained the model, got the mse and did model tuning everything is finished. And now I want to see a 180 hp black car's price. If I insert the values it will give me the sqrt'ed or log'd value right ? So if I do the reverse of log or sqrt will it give me the real car price ? Or should I do other operations ? Thank you...

    • @sukamal_das
      @sukamal_das  3 ปีที่แล้ว +3

      Hello, if the target is skewed, you can apply log or sqrt transformation. Now the predicted value you get is not original price value, it is either sqrt or log of the price. To get the actual price from predicted value, you can apply the inverse function, for eg if transformation is square root, apply square or if transformation is log (base e) apply exponential function i.e np.exp(). I hope that helps, Thank you.

    • @mkeremyucedag
      @mkeremyucedag 3 ปีที่แล้ว

      @@sukamal_das Oh now it's very clear for me. Thank you so much your contents are great.
      Sincerely...

  • @charmilam920
    @charmilam920 3 ปีที่แล้ว +1

    great video

  • @ketanbutte3497
    @ketanbutte3497 ปีที่แล้ว

    Hi Sukamal, great explanation. For symboling, can we convert the scale to min-max(0-1) and then apply sqrt?

  • @mamatha1850
    @mamatha1850 ปีที่แล้ว

    your video is helpfull.you r saying that highly correlated features with the target.we no need to perform any transformation.will this impact the accuracy of the model.plz reply

  • @adamleyon8297
    @adamleyon8297 3 ปีที่แล้ว

    absolutely useful video, thanks for sharing!

  • @michaelogunmakin9143
    @michaelogunmakin9143 3 หลายเดือนก่อน

    subscribed, thank you!

  • @AbhishekSingh-og7kf
    @AbhishekSingh-og7kf 3 ปีที่แล้ว

    very useful video, thank you for sharing.

  • @preciousbatta9576
    @preciousbatta9576 2 ปีที่แล้ว +1

    Very helpful.. Thanks
    I also want to ask if we reduce skewness, is there any need to scale the data as well?

    • @sukamal_das
      @sukamal_das  2 ปีที่แล้ว

      Yes scaling is still required.

  • @sktdebnath
    @sktdebnath 2 ปีที่แล้ว

    Very very useful.

  • @pforpython7884
    @pforpython7884 ปีที่แล้ว

    Could you please share the link for Descriptive Statistics Video?

  • @ajaykushwaha-je6mw
    @ajaykushwaha-je6mw 3 ปีที่แล้ว

    conceptual knowledge is very good, you have earned 1 more subs. There is one just improve your playlist, rest are good.

  • @AshishYadav-vi6on
    @AshishYadav-vi6on 2 ปีที่แล้ว

    very well explained!

  • @shama6345
    @shama6345 3 ปีที่แล้ว +1

    How can we remove negative and positive skewness together?

    • @sukamal_das
      @sukamal_das  3 ปีที่แล้ว +1

      Skewness can be either positive or negative, if positive apply square root, logarithm, if negative - apply square, cube or higher powers.

    • @shama6345
      @shama6345 3 ปีที่แล้ว +2

      Plz make video on that

  • @rohinisn8147
    @rohinisn8147 4 ปีที่แล้ว +1

    Tq sir....🙆‍♂️

  • @JVJBA
    @JVJBA ปีที่แล้ว

    Hi can you pls share the video for discriptive statistics ?

  • @AbhishekRana-ye9uw
    @AbhishekRana-ye9uw 3 ปีที่แล้ว +2

    sir how do i find skewness of a list given as price = [14751, 16422, 15398, 9445, 12589, 11687, 10692, 8475, 11184, 9961, 12898, 11905]
    please do reply sir will be very helpful :-)

    • @sukamal_das
      @sukamal_das  3 ปีที่แล้ว +2

      from scipy.stats import skew
      import seaborn as sns
      price = [14751, 16422, 15398, 9445, 12589, 11687, 10692, 8475, 11184, 9961, 12898, 11905]
      # print skewness value
      skew(price)
      # plot distribution
      sns.distplot(price)

    • @AbhishekRana-ye9uw
      @AbhishekRana-ye9uw 3 ปีที่แล้ว +1

      @@sukamal_das thank you sir you are great👍🙏

  • @pinkyeeeepinkydas
    @pinkyeeeepinkydas 2 ปีที่แล้ว

    in heat map one feature has 0.002, 2nd has 0.0017.when i am going to skew, getting negative values for 2nd one -0.07723174570350672 where in first 0.2155809290498895. is it correct. why -ve values comming

  • @pranavkhurud6254
    @pranavkhurud6254 ปีที่แล้ว

    Hello DAS Sir,
    Could you please help me or provide here a descriptive statistics video link.
    Not able to find out.
    Any update

  • @RashmiUdupa
    @RashmiUdupa 3 ปีที่แล้ว

    Instead of removing skewness, would removing outliers be enough?

    • @sukamal_das
      @sukamal_das  3 ปีที่แล้ว +2

      Can't say that, but removing outliers will definitely reduce the magnitude of skewness, but no guarantee on totally removing the skewness. Skewness and outliers are 2 different things, A normal distribution with 0 skewness can have outliers on both the extreme ends.

    • @ajaykushwaha-je6mw
      @ajaykushwaha-je6mw 3 ปีที่แล้ว

      removing outliers by trimming is removing records which comes in outlier i.e. you are loosing the data.
      removing outliers with capping is good practice and you can try, it works very well.

  • @brameeev5768
    @brameeev5768 3 ปีที่แล้ว

    can we use VIF to find the correlation i.e. multi-collinearity?

    • @sukamal_das
      @sukamal_das  3 ปีที่แล้ว

      Yes you can use that too

  • @zohaibramzan6381
    @zohaibramzan6381 3 ปีที่แล้ว

    Why skewness needs to be removed?

  • @Rajaraj-kf9kx
    @Rajaraj-kf9kx 2 ปีที่แล้ว

    hello sir how to remove the negative skewness

    • @sukamal_das
      @sukamal_das  2 ปีที่แล้ว

      You can go for square/cube/ higher power transformation

  • @NehaYadav-hs1po
    @NehaYadav-hs1po 3 ปีที่แล้ว

    I typed your code but line is not visible on histogram!! whats wrong in my code? #skewness

    • @sukamal_das
      @sukamal_das  3 ปีที่แล้ว +1

      Sometimes this might happen due to different versions of matplotlib or seaborn

    • @NehaYadav-hs1po
      @NehaYadav-hs1po 3 ปีที่แล้ว

      @@sukamal_das whats the solution then?

    • @sukamal_das
      @sukamal_das  3 ปีที่แล้ว +2

      @@NehaYadav-hs1po try upgrading your seaborn version

  • @sheelazaware5494
    @sheelazaware5494 2 ปีที่แล้ว

    Hello I tried this code I got hist but not curveshape on it why ?

    • @sukamal_das
      @sukamal_das  2 ปีที่แล้ว

      Sometimes this might happen due to different versions of matplotlib or seaborn

  • @sayednab
    @sayednab ปีที่แล้ว

    #name sns is not defined. what am i doing wrong here?

    • @sukamal_das
      @sukamal_das  ปีที่แล้ว

      You are missing an import statement, add this on the top cell - import seaborn as sns

    • @sayednab
      @sayednab ปีที่แล้ว

      @@sukamal_das yes, I figured it out. thx anyway