Handling categorical data

แชร์
ฝัง
  • เผยแพร่เมื่อ 17 ก.ย. 2024

ความคิดเห็น • 18

  • @JahnaviVasamsetti-v3b
    @JahnaviVasamsetti-v3b 2 หลายเดือนก่อน +2

    super sir ,thank you soo much

  • @aartisharma4793
    @aartisharma4793 2 ปีที่แล้ว +2

    Thank you so much.
    These concepts were not that easy for me as you made these now.Any average learner can understand and implement these concept.

    • @sukamal_das
      @sukamal_das  2 ปีที่แล้ว

      Glad that I could help you 🙂👍

  • @bhulekhyag4189
    @bhulekhyag4189 ปีที่แล้ว +1

    i have been working on it for long time , you made it simple .. thank you

  • @sasikala_chowdary
    @sasikala_chowdary ปีที่แล้ว +1

    Time saver ,thank you so much🙏

  • @abdulwahabchudhary6269
    @abdulwahabchudhary6269 8 หลายเดือนก่อน

    Excellent work! I have been working on it for two days, but I did not grasp the main concept. However, after watching the video, I now understand the whole concept. Is my use of 'dose' correct? Also, please check the entire sentence

  • @gpoojitha2614
    @gpoojitha2614 6 หลายเดือนก่อน

    Can you please give a piece of code from which we can get the number of categorical columns and numeric columns in the whole dataset

  • @JyotirmoyeeRoy
    @JyotirmoyeeRoy 4 หลายเดือนก่อน

    I'm getting true or false instead of 0 and 1 after applying the dummies. Why is that?

  • @mkeremyucedag
    @mkeremyucedag 3 ปีที่แล้ว

    Hey! Great video as always. I have a question for you. In the end you're doing fit_transform with for loop. How can I do it with map, list ? When I do list(map(le.fit_transform(df_cat),df_cat)) it gives this error :
    y should be a 1d array, got an array of shape (513, 2) instead.
    How would you do map,list as an alternative to for loop ?

    • @sukamal_das
      @sukamal_das  3 ปีที่แล้ว +1

      You can also use this technique -
      df_cat = df_cat.apply(lambda x : LabelEncoder().fit_transform(x))

    • @mkeremyucedag
      @mkeremyucedag 3 ปีที่แล้ว +1

      @@sukamal_das Oh, thanks! That works and surely is an easy way to work things. Wish you good luck, thanks again !

  • @shahfahad3682
    @shahfahad3682 2 ปีที่แล้ว

    Great explanation!
    I have a question though,
    When we apply label encoder and the categorical column has more than 3 unique values it assigns the value as 1,2,3,4 etc. Are there any chances that our model prioritizes the category which has a higher number compared to others?

    • @sukamal_das
      @sukamal_das  2 ปีที่แล้ว

      Yes you are right. To avoid this problem we can go for One Hot Encoding technique.

    • @shahfahad3682
      @shahfahad3682 2 ปีที่แล้ว +1

      @@sukamal_das but what if we have 100 categories? Then it would create 99 extra columns right?
      How do we handle this?

  • @SusmitaMelodies
    @SusmitaMelodies ปีที่แล้ว

    Thankyou❤

  • @SusmitaMelodies
    @SusmitaMelodies ปีที่แล้ว

    Im having error even after converting categorical values df value still shows object type

    • @sukamal_das
      @sukamal_das  ปีที่แล้ว

      Can you share your code via github ?

    • @SusmitaMelodies
      @SusmitaMelodies ปีที่แล้ว

      @@sukamal_das yes sure can u share the link