How to Quickly Perform Exploratory Data Analysis (EDA) in Python using Sweetviz

  • Published on 21 Jan 2025

Comments • 69

  • @DataProfessor
    @DataProfessor  3 years ago

    👉Watch this video next (How to learn data science in 2021) th-cam.com/video/oR670Txwh88/w-d-xo.html
    Support this Channel 👇👇👇
    🌟 Buy me a coffee www.buymeacoffee.com/dataprofessor
    🌟 Download Kite for FREE www.kite.com/get-kite/?
    👉 Subscribe to this TH-cam channel th-cam.com/users/dataprofessor
    👉 Join the Newsletter of Data Professor newsletter.dataprofessor.org

  • @DatascienceConcepts
    @DatascienceConcepts 4 years ago +10

    Excellent topic! I see so many students struggling with EDA, this should be helpful.

    • @DataProfessor
      @DataProfessor  4 years ago

      Thank you for watching! Glad you think so!

  • @AndrewMoMoney
    @AndrewMoMoney 4 years ago +5

    woot! so many uploads, love it Chanin!

    • @DataProfessor
      @DataProfessor  4 years ago +1

      Nice to see you here, Andrew! Glad you enjoyed it! 😊

  • @margaretblack5474
    @margaretblack5474 1 year ago +1

    Thank you for putting this content together! This package is just what I’ve been looking for.

  • @TinaHuang1
    @TinaHuang1 4 years ago +2

    This is so helpful!! Doing EDA now and gonna check this out. Thank you!!

    • @DataProfessor
      @DataProfessor  4 years ago

      Awesome, thanks for watching Tina!

  • @temiwale88
    @temiwale88 3 years ago +1

    Using this now for quick exploration of my data. Thanks again Data Professor!

    • @DataProfessor
      @DataProfessor  3 years ago +1

      Thanks for watching, Elijah. Glad it was helpful!

  • @x7331x
    @x7331x 3 years ago +2

    Thanks for this intro to sweetviz! Extremely useful and time-efficient!

  • @aaronauta5898
    @aaronauta5898 4 years ago +1

    Thank you, Professor. You made learning data science so simple and easy for me to assimilate. Thank you for introducing this library to me. Still waiting for data cleaning in the Orange library. Thanks again.

  • @minhajali3671
    @minhajali3671 1 year ago

    Hi, thanks for the help! I used it for my final-year dissertation.

  • @pushkarajpalnitkar1695
    @pushkarajpalnitkar1695 4 years ago +1

    Wow, this is going to save a lot of time. Thanks for the video!

  • @thinamG
    @thinamG 4 years ago +1

    It looks amazing, I will try it out soon. Thanks for sharing!!

    • @DataProfessor
      @DataProfessor  4 years ago +1

      Awesome, looking forward to seeing it in use in one of your amazing LinkedIn posts 😊

    • @thinamG
      @thinamG 4 years ago +1

      @@DataProfessor Yeah, I will try it soon. 😊

  • @etc_be_a_62harshshinde66
    @etc_be_a_62harshshinde66 1 year ago

    Not working for me. It says 'DataFrame' object has no attribute 'iteritems'. I read on Stack Overflow that the latest pandas has no iteritems, so I need to downgrade, but even after downgrading pandas it still shows this error.
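A note on the error above: pandas 2.0 removed `DataFrame.iteritems` and `Series.iteritems` in favour of `.items`, so older sweetviz releases that still call `iteritems` fail on newer pandas. If a downgrade appears to have no effect, one possible cause (an assumption, not confirmed in this thread) is that the notebook runtime was not restarted and the newer pandas is still loaded. A minimal workaround sketch, assuming you cannot upgrade sweetviz, is to restore the old name before importing sweetviz:

```python
import pandas as pd

# pandas 2.0 removed .iteritems(); .items() is the replacement with the same behaviour.
# Restore the old name only when it is missing, so this does nothing on older pandas.
for cls in (pd.DataFrame, pd.Series):
    if not hasattr(cls, "iteritems"):
        cls.iteritems = cls.items

# Small made-up frame to show that the old call works again.
df = pd.DataFrame({"a": [1, 2], "b": [3, 4]})
column_names = [name for name, _ in df.iteritems()]
print(column_names)
```

This is a stopgap; upgrading sweetviz to a release that supports current pandas is likely the cleaner fix.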

  • @jeffwong1310
    @jeffwong1310 4 years ago +1

    Is there any chance I can keep showing the same HTML on GitHub as in Colab? When I put the notebook on GitHub, the HTML display is gone.

    • @DataProfessor
      @DataProfessor  4 years ago +1

      Hi, I recommend downloading the generated HTML report and uploading that HTML file to GitHub.

    • @DataProfessor
      @DataProfessor  4 years ago

      Hi, I think it's the rendering of the notebook by GitHub that is the issue. HTML is shown in Colab or Jupyter notebook with no issues because they use the IPython library for the HTML display. GitHub probably doesn't support this, so it is gone. On a similar note, the interactive Plotly graphs included in some of my Jupyter notebooks also disappear when uploaded to GitHub.

    • @jeffwong1310
      @jeffwong1310 4 years ago

      @@DataProfessor got it, thanks for the clarifications

  • @bassamal-kaaki3253
    @bassamal-kaaki3253 4 years ago +1

    Hi data prof, when I run display.html in Anaconda Jupyter notebook I don’t get the same exact large screen you have! The cell is rather small to navigate inside of it. Is this normal? I do however get the html separately in the folder to view.

    • @DataProfessor
      @DataProfessor  4 years ago +1

      Alternatively, the HTML file can be opened in a web browser such as Chrome.
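Opening the saved report in a browser can also be done from Python. The following is a minimal sketch using only the standard library; the file name `SWEETVIZ_REPORT.html` and the helper names `report_uri` and `open_report` are assumptions for illustration, so substitute the path of the HTML file that was written to your folder.

```python
import webbrowser
from pathlib import Path


def report_uri(path: str) -> str:
    """Return a file:// URI for a local HTML report."""
    return Path(path).resolve().as_uri()


def open_report(path: str, opener=webbrowser.open) -> str:
    """Open the report in the default browser and return the URI that was used."""
    uri = report_uri(path)
    opener(uri)
    return uri


# Example call (commented out so that nothing opens on import):
# open_report("SWEETVIZ_REPORT.html")
```

A full browser tab avoids the cramped scrolling area of a notebook output cell.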

  • @TheLucas9amsn
    @TheLucas9amsn 3 years ago +1

    Thank you for always sharing great content.

  • @sebascol
    @sebascol 4 years ago +1

    Nice video. Thanks for sharing this amazing tool.

    • @DataProfessor
      @DataProfessor  4 years ago

      A pleasure, thanks for watching! 😊

  • @kirandeepmarala5541
    @kirandeepmarala5541 4 years ago +1

    Hi Professor, thanks for letting us know about this tool.

    • @DataProfessor
      @DataProfessor  4 years ago

      A pleasure, thanks for watching 😊

  • @stretch8390
    @stretch8390 4 years ago +1

    The X variable is fine, but for y = penguins('species') I keep getting "TypeError: 'DataFrame' object is not callable", which I haven't been able to figure out yet. Anyone know what might be happening? This obviously creates issues with later code because 'y' is then not defined.

    • @DataProfessor
      @DataProfessor  4 years ago +1

      Hi, can you try replacing the parentheses with square brackets so that it becomes penguins['species']?

    • @stretch8390
      @stretch8390 4 years ago +1

      @@DataProfessor Will give that a try next time, thanks Data Professor!
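To illustrate the fix discussed in this thread: parentheses try to call the DataFrame as if it were a function, while square brackets select a column. A minimal sketch follows, using a tiny made-up stand-in for the penguins data (the values and the `bill_length_mm` column are illustrative assumptions, not the real dataset):

```python
import pandas as pd

# Tiny stand-in for the penguins dataset; values are made up for illustration.
penguins = pd.DataFrame({
    "species": ["Adelie", "Gentoo", "Chinstrap"],
    "bill_length_mm": [39.1, 47.5, 49.0],
})

# Parentheses attempt a function call on the DataFrame, which raises TypeError.
try:
    y = penguins("species")
except TypeError as err:
    print(f"TypeError: {err}")

# Square brackets select the column, so y is now defined for later code.
y = penguins["species"]
X = penguins.drop(columns=["species"])
```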

  • @Eri3t5co
    @Eri3t5co 4 years ago

    Amazing library. I'll explore it. Thank you very much.

  • @AndrewClark83
    @AndrewClark83 4 years ago +1

    Love watching your videos. Thanks for sharing.
    Question though: you split out species as your dependent y variable at the start of the video, but then never seemed to use it. Are you doing another video on the same dataset where you predict the species based on the other inputs?

    • @DataProfessor
      @DataProfessor  4 years ago

      Thanks for watching! Glad you liked them 😊

    • @DataProfessor
      @DataProfessor  4 years ago +1

      Great observation. I actually made an earlier video on classifying the penguins according to their species, in which I built a web app using the Streamlit library: th-cam.com/video/Eai1jaZrRDs/w-d-xo.html

    • @ianpilgrim2818
      @ianpilgrim2818 4 years ago

      Had a problem with step 3.2; will take a further look tomorrow.

  • @cheikhbadiane4596
    @cheikhbadiane4596 4 years ago +1

    Awesome Professor thanks a lot

    • @DataProfessor
      @DataProfessor  4 years ago +1

      It's a pleasure, thanks for watching! 😊

  • @sherifelgazar4089
    @sherifelgazar4089 2 years ago

    Can you post the dataset link?

  • @wojtek8396
    @wojtek8396 3 years ago

    Nice video. Unfortunately, I get this error all the time: 'charmap' codec can't decode byte 0x9d in position 1207412: character maps to <undefined>
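A note on the error above: a 'charmap' decode error usually means a UTF-8 file was read with a legacy Windows code page such as cp1252, in which byte 0x9D has no mapping. Where exactly the read happens in this commenter's setup is not stated, so the following is only a sketch of the general cause and fix for files your own code opens: pass `encoding="utf-8"` explicitly. The temporary file and its contents are made up for illustration.

```python
import tempfile
from pathlib import Path

# A right double quotation mark (U+201D) is encoded in UTF-8 as bytes E2 80 9D.
# Byte 0x9D is undefined in cp1252, a common default encoding on Windows.
html = "<p>\u201cquoted\u201d</p>"

with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "report.html"
    path.write_text(html, encoding="utf-8")

    # Reading with cp1252 reproduces the 'charmap' failure.
    try:
        path.read_text(encoding="cp1252")
        cp1252_failed = False
    except UnicodeDecodeError as err:
        cp1252_failed = True
        print(err)

    # Reading with the encoding the file was written in succeeds.
    text = path.read_text(encoding="utf-8")
```

If the failing read happens inside a library rather than in your own code, running Python in UTF-8 mode (for example by setting the `PYTHONUTF8=1` environment variable, available since Python 3.7) may help, though that is untested against this specific case.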

  • @chiazamochiegbu848
    @chiazamochiegbu848 4 years ago +1

    I'm getting a "module not found" error.

    • @chiazamochiegbu848
      @chiazamochiegbu848 4 years ago +1

      Could you help me resolve this? Thank you.

    • @DataProfessor
      @DataProfessor  4 years ago +1

      Hi, have you installed the sweetviz library?
      pip install sweetviz

    • @chiazamochiegbu848
      @chiazamochiegbu848 4 years ago +1

      @@DataProfessor Yes, I have. I also uninstalled and installed it again but I'm still getting the same "module not found" error. The same thing happens with autoplotter, Vaex and pandasgui.
      Please help me.🙏

    • @DataProfessor
      @DataProfessor  4 years ago +1

      What version of Python are you using? Mine is 3.7 or 3.8 and it is working fine.

    • @chiazamochiegbu848
      @chiazamochiegbu848 4 years ago

      @@DataProfessor I installed it through Anaconda and it worked.
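The resolution in this thread suggests (though it is not confirmed) that `pip` had installed the package into a different Python environment from the one running the notebook. A minimal diagnostic sketch, using only the standard library, checks whether a module is visible to the current interpreter and prints an install command tied to that same interpreter. The helper name `describe_module` is hypothetical.

```python
import importlib.util
import sys


def describe_module(name: str) -> str:
    """Report whether a top-level module is importable by the running interpreter."""
    spec = importlib.util.find_spec(name)
    if spec is None:
        return f"{name!r} is NOT importable from {sys.executable}"
    return f"{name!r} found at {spec.origin}"


print(describe_module("sweetviz"))

# Installing with the interpreter's own pip avoids the environment mismatch:
print(f"{sys.executable} -m pip install sweetviz")
```

Running this in the same notebook that raises the error shows which interpreter is in use, and the printed command installs into exactly that one.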

  • @akshykumar1118
    @akshykumar1118 3 years ago +1

    Sir, is it OK to use this library? I mean, will recruiters accept this, or will they expect me to do it manually with pandas?

    • @DataProfessor
      @DataProfessor  3 years ago

      This is a great tool for speeding things up. As for job interview prep, it is always good to go back to basics and be proficient with standard Python libraries like pandas, matplotlib, numpy, scikit-learn, etc.

    • @akshykumar1118
      @akshykumar1118 3 years ago +1

      @@DataProfessor Thank you, sir, for responding.

    • @DataProfessor
      @DataProfessor  3 years ago

      @@akshykumar1118 You’re welcome :)

  • @blessingagyeikyem9849
    @blessingagyeikyem9849 4 years ago +1

    Can I get the .ipynb file?

    • @DataProfessor
      @DataProfessor  4 years ago +1

      Thanks for the reminder, here it is github.com/dataprofessor/code/blob/master/python/Sweetviz.ipynb

  • @grahaminzana324
    @grahaminzana324 2 months ago

    Appreciate this video my dog

  • @muhammadhusni3097
    @muhammadhusni3097 4 years ago +1

    So cool!

  • @fezacongo1692
    @fezacongo1692 4 years ago +1

    Thank you for sharing

  • @tobymasuku5781
    @tobymasuku5781 4 years ago

    Thanks, very informative. Can you please help with how to cluster zip codes?

  • @chuuchuu2381
    @chuuchuu2381 3 years ago

    Should freshers use it?

  • @pauloreis8868
    @pauloreis8868 4 years ago +1

    Awesome!!! Wait... do you think it's worth trying to "replicate" this library "from scratch"? Anyways, I think I'm going to try only for the fun of it kakaka.
    Thank you, Professor!

    • @DataProfessor
      @DataProfessor  4 years ago

      Hi Paulo, it's definitely worth a try. To save you time, feel free to download the notebook (.ipynb) file from GitHub:
      github.com/dataprofessor/code/blob/master/python/Sweetviz.ipynb

  • @RichardMonteiro-rx9ew
    @RichardMonteiro-rx9ew 8 months ago +1

    It was good, but there was no point to the X train/test split in this video.