How to Quickly Perform Exploratory Data Analysis (EDA) in Python using Sweetviz

แชร์
ฝัง
  • เผยแพร่เมื่อ 6 ก.ย. 2024
  • In this video, I will be showing you how to quickly perform exploratory data analysis (EDA) in Python using the Sweetviz library. Particularly, I will be sharing the 2 essential features of Sweetviz that I use the most namely the analyze (quick EDA of a single dataset) and compare (compares 2 datasets) functions.
    🌟 Buy me a coffee: www.buymeacoff...
    ⭕ Links for this video:
    ✅ Code: github.com/dat...
    ✅ Sweetviz: pypi.org/proje...
    ⭕ Playlist:
    Check out our other videos in the following playlists.
    ✅ Data Science 101: bit.ly/datapro...
    ✅ Data Science TH-camr Podcast: bit.ly/datasci...
    ✅ Data Science Virtual Internship: bit.ly/datapro...
    ✅ Bioinformatics: bit.ly/dataprof...
    ✅ Data Science Toolbox: bit.ly/datapro...
    ✅ Streamlit (Web App in Python): bit.ly/datapro...
    ✅ Shiny (Web App in R): bit.ly/datapro...
    ✅ Google Colab Tips and Tricks: bit.ly/datapro...
    ✅ Pandas Tips and Tricks: bit.ly/datapro...
    ✅ Python Data Science Project: bit.ly/datapro...
    ✅ R Data Science Project: bit.ly/datapro...
    ✅ Weka (No Code Machine Learning): bit.ly/dp-weka
    ⭕ Subscribe:
    If you're new here, it would mean the world to me if you would consider subscribing to this channel.
    ✅ Subscribe: www.youtube.co...
    ⭕ Recommended Tools:
    🌟Kite is a FREE AI-powered coding assistant that will help you code faster and smarter. The Kite plugin integrates with all the top editors and IDEs to give you smart completions and documentation while you’re typing. I've been using Kite and I love it!
    ✅ Check out Kite: www.kite.com/g...
    ⭕ Recommended Books:
    🌟kit.co/datapro...
    ✅ Hands-On Machine Learning with Scikit-Learn : amzn.to/3hTKuTt
    ✅ Data Science from Scratch : amzn.to/3fO0JiZ
    ✅ Python Data Science Handbook : amzn.to/37Tvf8n
    ✅ R for Data Science : amzn.to/2YCPcgW
    ✅ Artificial Intelligence: The Insights You Need from Harvard Business Review: amzn.to/33jTdcv
    ✅ AI Superpowers: China, Silicon Valley, and the New World Order: amzn.to/3nghGrd
    ⭕ Stock photos, graphics and videos used on this channel:
    ✅ 1.envato.marke...
    ⭕ Follow us:
    ✅ Medium: bit.ly/chanin-m...
    ✅ FaceBook: / dataprofessor
    ✅ Website: dataprofessor.org/ (Under construction)
    ✅ Twitter: / thedataprof
    ✅ Instagram: / data.professor
    ✅ LinkedIn: / chanin-nantasenamat
    ✅ GitHub 1: github.com/dat...
    ✅ GitHub 2: github.com/cha...
    Icons used in thumbnail by Chandlel via CleanPNG:
    ✅ Python logo: www.cleanpng.c...
    ⭕ Disclaimer:
    Recommended books and tools are affiliate links that gives me a portion of sales at no cost to you, which will contribute to the improvement of this channel's contents.
    #sweetviz #EDA #python #exploratorydataanalysis #datawrangling #datascience #dataanalyst #analytics #machinelearning #dataprofessor #bigdata #machinelearning #datamining #bigdata #ai #artificialintelligence #dataanalytics #dataanalysis #dataprofessor

ความคิดเห็น • 68

  • @DataProfessor
    @DataProfessor  3 ปีที่แล้ว

    👉Watch this video next (How to learn data science in 2021) th-cam.com/video/oR670Txwh88/w-d-xo.html
    Support this Channel 👇👇👇
    🌟 Buy me a coffee www.buymeacoffee.com/dataprofessor
    🌟 Download Kite for FREE www.kite.com/get-kite/?
    👉 Subscribe to this TH-cam channel th-cam.com/users/dataprofessor
    👉 Join the Newsletter of Data Professor newsletter.dataprofessor.org

  • @DatascienceConcepts
    @DatascienceConcepts 3 ปีที่แล้ว +10

    Excellent topic! I see so many students struggling with EDA, this should be helpful.

    • @DataProfessor
      @DataProfessor  3 ปีที่แล้ว

      Thank you for watching! Glad you think so!

  • @AndrewMoMoney
    @AndrewMoMoney 3 ปีที่แล้ว +5

    woot! so many uploads, love it Chanin!

    • @DataProfessor
      @DataProfessor  3 ปีที่แล้ว +1

      Nice to see you here, Andrew! Glad you enjoyed it! 😊

  • @TinaHuang1
    @TinaHuang1 3 ปีที่แล้ว +2

    This is so helpful!! Doing EDA now and gonna check this out. Thank you!!

    • @DataProfessor
      @DataProfessor  3 ปีที่แล้ว

      Awesome, thanks for watching Tina!

  • @margaretblack5474
    @margaretblack5474 11 หลายเดือนก่อน +1

    Thank you for putting this content together! This package is just what I’ve been looking for.

  • @temiwale88
    @temiwale88 3 ปีที่แล้ว +1

    Using this now for quick exploration of my data. Thanks again Data Professor!

    • @DataProfessor
      @DataProfessor  3 ปีที่แล้ว +1

      Thanks for watching Elijah, Glad it was helpful!

  • @x7331x
    @x7331x 2 ปีที่แล้ว +2

    Thanks for this into to sweetviz! Extremely useful and time-efficient!

  • @aaronauta5898
    @aaronauta5898 3 ปีที่แล้ว +1

    Thank you, professor. you made learning data science for me so simple and easy to assimilate, thank you for introducing this library to me. still Waiting for data cleaning in orange lib. thanks again.

  • @minhajali3671
    @minhajali3671 ปีที่แล้ว

    hi, thanks for the help! used it for my final year dissertation

  • @pushkarajpalnitkar1695
    @pushkarajpalnitkar1695 3 ปีที่แล้ว +1

    Wow this is going to save a lot of time..thanks for the video

  • @thinamG
    @thinamG 3 ปีที่แล้ว +1

    It looks amazing, I will try it out soon. Thanks for sharing !!

    • @DataProfessor
      @DataProfessor  3 ปีที่แล้ว +1

      Awesome, looking forward to seeing it in use in one of your amazing LinkedIn posts 😊

    • @thinamG
      @thinamG 3 ปีที่แล้ว +1

      @@DataProfessor Yeah, I will try it soon. 😊

  • @kirandeepmarala5541
    @kirandeepmarala5541 3 ปีที่แล้ว +1

    Hi Professor. thanks For Letting Us to Know This tool..

    • @DataProfessor
      @DataProfessor  3 ปีที่แล้ว

      A pleasure, thanks for watching 😊

  • @sebascol
    @sebascol 3 ปีที่แล้ว +1

    Nice video. Thanks for sharing this amazing tool.

    • @DataProfessor
      @DataProfessor  3 ปีที่แล้ว

      A pleasure, thanks for watching! 😊

  • @etc_be_a_62harshshinde66
    @etc_be_a_62harshshinde66 ปีที่แล้ว

    not working for me says 'DataFrame' object has no attribute 'iteritems' i read on stack overflow that latest pandas has no iteritems, so need to downgrade but still after downgrading pandas still showing this error

  • @TheLucas9amsn
    @TheLucas9amsn 3 ปีที่แล้ว +1

    Thank you for always sharing great content.

  • @cheikhbadiane4596
    @cheikhbadiane4596 3 ปีที่แล้ว +1

    Awesome Professor thanks a lot

    • @DataProfessor
      @DataProfessor  3 ปีที่แล้ว +1

      It's a pleasure, thanks for watching! 😊

  • @Eri3t5co
    @Eri3t5co 3 ปีที่แล้ว

    Amazing library. I'll explore it. Thank you very much.

  • @AndrewClark83
    @AndrewClark83 3 ปีที่แล้ว +1

    Love watching your videos. Thanks for sharing.
    Question though, you split out (species) as your dependant y variable at the start of the video, but then never seemed to use it. Are you doing another video on the same data set where you are predicting the species based on the other inputs?

    • @DataProfessor
      @DataProfessor  3 ปีที่แล้ว

      Thanks for watching! Glad you liked them 😊

    • @DataProfessor
      @DataProfessor  3 ปีที่แล้ว +1

      Great observation, actually I’ve made a prior video on classifying the penguins according to their species and made a web app using the Streamlit library in this video th-cam.com/video/Eai1jaZrRDs/w-d-xo.html

    • @ianpilgrim2818
      @ianpilgrim2818 3 ปีที่แล้ว

      Had problem with step 3,2 will take a further look tomorrow

  • @wojtek8396
    @wojtek8396 3 ปีที่แล้ว

    Nice video unfortunately I get this error all the time'charmap' codec can't decode byte 0x9d in position 1207412: character maps to

  • @bassamal-kaaki3253
    @bassamal-kaaki3253 3 ปีที่แล้ว +1

    Hi data prof, when I run display.html in Anaconda Jupyter notebook I don’t get the same exact large screen you have! The cell is rather small to navigate inside of it. Is this normal? I do however get the html separately in the folder to view.

    • @DataProfessor
      @DataProfessor  3 ปีที่แล้ว +1

      Alternatively, the html file can be opened in an internet browser such as Chrome.

  • @tobymasuku5781
    @tobymasuku5781 3 ปีที่แล้ว

    Thanks very informative. Can you please help on how to cluster zip codes

  • @akshykumar1118
    @akshykumar1118 2 ปีที่แล้ว +1

    Sir is it OK to use this library ,I mean will recruiters accept this or will expect me to go manually with pandas??

    • @DataProfessor
      @DataProfessor  2 ปีที่แล้ว

      This is a great tool for speeding things up, as for job interview preps, it is always nice to go back to basics and be proficient at standard Python libraries like pandas, matplotlib, numpy, scikit-learn, etc.

    • @akshykumar1118
      @akshykumar1118 2 ปีที่แล้ว +1

      @@DataProfessor thankyou sir for responding

    • @DataProfessor
      @DataProfessor  2 ปีที่แล้ว

      @@akshykumar1118 You’re welcome :)

  • @sherifelgazar4089
    @sherifelgazar4089 2 ปีที่แล้ว

    Can u put dataset linl?

  • @fezacongo1692
    @fezacongo1692 3 ปีที่แล้ว +1

    Thank you for sharing

  • @jeffwong1310
    @jeffwong1310 3 ปีที่แล้ว +1

    Any chance that I can keep showing the same HTML in Github as in Colab. When I put the notebook in the github, the HTML display is gone

    • @DataProfessor
      @DataProfessor  3 ปีที่แล้ว +1

      Hi, I recommend to download the generated HTML report and upload this HTML file to GitHub.

    • @DataProfessor
      @DataProfessor  3 ปีที่แล้ว

      Hi, I think its the rendering of the notebook by GitHub that is the issue. HTML is shown in Colab or Jupyter notebook with no issues as it is using the iPython library for the HTML display. GitHub probably doesn’t support this and so it is gone. On a similar note, interactive plotly graphs that are included in some of my Jupyter notebooks when uploaded to GitHub also disappears.

    • @jeffwong1310
      @jeffwong1310 3 ปีที่แล้ว

      @@DataProfessor got it, thanks for the clarifications

  • @stretch8390
    @stretch8390 3 ปีที่แล้ว +1

    X variable is fine but for y = penguins('species') I keep getting a "TypeError of 'DataFrame' object is not callable" which I haven't been able to figure out yet. Anyone know what might be happening? This obviously creates issues with later code cause then 'y' is not defined.

    • @DataProfessor
      @DataProfessor  3 ปีที่แล้ว +1

      Hi, can you try replacing the parenthesis with brackets such that it becomes penguins[‘species’]

    • @stretch8390
      @stretch8390 3 ปีที่แล้ว +1

      @@DataProfessor will give that a try next time thanks Data Professor!

  • @muhammadhusni3097
    @muhammadhusni3097 3 ปีที่แล้ว +1

    So cool!

  • @pauloreis8868
    @pauloreis8868 3 ปีที่แล้ว +1

    Awesome!!! Wait... do you think it's worth trying to "replicate" this library "from scratch"? Anyways, I think I'm going to try only for the fun of it kakaka.
    Thank you, Professor!

    • @DataProfessor
      @DataProfessor  3 ปีที่แล้ว

      Hi Paulo, it's definitely worth a try, to save you time, feel free to download the Notebook (.ipynb) file from the GitHub to give it a try:
      github.com/dataprofessor/code/blob/master/python/Sweetviz.ipynb

  • @chuuchuu2381
    @chuuchuu2381 3 ปีที่แล้ว

    Should freshers use it?

  • @chiazamochiegbu848
    @chiazamochiegbu848 3 ปีที่แล้ว +1

    I'm getting module not found error

    • @chiazamochiegbu848
      @chiazamochiegbu848 3 ปีที่แล้ว +1

      Could you help me resolve this. Thank you

    • @DataProfessor
      @DataProfessor  3 ปีที่แล้ว +1

      Hi, have you installed sweetviz library
      pip install sweetviz

    • @chiazamochiegbu848
      @chiazamochiegbu848 3 ปีที่แล้ว +1

      @@DataProfessor yes I have. J also uninstalled and installed again but still getting same module not found error. Same thing happens with autoplotter, Vaex and pandasgui.
      Please help me.🙏

    • @DataProfessor
      @DataProfessor  3 ปีที่แล้ว +1

      What version of Python are you using? Mine is 3.7 or 3.8 and it is working fine.

    • @chiazamochiegbu848
      @chiazamochiegbu848 3 ปีที่แล้ว

      @@DataProfessor I install through anaconda and it worked

  • @blessingagyeikyem9849
    @blessingagyeikyem9849 3 ปีที่แล้ว +1

    Can I get the ipynb file

    • @DataProfessor
      @DataProfessor  3 ปีที่แล้ว +1

      Thanks for the reminder, here it is github.com/dataprofessor/code/blob/master/python/Sweetviz.ipynb

  • @RichardMonteiro-rx9ew
    @RichardMonteiro-rx9ew 3 หลายเดือนก่อน

    Was good but no point of the x test and train, in this video