How to Create a Dataset for Machine Learning

  • Published on Jan 16, 2020
  • ➡️ Wanna watch this video without ads and see exclusive content? Go to nebula.tv/jordan-harrod 👀
    Data, what is it good for? Pretty much everything, actually.
    Twitter - / jordanbharrod
    Facebook - / jordanbharrod
    Instagram - / jordanbharrod
    Thank you to Kasia, Jeff, Gerald, Milan, Ian, Becky, Jino, Daniel, Narskogr, Jason, and Mariano for being $5+/month Patrons!
    everydAI is a YouTube channel focused on highlighting the ways we interact with artificial intelligence every day.
    Sources:
    A Survey on Data Collection for Machine Learning (Peer Reviewed Journal Version - $$$) - ieeexplore.ieee.org/abstract/...
    A Survey on Data Collection for Machine Learning (arxiv preprint - Free) - arxiv.org/pdf/1811.03402.pdf
    How to Build A Data Set for ML - towardsdatascience.com/how-to...
    YouTube-8M Website - research.google.com/youtube8m/
    Stall Catchers - stallcatchers.com/
  • Science & Technology

Comments • 49

  • @4year4thyear 3 years ago +5

    My professor recently voluntold me to build an AI model for data in our research lab. Thank you for this lifeline!

  • @opejohn1116 4 years ago +1

    Hello Jordan. Thanks for this video. I want to ask how you feel about using permutations and combinations to generate more rows for a dataset.

  • @BuddhiKavindra 1 year ago +1

    Thanks! This is such a clear explanation.

  • @Darksagan 1 year ago +1

    I have no idea why I searched this topic. But you did a great job explaining it. lol

  • @abdelbassethechaichi4422 3 years ago

    That was really helpful, thank you very much

  • @BenjaSerra 4 years ago +13

    Nowadays every tutorial just talks about how to create deep learning models, but almost none talk about how to prepare your own dataset and how important data is in machine learning.
    Subscribed += 1

    • @BenjaSerra 4 years ago +1

      If it is possible for you, it would be great if you made a video about how to prepare your own datasets with sample code, depending on which case you are working on (text classification, images, etc.). As I said, there's almost no video about it. Thank you and greetings from Chile!

    • @sushantrauthan5704 4 years ago +2

      Couldn't agree more

  • @parietal100 1 year ago +1

    Great overview which I have shared with my team. Thanks.

  • @MyGraden 1 year ago +1

    Can you send or show an example of a clean, machine-learning-ready dataset?

  • @0xredpill 4 years ago

    thanks for sharing such knowledge in 20AI

  • @aguslimanto6766 3 years ago +1

    Hi, how do I create a chemical structure dataset to predict the QSAR? I'm looking for a simple tutorial for my future Ph.D. study. Thank you.

  • @codebits4461 3 years ago

    I like your hair, and great video. The content is exactly what I was looking for and more😁

  • @surendranathreddy7114 3 years ago

    Amazing! Thank you!

  • @HolyFacts 1 year ago

    Thank you, and subscribed!

  • @anandsheth5490 1 year ago +1

    Jordan - what would you say is the minimum quantity of data required? For example, for a specialized NLP model: 20,000 lines of text? 50K?

  • @alexshortsplus 4 years ago +6

    Jordan, you're great with AI topics❤🙏

  • @benvolioombese9109 4 years ago

    I really liked this!!
    How can I learn this, please?

  • @TheAmit4sun 1 year ago +1

    I am sold. Just subscribed. This is such an important topic, but there are hardly any good videos on it. God bless.

  • @viniciusoliveira4798 4 years ago +1

    this is so helpful

  • @LofiMoodCrafts 2 years ago

    Hey Jordan, where can I get a network intrusion detection system dataset?

  • @muneebabbas2424 2 years ago

    Is a dataset of text files (.txt) for training and testing generally viable for machine learning algorithms? I'm very new to this area and want to know before I start trying to create machine learning algorithms.

  • @lakeguy65616 4 years ago +17

    This is a great and important topic. I hope you'll spend more time talking about it, particularly numerical data sets: cleaning, accounting for missing values, scaling, normalizing, etc.

    • @JordanHarrod 4 years ago +2

      Definitely planning to go deeper into this in future AI 101 videos!

    • @farahwael8806 1 year ago

      @JordanHarrod Hello Jordan
      I want to study at Harvard University. How can I do this? Please answer me 🌸🌸🌸
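For the numerical clean-up steps listed at the top of this thread (missing values, scaling), a minimal sketch in plain Python might look like the following. This is not from the video; the sample column is made up, mean imputation and min-max scaling are just one common choice each, and libraries such as scikit-learn offer equivalents (`SimpleImputer`, `MinMaxScaler`) that are usually preferable on real data.

```python
def impute_missing(values):
    """Replace None entries with the mean of the observed values."""
    observed = [v for v in values if v is not None]
    mean = sum(observed) / len(observed)
    return [mean if v is None else v for v in values]


def min_max_scale(values):
    """Rescale values linearly to the range [0, 1]."""
    low, high = min(values), max(values)
    if high == low:
        # A constant column carries no spread; map every entry to 0.0.
        return [0.0 for _ in values]
    return [(v - low) / (high - low) for v in values]


# Hypothetical column of measurements with two missing entries.
raw_column = [10.0, None, 30.0, 20.0, None]
filled = impute_missing(raw_column)
scaled = min_max_scale(filled)
print(filled)
print(scaled)
```

Whether mean imputation is appropriate depends on why the values are missing; dropping rows or using a median are equally reasonable starting points.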

  • @xabisontloko3574 4 years ago

    Nice, thank you for this great video.

    • @JordanHarrod 4 years ago +1

      Thank you for watching!

  • @rnsfebay1 7 months ago +1

    Thanks!

  • @anthonyjoshua7148 3 years ago

    How can I develop a dataset of student results to predict their project topics?

  • @adamderose9468 1 year ago

    I may have missed it, but reCAPTCHA is a cool example of crowdsourced labeling.

  • @marcoprimo4042 3 years ago +1

    How do I practically create a dataset, meaning what program or software do I use?

    • @JordanHarrod 3 years ago +1

      There are a few options. If you're using a public, well-known dataset, you can usually load it from an existing machine learning API (e.g. TensorFlow, Keras, PyTorch, fast.ai) using one of their predefined methods. If you're using something custom, things become a lot more complicated, and it depends a lot on what data you're loading, the format it is in, and how much pre-processing you need to do to get it to the point where you can use it as training data.
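To make the custom case in the reply above a little more concrete, here is a minimal sketch, assuming the data lives in a small CSV file. For well-known public datasets the predefined loaders are typically a single call (for instance `keras.datasets.mnist.load_data()` in Keras). Everything below is hypothetical and not from the video: the column names, the sample rows, and the helper names `load_rows` and `train_test_split` are invented for illustration, and only the Python standard library is used.

```python
import csv
import io
import random

# Hypothetical CSV contents standing in for a custom data file;
# the column names ("height_cm", "weight_kg", "label") are made up.
RAW_CSV = """height_cm,weight_kg,label
170,65,a
182,80,b
165,55,a
190,95,b
175,70,a
"""


def load_rows(csv_text):
    """Parse CSV text into (features, label) pairs with numeric features."""
    reader = csv.DictReader(io.StringIO(csv_text))
    rows = []
    for record in reader:
        features = [float(record["height_cm"]), float(record["weight_kg"])]
        rows.append((features, record["label"]))
    return rows


def train_test_split(rows, test_fraction=0.2, seed=0):
    """Shuffle a copy of the rows and hold out a fraction for testing."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, int(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


rows = load_rows(RAW_CSV)
train_rows, test_rows = train_test_split(rows)
print(len(train_rows), "training rows,", len(test_rows), "test rows")
```

Real custom data (images, audio, free text) usually needs format-specific parsing and more pre-processing than this, which is the "a lot more complicated" part of the reply.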

  • @_abhishek_08_ 1 year ago

    Hey, how can I reach out to you in case of any doubts? I am an engineering student, and for my final project I need to build something that requires a dataset, and I have no clue where to start.

  • @md.masudurrahman5852 5 months ago

    Hi, can you please show an AI dataset live?

  • @amiryavariabdi8962 3 years ago +3

    Dear artificial intelligence community,
    I am pleased to introduce the DIDA dataset, which is the largest handwritten digit dataset. I would be grateful if you could help me introduce this dataset to the community.
    Thanks

  • @semilshah8252 3 years ago

    I am working on a project, Sentiment Analysis Based on Social Media Posts, and I need to create a dataset in which data is extracted from Instagram, Facebook, and Twitter. Please describe, step by step, how I should go about it.

    • @olaniyiajayi4319 3 years ago

      You should go with scraping the social media sites you mentioned. There are libraries out there that can help you scrape each site easily.

    • @hocineb8483 1 year ago

      Bro thought Jordan is ChatGPT or smth

  • @slametwidodo727 3 years ago

    A dataset is very important before we process data into scientific data.

  • @njmagay0223 2 years ago

    How do I make a dataset from Facebook?🥺

  • @vauths8204 1 year ago

    I'm ready to simp. This was a great video and really helps me with my goal of ruling the world. Thank you, future queen.

  • @auguststas7770 1 year ago

    cool

  • @DeepFrydTurd 2 years ago

    AI is like a baby and we are the parents. It's in its infancy and will develop into a self-aware adult.

  • @shashwatvaibhav2769 3 years ago

    So, you hardly blink...right??

  • @haz1615 7 months ago

    I thought it was a practical vid, but it's just a talking video.

  • @BERNARD7269 3 years ago

    Who taught her to speak like that
