Kaggle Earthquake Prediction Challenge

  • Published on Feb 17, 2019
  • Data Science coding challenge time! The popular Data Science competition website Kaggle has an ongoing competition to solve the problem of earthquake prediction. Given a dataset of seismographic activity from a laboratory simulation, participants are asked to create a predictive model for earthquakes. In this video, I'll attempt the challenge as a way to teach 3 concepts: the Data Science mindset, Categorical Boosting, and Support Vector Regression models. I'll be coding this in Python from start to finish in the online Google Colab environment. Enjoy!
    Code for this video:
    github.com/llSourcell/Kaggle_...
    Please Subscribe! And Like. And comment. That's what keeps me going.
    Want more education? Connect with me here:
    Twitter: / sirajraval
    instagram: / sirajraval
    Facebook: / sirajology
    Join us at the School of AI:
    theschool.ai/
    More learning resources:
    www.kaggle.com/c/LANL-Earthqu...
    www.analyticsvidhya.com/blog/...
    blog.griddynamics.com/xgboost...
    towardsdatascience.com/catboo...
    accio.github.io/machinelearni...
    kernelsvm.tripod.com/
    www.saedsayad.com/support_vec...
    / support-vector-regress...
    scikit-learn.org/stable/modul...
    Join us in the Wizards Slack channel:
    wizards.herokuapp.com/
    Please support me on Patreon:
    www.patreon.com/user?u=3191693
    Signup for my newsletter for exciting updates in the field of AI:
    goo.gl/FZzJ5w
    Hit the Join button above to sign up to become a member of my channel for access to exclusive content! Join my AI community: chatgptschool.io/ Sign up for my AI Sports betting Bot, WagerGPT! (500 spots available):
    www.wagergpt.co

Comments • 133

  • @druestaples3044
    @druestaples3044 5 years ago +46

    I really appreciate these Kaggle competitions you do because they're helping me gain the confidence to do my own, which in turn is adding projects to my portfolio!

  • @X_platform
    @X_platform 5 years ago +75

    The best video you have made to date.
    Cutting down the meme, rap, special effects really helped me focus more.
    Keep up the good work!

  • @user-ix9ib4qh1c
    @user-ix9ib4qh1c 5 years ago +1

    Thanks Siraj! I've been studying ML for 3 weeks already and want you to shoot more episodes with more focus on data preparation and feature work, because when scikit-learn has all the algorithms made for us, the question is how to get the most out of them by tuning hyperparameters and so on. I think you get what I mean. This episode seems pretty easy, so I want to see you really make art from it!! Greets from Russia

  • @YeeYeez
    @YeeYeez 5 years ago +2

    Glad you re-did this. More to the point and cleaned up. Keep it up! Papers with code - super cool!

  • @danielmiranda7315
    @danielmiranda7315 5 years ago +15

    Make sure you join the competition on Kaggle. Otherwise, the kaggle download command will fail with a Forbidden error.

  • @thomaswyrick4648
    @thomaswyrick4648 5 years ago +1

    Siraj, this video was gold. Thanks for everything that you do.

  • @NeoCodingGod
    @NeoCodingGod 5 years ago

    @Siraj What's super exciting is hearing what you have to say! Whenever I watch your videos I know I won't be disappointed. You should do more videos about you & what you're up to etc.

  • @akanksharastogi8741
    @akanksharastogi8741 5 years ago

    Would love to see a video by you on SLAM algorithms! Keep up the good work and keep inspiring!

  • @CodeEmporium
    @CodeEmporium 5 years ago +2

    Dude you should make a video on Lingvo (a framework for sequence-sequence modeling) I'd love to hear your thoughts ~

  • @MrMatheus195
    @MrMatheus195 5 years ago +2

    With this video we now know how nervous Siraj is when coding hahah

  • @felipeozoski
    @felipeozoski 5 years ago

    Thank you so much Siraj!! You are the man! Cheers

  • @amitkhetan5062
    @amitkhetan5062 4 years ago +6

    Hey Dude,
    I'm a DS Practitioner from India and I really appreciate 🙌 your work. A year ago, I was working in Mumbai and heard about a few things about you that went viral. I can't ignore that, but I personally feel that you've been a strong influence on the whole data science community. Just be yourself and keep doing what you do.. with citations and references, of course. :D #LetsMakeIndiaAGreatNation 🙌

    • @anilchaudhry804
      @anilchaudhry804 3 years ago

      dude u first go learn datastructures and algorithms

  • @miguelflorian4022
    @miguelflorian4022 5 years ago

    Wow man, you made my day!! Thanks a lot!

  • @siraj3434
    @siraj3434 5 years ago

    the code in step 4 gives me an invalid syntax on the last line
    return pd.Series.(strain)
    What am I doing wrong?
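
A likely culprit is the stray dot: `pd.Series.(strain)` is not valid Python, whereas `pd.Series(strain)` calls the Series constructor. Below is a minimal sketch of a feature function in that shape. The name `gen_features` and the particular statistics are assumptions for illustration and may differ from the notebook.

```python
import pandas as pd


def gen_features(X):
    """Summarize one chunk of signal values as a handful of statistics."""
    strain = []
    strain.append(X.mean())
    strain.append(X.std())
    strain.append(X.min())
    strain.append(X.max())
    # Note: no dot between `Series` and the opening parenthesis.
    return pd.Series(strain)


# Toy input standing in for one chunk of acoustic readings.
features = gen_features(pd.Series([1.0, 2.0, 3.0, 4.0]))
print(features.tolist())
```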

  • @etienneekpo348
    @etienneekpo348 5 years ago

    Amazing Video Siraj, thanks for sharing.

  • @Simba-qm5qs
    @Simba-qm5qs 5 years ago

    Waouwh YouTube videos can be shown on Jupyter :O That is so cool :) Siraj you're just the best !

  • @temitopeoyeyemi899
    @temitopeoyeyemi899 5 years ago +2

    what else can i say......YOU ARE GOOD BRO

  • @debasishjana3125
    @debasishjana3125 5 years ago

    I think you did a live session with this problem. Did you delete it? I couldn't find that on your channel. That was also a great video.

  • @quocanhhbui8271
    @quocanhhbui8271 5 years ago

    Pls pls keep up what you are doing with kaggle. You helped me a lot

  • @jithendrayenugula7137
    @jithendrayenugula7137 5 years ago

    This is a really helpful video, Siraj!!! Helped me a lot... Thanks a lot

  • @elisimic4371
    @elisimic4371 5 years ago

    really high quality video Siraj.

  • @terrydouglasjayasuriya5162
    @terrydouglasjayasuriya5162 1 year ago

    Thanks for the "tutorial" Siraj. Great video and has sure stimulated some new ideas on prediction modeling.

  • @shubham2450
    @shubham2450 5 years ago

    Siraj great video man... really helpful

  • @haoyilu8380
    @haoyilu8380 5 years ago

    Thank you, Siraj! I have two questions: 1. Does CatBoost perform better than LightGBM and XGBoost? 2. Could you please do a video covering the whole process of feature engineering? I've found that features are more important than models, but I don't know how to create fantastic features from the original data.

  • @k.alipardhan6957
    @k.alipardhan6957 5 years ago

    I like these Kaggle videos, but it would be nice to see even more advanced videos

  • @aidenstill7179
    @aidenstill7179 5 years ago +1

    Please do a lesson on creating your own deep learning framework, or tell me where it can be studied.

  • @ravisinghpatel4263
    @ravisinghpatel4263 5 years ago

    can this type of data be classified as segmented data?

  • @shalabhgreatest
    @shalabhgreatest 5 years ago +1

    Hi Siraj,
    In step 4 you use chunksize=150_000 in read_csv. Can you explain how 150_000 works?
    Thanks
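
On the question above: `150_000` is simply the integer 150000, since Python 3.6+ allows underscores in numeric literals for readability. With `chunksize` set, `pandas.read_csv` returns an iterator of DataFrames holding at most that many rows each, rather than loading the whole file at once. Here is a minimal sketch using a tiny in-memory CSV and a chunk size of 4 as stand-ins. The column names are assumed to match the competition's training file, and the values are made up.

```python
import io

import pandas as pd

# Underscores in numeric literals are purely cosmetic (Python 3.6+).
assert 150_000 == 150000

# Stand-in for train.csv: 10 made-up rows with two assumed column names.
csv_text = "acoustic_data,time_to_failure\n" + "\n".join(
    f"{i},{1.0 - i / 10:.1f}" for i in range(10)
)

# With chunksize set, read_csv yields DataFrames of at most `chunksize` rows.
chunk_lengths = []
for chunk in pd.read_csv(io.StringIO(csv_text), chunksize=4):
    chunk_lengths.append(len(chunk))

print(chunk_lengths)  # 10 rows in chunks of 4 -> [4, 4, 2]
```

The last chunk is shorter whenever the row count is not a multiple of the chunk size.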

  • @abilashvr4802
    @abilashvr4802 5 years ago

    Thanks for all your videos Siraj, they're really awesome! Could you please demonstrate how we should approach Multivariate Time Series Prediction?

  • @zamazalotta
    @zamazalotta 5 years ago +1

    Don't know about earthquakes, but this guy's hand-waving (both literally and allegorically) must have triggered the tornado in Alabama.

  • @AbhishekKumar-mq1tt
    @AbhishekKumar-mq1tt 5 years ago +1

    Thank u for this awesome video

  • @etienneekpo348
    @etienneekpo348 5 years ago

    Ahhaha, "Time to Rap". Waiting on the 2019 AI Rap ... : )

  • @perlindholm4129
    @perlindholm4129 5 years ago

    Idea - Could a cause of a volcano eruption be that unwanted material got into the lava? Like CO2 for instance. So an eruption has some very important meaning and is not just a physical effect or random noise event. I assume the lava_model() probably iterates like a sun_model(). So faults in the iteration are not wanted and the lava_model() needs to get rid of them.
    So wonder if you can insert a removable problem in a lava_model() and let the model find it and then move it.
    // Per Lindholm

  • @sampletextsampletext7051
    @sampletextsampletext7051 4 years ago

    can you explain how [24] works because it gives
    TypeError: string indices must be integers

  • @Pa-ow1nj
    @Pa-ow1nj 5 years ago

    thank you man great explanation !

  • @coffeebuzz69
    @coffeebuzz69 5 years ago

    Great video! Could you do a video about genetic algorithms for hyperparameter tuning? maybe compare to other methods like gridsearchcv.

  • @DeependraTube
    @DeependraTube 5 years ago +1

    Very nice !
    I want to ask if we can tweak this AI /ML Algo. to predict "Crimes Probability at any place" and "People's gathering at any event" ???

    • @nikelias1995
      @nikelias1995 5 years ago +1

      There is already software which predicts crime locations and the probabilities for crime

  • @jusgaming99
    @jusgaming99 5 years ago

    i did not understand your command for uploading file

  • @pothny3371
    @pothny3371 5 years ago +13

    Best takeaway of this video is "from IPython.display import YouTubeVideo"

  • @fyaa23
    @fyaa23 5 years ago

    Nice video! Shouldn't you split the training data again into two sets for parameter optimization?

  • @SUGATORAY
    @SUGATORAY 5 years ago +2

    The Kaggle API will only fetch data for competitions one has signed up for. So anyone trying to fetch a Kaggle competition dataset into Colab will first need to sign up for the competition and then use the API.

    • @SUGATORAY
      @SUGATORAY 5 years ago +1

      @Siraj: perhaps if you include this somewhere in the video, for someone new to kaggle competitions following your video will be easier. I had not signed up for the competition at first and tried fetching the list of datasets from kaggle (following your live stream) and got stuck there for a while as it kept on throwing error, until I did some digging and finally signed up for the competition. It is a small detail that could save one a lot of time and trouble.
      Thanks for the great videos. :)

  • @invisibleimpostor299
    @invisibleimpostor299 5 years ago +1

    Yay!
    The older hand intro?????

  • @kaushalmistry4007
    @kaushalmistry4007 5 years ago +2

    Siraj make some videos on solving Numerai Challenge

  • @kabijoshi5949
    @kabijoshi5949 11 months ago

    What a day, I found your video. I just didn't get the code to confirm your claims.

  • @_monnie_
    @_monnie_ 5 years ago

    THANK YOU SO MUCH !!

  • @chournsolidet5406
    @chournsolidet5406 5 years ago

    I am new to Machine Learning and Deep Learning. I have learned some basics and done some small experiments. Now the school task is to join a research challenge on Kaggle. I have no idea how to start. Any suggestions? I would appreciate your response. Thanks in advance.

  • @TaddeusBuica
    @TaddeusBuica 5 years ago

    Hey, i'd like to learn more about stateful LSTMs

  • @AnkitSingh-lb9fj
    @AnkitSingh-lb9fj 5 years ago

    I am not able to download the dataset; the following error is shown:
    Warning: Your Kaggle API key is readable by other users on this system! To fix this, you can run 'chmod 600 /root/.kaggle/kaggle.json'
    403 - Forbidden

    • @yashmjain95
      @yashmjain95 5 years ago

      You have to join the kaggle competition first. This is why you are getting the forbidden error

  • @bambangkurniawan7992
    @bambangkurniawan7992 5 years ago

    Siraj, can you make tutorial about prediction keywords with high volume or will be booming in 1/2 months from google trends data using machine learning? it would be awesome. Thanks

  • @radwaneberredai6622
    @radwaneberredai6622 5 years ago +1

    HEY SIRAJ WE need kaggle for beginner

  • @munawarhussain7878
    @munawarhussain7878 4 years ago

    I Learn a lot from you boss, learn a lot.

  • @gauravchaudhari9279
    @gauravchaudhari9279 5 years ago

    Awesome! :)

  • @brianchaplin278
    @brianchaplin278 2 years ago

    Great job

  • @AsifAhmed-bf5hg
    @AsifAhmed-bf5hg 5 years ago +4

    Can you please make video on neural network application on neuroscience

    • @SirajRaval
      @SirajRaval 5 years ago

      search siraj math of neuroscience

  • @aquibk
    @aquibk 5 years ago

    Can you please make videos on probabilistic graphical models from very basics.

  • @nitin-goyal
    @nitin-goyal 5 years ago

    Next, VSB Power Line Fault Detection?

  • @thankyouthankyou1172
    @thankyouthankyou1172 5 years ago

    9:38 so, the data was downloaded to the 'cloud'? and we cannot see the data locally? how to check the data in the cloud? where is the data exactly?

  • @ali985411
    @ali985411 5 years ago

    My Visualization graphs are different from yours while I'm implementing your code with same data, I'm a complete beginner (rather just started learning data science). Please help me in this regard.

  • @yimingzhao3081
    @yimingzhao3081 5 years ago

    i just dont usually see you use deep learning/tensorflow to solve problems like this, why? is that not fit to solve kaggle cases?

    • @Manu321024
      @Manu321024 5 years ago

      not enough data

  • @vamsikalyan8013
    @vamsikalyan8013 5 years ago +1

    Please do videos on Kaggle for beginners,
    thank you

  • @qusha588
    @qusha588 5 years ago

    Are there any channels like Siraj's?

  • @deltatv9335
    @deltatv9335 5 years ago

    Thanks buddy🕺

  • @vedantkulkarni1711
    @vedantkulkarni1711 5 years ago

    I'm unable to import the LANL-Earthquake-Prediction dataset by following the code from the video. I get the error "403 - Forbidden". How to fix this?

    • @crjuncher1
      @crjuncher1 5 years ago

      have same issue

    • @antoniamendo
      @antoniamendo 5 years ago

      Did you enroll in the competition?
      Also there is a typo error, in the video he wrote ~/kaggle/kaggle.json, but file has to be placed in ~/.kaggle/kaggle.json.
      The github code is cleaner:
      github.com/llSourcell/Kaggle_Earthquake_challenge/blob/master/Earthquake_Challenge.ipynb
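
To make the path point concrete, here is a minimal sketch of placing the token where the Kaggle client looks for it, in a `.kaggle` directory (with the leading dot) under the home directory, with owner-only permissions as the `chmod 600` warning elsewhere in this thread suggests. The helper name `install_kaggle_token` is hypothetical, and the demonstration uses a throwaway directory and a fake token rather than a real account.

```python
import json
import os
import shutil
import stat
import tempfile
from pathlib import Path


def install_kaggle_token(token_path, home_dir):
    """Copy kaggle.json into <home_dir>/.kaggle and make it owner-only."""
    target_dir = Path(home_dir) / ".kaggle"  # note the leading dot
    target_dir.mkdir(parents=True, exist_ok=True)
    target = target_dir / "kaggle.json"
    shutil.copyfile(token_path, target)
    # Equivalent of `chmod 600`: read/write for the owner only.
    os.chmod(target, stat.S_IRUSR | stat.S_IWUSR)
    return target


# Demonstration in a throwaway directory with a fake token.
with tempfile.TemporaryDirectory() as tmp:
    fake_token = Path(tmp) / "kaggle.json"
    fake_token.write_text(json.dumps({"username": "example", "key": "not-a-real-key"}))
    installed = install_kaggle_token(fake_token, Path(tmp) / "home")
    installed_name = installed.name
    installed_parent = installed.parent.name
    installed_mode = stat.S_IMODE(installed.stat().st_mode)
    print(installed_parent, installed_name, oct(installed_mode))
```

In Colab, `home_dir` would presumably be `/root`, going by the error messages quoted in this thread. Even with the file in place, the download still returns 403 until the competition has been joined on the Kaggle site.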

  • @user-ur5kp2zd9l
    @user-ur5kp2zd9l 5 years ago

    Hi! It's a great video. It helped me a lot. I tried to follow the same steps to learn,
    but I'm stuck at step 4. I tried to debug, but I still get the error " module 'numpy' has no attribute 'quantile' ".
    I already tried installing a newer numpy (numpy==1.15.1) and upgrading.
    But the error is still the same. I see your video doesn't have this problem. Why does it happen?

    • @stickyas1
      @stickyas1 5 years ago

      !pip install kaggle
      !pip install numpy==1.16.4
      !pip install catboost
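
For context, `np.quantile` was added in NumPy 1.15.0, so this error usually means an older NumPy is still the one loaded. In Colab an upgrade typically takes effect only after the runtime is restarted, though that is an assumption about the cause here and not something the thread confirms. The sketch below checks the loaded version and falls back to `np.percentile`, which computes the same value on a 0-100 scale. The wrapper name `quantile` is just for illustration.

```python
import numpy as np

print(np.__version__)  # the version actually loaded in this session


def quantile(values, q):
    """Return the q-th quantile (0 <= q <= 1), even on NumPy < 1.15."""
    if hasattr(np, "quantile"):
        return np.quantile(values, q)
    # np.percentile takes q on a 0-100 scale instead of 0-1.
    return np.percentile(values, q * 100)


data = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
q95 = quantile(data, 0.95)
print(q95)
```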

  • @GauravGuptaa
    @GauravGuptaa 5 years ago

    @Siraj help....
    When trying to download earthquake data, it shows forbidden 403 error in Google colab

    • @randykat
      @randykat 5 years ago

      You need to join the competition on the kaggle website to have access to download the data. On the kaggle website do a search for LANL Earthquake Prediction then click the join the competition button.

    • @GauravGuptaa
      @GauravGuptaa 5 years ago

      @@randykat thanks, but I had already enrolled.
      Anyway, I figured it out :D
      I was running out of storage space :P

  • @asharkhan6714
    @asharkhan6714 5 years ago

    Another Great Video!

  • @earthnote
    @earthnote 3 years ago

    can you make one for real earthquakes? just for fun, maybe use IRIS earthquake database

  • @NetViewer-hw2tu
    @NetViewer-hw2tu 4 years ago

    Understanding those fault movements will get us nowhere because there are always new ones appearing... understanding the causes of those movements would be something...😊😊

  • @priyankachauhan8503
    @priyankachauhan8503 5 years ago

    Make a video on twitter sentiment analysis in python with dataset

  • @tylerreichardt3270
    @tylerreichardt3270 4 years ago

    Can someone explain why this error is occurring:
    Warning: Looks like you're using an outdated API Version, please consider updating (server 1.5.6 / client 1.5.4)
    403 - Forbidden
    TIA

    • @anitakhandelwal553
      @anitakhandelwal553 4 years ago

      This might have happened when you tried to use the kaggle API for downloading the data. You have to first accept the rules of the competition (on Kaggle) in order to use the API .

  • @bhuvaneshs.k638
    @bhuvaneshs.k638 5 years ago

    Finally..!

  • @deepaknenyana
    @deepaknenyana 5 years ago

    thanks siraj

  • @EricHathaway
    @EricHathaway 5 years ago

    awesome

  • @phillipmarnell9966
    @phillipmarnell9966 4 years ago

    You make this so clear! But... using Colab, I'm getting this outdated API message, and a '403 Forbidden' message Any ideas on fix/workaround?
    Saving kaggle.json to kaggle.json
    User uploaded file "kaggle.json" with length 68 bytes
    Warning: Looks like you're using an outdated API Version, please consider updating (server 1.5.6 / client 1.5.4)

  • @quarkquark7882
    @quarkquark7882 1 year ago

    Zarah effect random is the best earthquake forecast in the world; it can predict an earthquake minutes before it happens at the epicenter. I challenge Kaggle.

  • @HarshitDaftary
    @HarshitDaftary 5 years ago

    First time you made a proper video. This really helps to learn.

  • @nayeemsunny2573
    @nayeemsunny2573 5 years ago

    bro, while I'm trying to download the dataset it's giving me "403 forbidden". Help me bro

    • @randykat
      @randykat 5 years ago +1

      You need to join the competition on the kaggle website to have access to download the data. On the kaggle website do a search for LANL Earthquake Prediction then click the join the competition button.

    • @nayeemsunny2573
      @nayeemsunny2573 5 years ago

      @@randykat thanks

  • @martincastellano89
    @martincastellano89 5 years ago

    hi there is a gpt2 video??? (by siraj)

    • @aiwebbiz8532
      @aiwebbiz8532 5 years ago

      blog.openai.com/better-language-models/

  • @vidusha123
    @vidusha123 5 years ago +1

    I’ve never seen him so uncomfortable lol.
    Great video nevertheless I learnt a lot. Thank you so much sir.

  • @user-ym5kh5pq2x
    @user-ym5kh5pq2x 2 years ago

    Earthquake prediction is possible .. time and magnitude .. with 1% error .. Prediction Formula is generated .. and passed testing and verification phase ..

  • @proturguenev1820
    @proturguenev1820 5 years ago +1

    We're in first place, we're the best country in Chile

  • @samuelsama8780
    @samuelsama8780 5 years ago

    WHAT IS A PERSON IN DATA SCIENCE CALLED A QUANT

  • @annkauffman8026
    @annkauffman8026 4 years ago

    IOError: Could not find kaggle.json. Make sure it's located in /root/.kaggle. Or use the environment method 😢
    It fails both when I type the kaggle.json upload and the next steps myself and when I copy them from GitHub :(
    By the way, Siraj gets the same error, mv: target '/root/kaggle/kaggle.json' is not a directory, yet his code worked.

    • @annkauffman8026
      @annkauffman8026 4 years ago

      yikes, I missed second '&&' before chmod >_

  • @junkbingo4482
    @junkbingo4482 2 years ago

    well it's like financial time series; when i was young i learned things, and i seriously doubt one can find out smtg useful.....

  • @user-vp5io1so3i
    @user-vp5io1so3i 4 years ago

    Hi Siraj, I suspect the pseudocode you showed at 22:13 was AdaBoost rather than gradient boosting. Or did I misunderstand the concept?
    Check this link out www.quora.com/What-is-the-difference-between-gradient-boosting-and-adaboost
    Nice video by the way. Helps me a lot.
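
For comparison, here is a minimal sketch of gradient boosting for squared loss. Each round fits a weak learner to the current residuals (the negative gradient) and adds a shrunken copy of it to the ensemble, whereas AdaBoost reweights the training examples between rounds. The stump learner, toy data, and parameter values are illustrative assumptions, not the video's code.

```python
import numpy as np


def fit_stump(x, residual):
    """Find the threshold split on 1-D x that best fits the residuals."""
    best = None
    for threshold in np.unique(x)[:-1]:
        left = residual[x <= threshold].mean()
        right = residual[x > threshold].mean()
        pred = np.where(x <= threshold, left, right)
        error = np.sum((residual - pred) ** 2)
        if best is None or error < best[0]:
            best = (error, threshold, left, right)
    return best[1:]


def gradient_boost(x, y, n_rounds=50, learning_rate=0.1):
    """Gradient boosting for squared loss: each stump fits current residuals."""
    base = y.mean()
    pred = np.full(len(y), base)
    stumps = []
    for _ in range(n_rounds):
        residual = y - pred  # negative gradient of 0.5 * (y - pred)^2
        threshold, left, right = fit_stump(x, residual)
        pred = pred + learning_rate * np.where(x <= threshold, left, right)
        stumps.append((threshold, left, right))
    return base, stumps, pred


# Toy step-function data: the ensemble should approach it round by round.
x = np.linspace(0.0, 1.0, 40)
y = np.where(x < 0.5, 1.0, 3.0)
base, stumps, fitted = gradient_boost(x, y)
initial_mse = np.mean((y - base) ** 2)
final_mse = np.mean((y - fitted) ** 2)
print(initial_mse, final_mse)
```

Libraries such as CatBoost, XGBoost, and LightGBM build on this residual-fitting idea with deeper trees and many refinements.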

  • @dhruvrnaik
    @dhruvrnaik 5 years ago

    ♥️♥️🔥

  • @NarendraKumar-qt8sj
    @NarendraKumar-qt8sj 5 years ago

    !kaggle competitions download -c LANL-Earthquake-Prediction
    I am getting 403 - Forbidden error while i am trying to download the dataset from kaggle

    • @SUGATORAY
      @SUGATORAY 5 years ago

      Narendra: Go to Kaggle and sign up for the competition from the competition’s page. This needs to be done manually first, before Kaggle lets you access the competition’s data through the API. Good luck. 👍

    • @NarendraKumar-qt8sj
      @NarendraKumar-qt8sj 5 years ago

      @@SUGATORAY Thank u ,it helped

    • @wasgeht2409
      @wasgeht2409 5 years ago

      @@SUGATORAY I did it but it didn't work , get the same error

    • @SUGATORAY
      @SUGATORAY 5 years ago

      @@wasgeht2409 Follow the instructions as in this Github gist: gist.github.com/sugatoray/9244867c89fd3868ec8e5a5ba7341901#file-kaggle_import_dataset_public-ipynb

  • @AnkitSingh-lb9fj
    @AnkitSingh-lb9fj 5 years ago

    just want to know whether is it legal to discuss an ongoing challenge??

  • @user-ym5kh5pq2x
    @user-ym5kh5pq2x 2 years ago

    Please provide your data .. and I will predict the next coming earthquake .. time-wise and magnitude ..

  • @werthersoriginal
    @werthersoriginal 5 years ago +6

    Just slap a GPS collar on to my ex wife. The closer she gets to your location, an earthquake is a comin. Bam, project done! No ML needed.

  • @adityabachhawat8727
    @adityabachhawat8727 5 years ago

    why making a video twice?

  • @mrme8521
    @mrme8521 1 year ago

    watching this makes me realize how much of a rookie I am...

  • @antoniamendo
    @antoniamendo 5 years ago

    Long live Chile, dude

  • @pranjalpathak4498
    @pranjalpathak4498 5 years ago

    He should have told that he is actually using google colab and not jupyter

    • @stickyas1
      @stickyas1 5 years ago

      I was thinking he used a Kaggle kernel, but he is talking about Google Colab around 5:10.

  • @nayeemsunny2573
    @nayeemsunny2573 5 years ago

    hey bro i'm the first viewer woho!!!!!!!!!!

  • @xrystalskyes2838
    @xrystalskyes2838 5 years ago

    Lol we live on a flat earth right

  • @EhsanIrshad
    @EhsanIrshad 5 years ago

    first

  • @rickyzhang
    @rickyzhang 5 years ago

    No offense. I don’t see the value that you added descriptive statistics in one single value to all rows as features. You must do a prank or make yourself like a fool.

    • @antoniamendo
      @antoniamendo 5 years ago

      does the magic. He iterates over chunks, not single rows. I was wondering the same.

  • @arashkhajooei5944
    @arashkhajooei5944 4 years ago +4

    Useless information that explained everything unclearly! It shows that you just memorized it and didn't understand it in depth. I just wasted my time watching this.

  • @MichaelMantion
    @MichaelMantion 5 years ago

    how does a womens vajaja muscle predict earthquakes?