How to Deploy ML Solutions with FastAPI, Docker, & AWS

แชร์
ฝัง
  • เผยแพร่เมื่อ 29 ม.ค. 2025

ความคิดเห็น • 57

  • @ShawhinTalebi
    @ShawhinTalebi  8 หลายเดือนก่อน +2

    More on Full Stack Data Science👇
    👉Series Playlist: th-cam.com/play/PLz-ep5RbHosWmAt-AMK0MBgh3GeSvbCmL.html
    💻Example Code: github.com/ShawhinT/TH-cam-Blog/tree/main/full-stack-data-science/ml-engineering

  • @FREAK-st6kk
    @FREAK-st6kk 5 หลายเดือนก่อน +5

    One of the best aspects of AWS Elastic Cloud is how seamlessly everything comes together, whether you're using FastAPI or Docker. It's all integrated beautifully.

  • @divyanshtripathi4867
    @divyanshtripathi4867 7 หลายเดือนก่อน +4

    This is such a great video, no nonsense straight to the point!

  • @simonmafanye.3537
    @simonmafanye.3537 21 วันที่ผ่านมา

    Your detail explanations gained my subscription.
    Thanks a lot for sharing.

  • @waleedashraf9t
    @waleedashraf9t 2 หลายเดือนก่อน +1

    One of the best video available. on the internet for deployment.

    • @ShawhinTalebi
      @ShawhinTalebi  2 หลายเดือนก่อน +1

      Glad it was helpful :)

  • @MuhammadTaha-p8z7h
    @MuhammadTaha-p8z7h หลายเดือนก่อน

    you rock it men...The thing I was searching for days...

  • @pawe5560
    @pawe5560 8 หลายเดือนก่อน +7

    Hey Shawn, videos on FastAPI and Docker from you would be great.

    • @ShawhinTalebi
      @ShawhinTalebi  8 หลายเดือนก่อน

      Thanks for the suggestion! I'll add it to my queue :)

  • @miraclechijioke1213
    @miraclechijioke1213 4 หลายเดือนก่อน

    This was so simplified. thank you Shawhin

  • @Coret-with-c
    @Coret-with-c 4 หลายเดือนก่อน +1

    Super detailed explanation, thank you!

  • @tusharrohilla7154
    @tusharrohilla7154 2 หลายเดือนก่อน

    It was amazing ...love this type of content❤

  • @nocomments_s
    @nocomments_s 4 หลายเดือนก่อน

    I like your video, it deserves much more views

  • @jonathankerr2479
    @jonathankerr2479 2 หลายเดือนก่อน

    Fantastic video. Thank you for this!

  • @rahulkrishna3043
    @rahulkrishna3043 2 หลายเดือนก่อน

    great video ! very informative . Thanks a lot :)

  • @brianmorin5547
    @brianmorin5547 8 หลายเดือนก่อน

    This is fantastic stuff as I’m pulling out my hair on this same step.
    You have the right idea for the next video, but I think the next one after that is making the chat interface publicly accessible

    • @ShawhinTalebi
      @ShawhinTalebi  8 หลายเดือนก่อน

      Great suggestion Brian! There are several ways one can do this. The simplest and cheapest would be hosting it via HuggingFace Spaces: huggingface.co/spaces/launch
      However, for this specific use case the most practical option would be to embed it into my Squarespace website. I'll need to do some more digging to see the best way to do that.

    • @brianmorin5547
      @brianmorin5547 8 หลายเดือนก่อน

      @@ShawhinTalebi The streamlit cloud has been my go-to so far. I am toying with creating a react front end template but would like to see what others are doing

  • @jeeldoshi195
    @jeeldoshi195 หลายเดือนก่อน

    Amazing video...really helped alot.

  • @pedro.henrique-k5z
    @pedro.henrique-k5z 2 หลายเดือนก่อน

    Hi! Great video! Thanks for sharing with us! Do you have any recommendation on what to study next to learn more about these topics or to go deeper in it?

    • @ShawhinTalebi
      @ShawhinTalebi  2 หลายเดือนก่อน

      Happy to give recs. What's your end goal?

  • @SachinJadhav-p8w
    @SachinJadhav-p8w หลายเดือนก่อน

    I want to know why you didnt consider importing the sentence transfromer using model.pretrained() instead. Was it causing issues in launching the server. I am trying to do a similar thing with a RAG model and it is giving me issues such that the server keeps on loading.

    • @ShawhinTalebi
      @ShawhinTalebi  หลายเดือนก่อน +1

      You can do this too! The one hiccup is the first inference will take while since the model will need to download from the internet.

    • @SachinJadhav-p8w
      @SachinJadhav-p8w หลายเดือนก่อน

      @@ShawhinTalebi Thanks for replying back. Appreciate it!

  • @emmanuell8260
    @emmanuell8260 19 วันที่ผ่านมา

    Is it possible to predict by using API with POST ?

    • @ShawhinTalebi
      @ShawhinTalebi  17 วันที่ผ่านมา

      My understanding of a POST request is it will pass input data to a backend data store. I don't think that helps much in generating a prediction from a model. For that, GET makes more sense.

  • @7_Maverick_7
    @7_Maverick_7 หลายเดือนก่อน

    Excellent video!! Hey just one small question, the API and the model combinedly are dockerized before hosting on AWS right ?

    • @ShawhinTalebi
      @ShawhinTalebi  หลายเดือนก่อน

      Yes! The model lives in the Docker container,

  • @dhirajkumarsahu999
    @dhirajkumarsahu999 8 หลายเดือนก่อน +1

    Thank you so much, such videos are really very helpful

    • @ShawhinTalebi
      @ShawhinTalebi  8 หลายเดือนก่อน +1

      Glad to hear :)

  • @Smrigankiitk
    @Smrigankiitk 4 หลายเดือนก่อน

    how can we integrate streamlit to make the UI to get input and send to model and display the output here ?

    • @ShawhinTalebi
      @ShawhinTalebi  4 หลายเดือนก่อน

      Good question. This should be similar to the Gradio example shown 25:54.
      This blog post might be helpful: blog.streamlit.io/create-a-search-engine-with-streamlit-and-google-sheets/

  • @karmax7199
    @karmax7199 หลายเดือนก่อน

    Hi Shaw!
    Tysm for this tutorial, I'm learning a lot of cool stuff thanks to you
    I have a question : can you access directly your web interface (made from gradio) directly in a google tab and notwith a jupyter notebook ?
    I'm thinking if you have your security group, it should work if the saas is deployed on AWS, without having to do much more stuff. Maybe i'm mssing something

    • @ShawhinTalebi
      @ShawhinTalebi  หลายเดือนก่อน +1

      Glad it was helpful! I don't know what a Google tab is, but you can definitely deploy the UI as an app. Basically you'd create another container (on AWS of the like) to run the gradio (i.e. uvicorn) app and have it talk to the search API container.

  • @angelandresbejarmerma9994
    @angelandresbejarmerma9994 หลายเดือนก่อน

    This is such a great video, Is it normal for the image to be 6.91gb in size? Why is yours so much smaller?

    • @ShawhinTalebi
      @ShawhinTalebi  หลายเดือนก่อน +1

      That might be happening because your are creating a multi-platform image?

    • @angelandresbejarmerma9994
      @angelandresbejarmerma9994 หลายเดือนก่อน

      @@ShawhinTalebi I followed all the steps in the video; it might be because I use debian gnu/Linux .

  • @RatherBeCancelledThanHandled
    @RatherBeCancelledThanHandled 4 หลายเดือนก่อน +1

    well done , Thanks !

  • @fatimayousaf1644
    @fatimayousaf1644 4 หลายเดือนก่อน +1

    Hey Shaw!
    what a beautiful content. I followed all the steps from ML APP from scratch till deployment but at last moment, I don;t have AWS free tier account as they still ask to enter the debit card details for having free access, so can you please tell me another way round to cope up with this ?

    • @ShawhinTalebi
      @ShawhinTalebi  3 หลายเดือนก่อน

      Glad you liked it! You could try deploying to railway (railway.app/). I just used them for a project and don't think I needed to input credit card info.

  • @ilyesbouzidi7837
    @ilyesbouzidi7837 2 หลายเดือนก่อน

    i have been following this video, and everything went perfectly fine, until the container deployment on AWS (i think because I am working with an LLM and therefore i need GPU)
    what should i do in this situation?

    • @ShawhinTalebi
      @ShawhinTalebi  หลายเดือนก่อน

      Self-hosting LLMs can get tricky. I'd recommend using an API like OpenAI, TogetherAI, or the like if possible.

    • @ilyesbouzidi7837
      @ilyesbouzidi7837 หลายเดือนก่อน

      @@ShawhinTalebi the problem is that I have my own fine-tuned LLM on huggingface, i cannot use another one

  • @ax5344
    @ax5344 3 หลายเดือนก่อน

    where does K8S fit in in this pipeline?

    • @ShawhinTalebi
      @ShawhinTalebi  3 หลายเดือนก่อน

      Good question. In my experience, Kubernetes is rarely used in DS/ML, so I wouldn't worry learning it if your just getting started.

  • @paulohss2
    @paulohss2 4 หลายเดือนก่อน

    Great tutorial... Im still not able to connect to the API unfortunately (This site can’t be reached,refused to connect.) even though I followed the network config steps you explained...

    • @ShawhinTalebi
      @ShawhinTalebi  3 หลายเดือนก่อน

      To confirm, you added inbound rules in the VPC dashboard to allow all incoming traffic from your IP?
      Does the IP listed in the inbound rules match yours?

  • @angieyoon9900
    @angieyoon9900 7 หลายเดือนก่อน

    Thank you!

  • @MuhammadTaha-p8z7h
    @MuhammadTaha-p8z7h หลายเดือนก่อน

    can you add this for RAG chatbot through langchain plzzz

    • @ShawhinTalebi
      @ShawhinTalebi  หลายเดือนก่อน

      That's a cool idea! I've been thinking of a few product idea which may align with this. Hopefully I can cover this in 2025 :)

  • @thonnatigopi
    @thonnatigopi 7 หลายเดือนก่อน

    Bro create a video for handling post and get request and multiple endpoints using fast api dockerize and ECR and aws lambda functions

    • @ShawhinTalebi
      @ShawhinTalebi  7 หลายเดือนก่อน

      Great suggestion. I'll add that to my list!

  • @abbasrabbani7665
    @abbasrabbani7665 8 หลายเดือนก่อน

    that was an awesome video, I have a task for one click ml model deployment on aws, azure and GCP, like one click on aws and other click on azure. CAn u please guide me shortly the roadmap...!

    • @ShawhinTalebi
      @ShawhinTalebi  8 หลายเดือนก่อน

      Thanks for your comment. Sorry I'm not sure what you mean by "one click ml model deployment". Could you share more details?

  • @parthkaria1023
    @parthkaria1023 หลายเดือนก่อน

    GOATED

  • @vivekasthana12345
    @vivekasthana12345 4 หลายเดือนก่อน

    Thank you Shaw for making so many amazing videos. quick question from this video.. where exactly are you making a connection between your dockerhub and AWS ECS? Is it where you mention the url of the image? what if someone has a similar image name (shawhint/yt-search-demo) or that's not possible? Sorry if its a dumb question 😐

    • @ShawhinTalebi
      @ShawhinTalebi  4 หลายเดือนก่อน

      Good question! Yes, exactly. No one will have a similar image name because the first part will be your unique DockerHub username.