Train a Small Language Model for Disease Symptoms | Step-by-Step Tutorial

แชร์
ฝัง
  • เผยแพร่เมื่อ 25 ธ.ค. 2024

ความคิดเห็น • 46

  • @Hellow_._
    @Hellow_._ 11 หลายเดือนก่อน +4

    subscribed just after watching one video. Thanks to creator. plz keep posting videos related to GenAI.

  • @zeta_meow_meow
    @zeta_meow_meow ปีที่แล้ว +2

    saving, will watch later when i am good in ml and python

  • @chuanjiang6931
    @chuanjiang6931 4 หลายเดือนก่อน +2

    outputs = model(input_ids=inputs, labels=targets), is there a reason you used the same tensor for both inputs and targets. Should targets be moved 1 place after inputs?

  • @ruksharalam173
    @ruksharalam173 9 หลายเดือนก่อน +1

    what extensions are you using for code autocomplete in colab?

  • @Rakesh-ed4nn
    @Rakesh-ed4nn 2 วันที่ผ่านมา

    can you provide the inferance notebook for using created model for inferance

  • @namashaggarwal7430
    @namashaggarwal7430 ปีที่แล้ว +4

    Awesome tutorial ❤
    Can you please make a tutorial on how to fine tune a model, especially on textual and image data?

  • @JohnFerrier
    @JohnFerrier 8 วันที่ผ่านมา

    For anyone trying to skip the noise, 6:14

  • @aneesarom
    @aneesarom 4 หลายเดือนก่อน +1

    why inputs and targets are same?

  • @user4-j1w
    @user4-j1w ปีที่แล้ว +1

    Oh wow .... Thank you for this tutorial ❤

    • @AIAnytime
      @AIAnytime  ปีที่แล้ว +2

      Glad you like it!

    • @sriharsha5701
      @sriharsha5701 ปีที่แล้ว

      @@AIAnytime i am having 35 columns if i ned to get all information based on ID how i would train please explain

  • @BatBallBites
    @BatBallBites 2 หลายเดือนก่อน

    I don't think so its transformer Architecture, which Architecture you used ?

  • @ruksharalam173
    @ruksharalam173 9 หลายเดือนก่อน

    What are the differences between encode and encode_plus?

  • @aliameen8568
    @aliameen8568 27 วันที่ผ่านมา

    how can i load my own dataset ?

  • @alroygama6166
    @alroygama6166 11 หลายเดือนก่อน

    I have a classificationstask where i have a para of text and it classifies labels of the para.
    Can i use this similar approach to tune it?
    Also will llama be a better choice?

  • @mingyukang6592
    @mingyukang6592 ปีที่แล้ว +1

    Sorry, I have a beginner question.
    1. What is the difference between using data in a data frame and using it by creating a model and loading it like now?
    2. Is the 'SmallMedLM.pt'' model created in the example simply a vector database of "QuyenAnhDE/Diseases_Symptoms" data?
    3. Or do you mean that ‘SmallMedLM.pt’ is a GPT2-based LLM fine-tuned with symptoms matching the disease?
    4. So, does this mean that the trained 'SmallMedLM.pt' can be used when creating a chatbot related to a specific disease or symptom?

  • @sriharsha5701
    @sriharsha5701 ปีที่แล้ว

    i am having 35 colum,s if i ned to get a;; information based on ID how i would train please explain

  • @subashpaudelhimself964
    @subashpaudelhimself964 7 หลายเดือนก่อน

    provide a code to push this model to hugging face too sir.

  • @Mr.Tom.handle
    @Mr.Tom.handle 6 หลายเดือนก่อน

    Need a video or instructions to integrating the model into mobile application

  • @VaibhavPatil-rx7pc
    @VaibhavPatil-rx7pc ปีที่แล้ว

    Excellent information 🎉

    • @AIAnytime
      @AIAnytime  ปีที่แล้ว

      Glad it was helpful!

  • @adityashinde436
    @adityashinde436 ปีที่แล้ว

    my usecase is that the input will be some product keyword and output should be product category it belongs to (example - input: white shirt, output: apparel). for this purpose which model will be suitable, distilgpt2 is good or do you recommend some other models in text generation section or do you recommend me to check model from other section like roberta or distilbert?

  • @sohailhosseini2266
    @sohailhosseini2266 ปีที่แล้ว +1

    Thanks for sharing!

    • @AIAnytime
      @AIAnytime  ปีที่แล้ว

      Thanks for watching!

  • @subramanianchenniappan4059
    @subramanianchenniappan4059 ปีที่แล้ว

    Please tell if you have any training on how to train and deploy a model given a dataset . I am an experienced developer want to learn machine learning and AI and related stuff😊😊

  • @jorgerios4091
    @jorgerios4091 10 หลายเดือนก่อน

    Buddies, it runs on colab but when I try it in my Windows VS code environment I get the following error: RuntimeError: PyTorch is not linked with support for mps devices. Anybody else got the same and how did you fix it?

    • @jorgerios4091
      @jorgerios4091 10 หลายเดือนก่อน

      Got it now, Gemini told me mps is just for Apple 🤡

  • @dragneel4451
    @dragneel4451 ปีที่แล้ว

    Can I train this model on movies dataset with columns Name, Synopsis, Genres and ask the model to recommendation similar kinds of movies?

    • @AIAnytime
      @AIAnytime  ปีที่แล้ว +1

      Absolutely. You must do it. Try T5 model instead distilGPT

  • @jorgerios4091
    @jorgerios4091 ปีที่แล้ว

    The steps presented for training the model are incredibly valuable, thanks for your guidance. Would there be any notable differences in the results if we were to utilize a LaMini model instead of GPT2?

    • @AIAnytime
      @AIAnytime  ปีที่แล้ว +3

      In my experiments, T5 was better .... I am working on that to create a video shortly for a similar use case.

    • @ARkhan-xw8ud
      @ARkhan-xw8ud ปีที่แล้ว

      is there any metrics to check the performance of the LLMs
      @@AIAnytime

    • @AIAnytime
      @AIAnytime  ปีที่แล้ว +2

      Yes look at my Evaluation of LLMs and RAGs video. It's detailed.

  • @aifarmerokay
    @aifarmerokay ปีที่แล้ว +1

    Convert it to ggml format so we can use it in cpu

    • @AIAnytime
      @AIAnytime  ปีที่แล้ว +1

      Won't recommend to convert this into GGUF as the model is already very small (around 380MB). Compressing it more will further degrade the performance. But if the data quality is better and other LMs like t5 and lamini are used then we can try GGUF. The current one smoothly runs on CPU tho.

    • @nothing_is_real_0000
      @nothing_is_real_0000 ปีที่แล้ว

      @@AIAnytime Agree. Any model within 1 GB is very much within "small" model range

  • @Sergatx
    @Sergatx ปีที่แล้ว

    Why didnt you do this with GPT 3.5 instead ?

    • @pjoshi_15
      @pjoshi_15 ปีที่แล้ว

      If you don't want to expose your data to open AI

    • @AIAnytime
      @AIAnytime  ปีที่แล้ว +4

      1. Gpt 3.5 in a closed source. You need to pay. 2. Data protection and privacy. 3. Inference token costs.

    • @asiddiqi123
      @asiddiqi123 10 หลายเดือนก่อน

      ​@@AIAnytime
      So you don't pay Hugging face?

  • @ABHINAYYADAV-j2h
    @ABHINAYYADAV-j2h ปีที่แล้ว

    WOW IT IS JUST A AWESOME I HAVE 36000 DOC AND WAS USING LAMINI ' YOUR VEDIO WITHOUT INTERNET BUT IT IS HOW CAN I BUILD MODEL USING THAT DOCS SO FETCHING TIME WILL BE LOW

  • @siriyakcr
    @siriyakcr 9 หลายเดือนก่อน

    Much needed

  • @nunoalexandre6408
    @nunoalexandre6408 ปีที่แล้ว

    Love it!!!!!!!!!!!!!!!!!!!!!!!

  • @sumitawasthi7068
    @sumitawasthi7068 ปีที่แล้ว

    Why did Manchester United bring a map to the game against West Ham?

  • @CooperDaniel87
    @CooperDaniel87 ปีที่แล้ว +1

    MAN U 💔