LoRA - Explained!

แชร์
ฝัง
  • เผยแพร่เมื่อ 20 ม.ค. 2025

ความคิดเห็น • 21

  • @Mohamed_Shokry
    @Mohamed_Shokry 3 หลายเดือนก่อน +5

    Your explanations are easy to understand and in-depth at the same time. Thank you for making my life easier.

  • @JorgeZentrik
    @JorgeZentrik หลายเดือนก่อน

    I don't understand why you don't have much more views and engagement. Your videos are some of the best explanations out there. I've sent my students to your channel multiple times.
    Not a great timeline where virality reins over veracity. Amazing work.

    • @CodeEmporium
      @CodeEmporium  หลายเดือนก่อน

      Thanks! This means a lot. I am just glad the channel is able to provide value. So thanks for sharing this around

  • @harshsharma5768
    @harshsharma5768 3 หลายเดือนก่อน +3

    Awesome explaination! I have few questions though:
    1) At 24:00, you said we can do some matrix multiplication and addition to update the value of Wq so that the fine tuned information gets kinda infused in Wq which inturn allowed us to have faster inference time, but won't that hurt the performance in comparision to the case where we don't update Wq and keep A and B? Are we just trading performance for inference speed?
    2) what if we do the same 'update Wq' part with additive adapters? That will also speed up their inference time?

  • @KhushPatel-x2n
    @KhushPatel-x2n 2 หลายเดือนก่อน +3

    In finetuning of LLM we have 2 options.
    1) change the parameter of actual Base model. But this require High resource and time.
    2) Add new layers and change the architecture of the model. In finetuning only change the weight of this additional layer and Base model remain frozen. In inferencing we use both Base model and this additional layer.
    LoRA helps us in reducing this additional layer by using Low Rank Matrices.
    This is my knowledge. I want to please react on it So I can Verify my knowledge!😊

    • @CodeEmporium
      @CodeEmporium  2 หลายเดือนก่อน +1

      This is a good overview 👍

  • @shisoy4809
    @shisoy4809 3 หลายเดือนก่อน

    I like simple methods yet extremely effective

  • @sam198689
    @sam198689 หลายเดือนก่อน +1

    I understand how LoRA speeds up the fine-tuning, but you mentioned in the video that it also speeds up the inference. Could you please explain how is that possible?

  • @Ishaheennabi
    @Ishaheennabi 3 หลายเดือนก่อน

    Back again ❤❤❤

  • @canygard
    @canygard 3 หลายเดือนก่อน

    Custom GPTs or Gemini Gems are pretty spot on after you get good at making them. I would play around with these before building an AI agent with LangChain and vector embeddings.

  • @minasefikadu
    @minasefikadu 2 หลายเดือนก่อน

    I enjoyed this video. Can you do QLoRA next?

  • @pauljones9150
    @pauljones9150 3 หลายเดือนก่อน

    When did you explain benefits of loras over adapters?
    I seem to have missed it

  • @isaiahcastillo898
    @isaiahcastillo898 3 หลายเดือนก่อน

    Appreciate it!

  • @programming-short-videos
    @programming-short-videos 3 หลายเดือนก่อน

    Amazing, thank you. Can u do one for latent diffusion

  • @pauljones9150
    @pauljones9150 3 หลายเดือนก่อน

    Cursor with claude 3.5 or o1 mini is great. Use their shortcuts to save time. Still struggles with new languages and frameworks though

  • @pauljones9150
    @pauljones9150 3 หลายเดือนก่อน +1

    The quizzes aren't well connected to the content. Heck if you could add a timestamp after each quiz of "if you got this wrong, check out this timestamp" that would be helpful

  • @isaiahcastillo898
    @isaiahcastillo898 3 หลายเดือนก่อน +2

    LoRAs are the biggest thing to come out of AI since the transformer