"okay, but I want Llama 3 for my specific use case" - Here's how

แชร์
ฝัง
  • เผยแพร่เมื่อ 16 พ.ค. 2024
  • If you want a personalized AI strategy to future-proof yourself and your business, join my community: www.skool.com/new-society
    Follow me on Twitter - x.com/DavidOndrej1
    Please Subscribe.
    Major credit to @engineerprompt who beautifully explained the entire Google Colab.
    Title heavily inspired by: @AIJasonZ
    My Google Colab: colab.research.google.com/dri...
    Unsloth GitHub: github.com/unslothai/unsloth?...
    Dataset: huggingface.co/datasets/yahma...

ความคิดเห็น • 101

  • @DavidOndrej
    @DavidOndrej  25 วันที่ผ่านมา +5

    If you want a personalized AI strategy to future-proof yourself and your business, join my community: www.skool.com/new-society

    • @ThisIsSimonBorec
      @ThisIsSimonBorec 25 วันที่ผ่านมา

      I highly recommend it, the community is fabulous!

  • @vishalsaichindepalli2798
    @vishalsaichindepalli2798 25 วันที่ผ่านมา +111

    It would be great if you could make a video on how to create datasets for fine tuning using LLM's/Agents!

    • @d.d.z.
      @d.d.z. 2 วันที่ผ่านมา +1

      It can be so helpful!

  • @laimi7
    @laimi7 22 วันที่ผ่านมา +2

    Thank you for this video. The topic of fine-tuning was very interesting to me.

  • @thiago.nobrega
    @thiago.nobrega 25 วันที่ผ่านมา

    Keep up the amazing work bro.
    You provide us valuable knowledge.

  • @christopheboucher127
    @christopheboucher127 25 วันที่ผ่านมา +28

    yes dataset made by agents ! Thx for all your content !

    • @Matthew-tg4uk
      @Matthew-tg4uk 5 วันที่ผ่านมา

      vicious circle.
      give llm a little data and say simulate.
      llm uses trained data to simulate.
      user takes simulated data and does the same in another llm.
      very little new data has been added to the system.

  • @AlphaCrucis
    @AlphaCrucis 25 วันที่ผ่านมา +8

    This is the kind of content that I've been wanting to see that I haven't been able to find in an easily digestible form.

  • @flavorbot
    @flavorbot 24 วันที่ผ่านมา

    love the videos thanks a lot for taking the time to put them out

  • @NB-qq8wo
    @NB-qq8wo 25 วันที่ผ่านมา

    LOVE these empowering videos, thanks for sharing 🙏

  • @alvinjamur1
    @alvinjamur1 25 วันที่ผ่านมา +3

    why are some here trashing david ondrej? he is imparting knowledge in an easy to understand way for peeps that do not know. i wrote my first neural net from scratch in 1993 and i have been an ML practitioner since then. i can tell u that info back then was hard to come by. be grateful that u have easy access to it. if u dont like it better to move along rather than disperse caustic.

  • @sethjchandler
    @sethjchandler 25 วันที่ผ่านมา +8

    I have seen a lot of videos on fine-tuning and read a lot, and I have to say this is one of the most lucid, explanations. By making it very concrete and showing the code and, importantly, the training data you make very clear what is going on in fine tuning great job!

    • @gonzalodijoux5953
      @gonzalodijoux5953 8 วันที่ผ่านมา

      hello, do you know if it's possible to fine tune with ebook pdf on a specific domain (financial, medical...) ?

  • @dennisking8281
    @dennisking8281 22 วันที่ผ่านมา +3

    Yes, please make a video on how to create the datasets for fine tuning AI - and Thanks for all you do.

  • @jonathanholmes9219
    @jonathanholmes9219 20 วันที่ผ่านมา +5

    Yes please. Team of Agents to create a fine tuning data set from your proprietary data.

  • @Chris-zc9bp
    @Chris-zc9bp 25 วันที่ผ่านมา

    TY I finally trained my first model. Here's another vote for the how to create the fine tuning using LLM agents.

  • @DrKnowitallKnows
    @DrKnowitallKnows 25 วันที่ผ่านมา +6

    Hey I could be wrong but I believe you'll have better luck fine tuning with a less quantized version of the model. At least 8 or 16bit would be preferable to 4. I'm not an expert on quantized models, but you lose a lot of resolution when you quantize that much and that likely makes it more challenging for the LoRA to train. Definitely correct me if I'm wrong, folks, but I think this is the case.

    • @AlphaCrucis
      @AlphaCrucis 25 วันที่ผ่านมา

      Nice to see you here!

  • @VaibhavPatil-rx7pc
    @VaibhavPatil-rx7pc 25 วันที่ผ่านมา

    Excellent information ❤

  • @kamipls6790
    @kamipls6790 25 วันที่ผ่านมา +4

    Hey Ondrej!
    I think this might be a stretch of the topic, but is it possible to use an llm like llama 3 and fine tune it to respond in another language or would it be necessary to train an llm from scratch for this?

    • @icesteel5855
      @icesteel5855 25 วันที่ผ่านมา

      As I am , i need to know this

  • @jayhu6075
    @jayhu6075 25 วันที่ผ่านมา

    This is a very useful topic, in the future we can train our datasets to specifically use them for different applications, particularly in healthcare or other institutions, benefiting people.
    Hopefully, a next topic will be about how to create your own datasets. Thanks for the explanation

  • @kylearnold9647
    @kylearnold9647 25 วันที่ผ่านมา

    You're putting out some great content

  • @jees__antony
    @jees__antony 9 วันที่ผ่านมา

    Great work... Thanks for the tutorial ❤❤❤

  • @joseeduardobolisfortes
    @joseeduardobolisfortes 14 วันที่ผ่านมา

    This video is exactly what I was looking for. Thank you. Now, I wish to know which hardware configuration I will need to install and use Llama 3 models locally in my own machine. Can you help me?

  • @darkesco
    @darkesco 25 วันที่ผ่านมา

    Very useful information!

  • @DrumAndSpaces
    @DrumAndSpaces 25 วันที่ผ่านมา

    perfect timing i was just thinking about having multiple llama 3 versions fine tuned for specific coding projects instead of a broad coding language base. is this just a waste of time and im better off having a general coding version instead? i was considering having a few fine tuned models to imitate a development team with crew.

  • @zeynelacikgoez
    @zeynelacikgoez 25 วันที่ผ่านมา +1

    It would be an interesting topic for a video on how to use agents to generate data for fine-tuning.

  • @Jonathan-et4df
    @Jonathan-et4df 22 วันที่ผ่านมา +2

    please make a video on how to create datasets!

  • @user-ef4df8xp8p
    @user-ef4df8xp8p 25 วันที่ผ่านมา

    Thank you...

  • @user-fe4qc7ot5d
    @user-fe4qc7ot5d 18 วันที่ผ่านมา +1

    Where is fine tuning models stored and how can I find and download it for use?

  • @SpicyMelonYT
    @SpicyMelonYT 8 วันที่ผ่านมา

    Is the trained model able to be used with "ollama run trained_model_name"? Do I have to download it directly and put it some where for that to work? I currently have a python program setup that uses the ollama module and runs llama3. But I would like to use a fine tuned model instead as I am trying to make a Jarvis like personal assistant!

  • @carnageasada5350
    @carnageasada5350 25 วันที่ผ่านมา +1

    Please do make a video on creating datasets, both with and without the use of agents!

  • @glorixx5974
    @glorixx5974 6 วันที่ผ่านมา

    Great vid, it would be really awesome if you could make a video on how to make data sets for fine tuning! That would help a lot

  • @UnSingeEnivre
    @UnSingeEnivre 25 วันที่ผ่านมา +1

    I would love to see a dataset fine tuning tutorial!

  • @PaulFishwick
    @PaulFishwick 14 วันที่ผ่านมา

    This seems like a lot of work in forming the data prep rather than the RAG approach (eg. custom GPTs) where you embed N documents to “fine tune”. Thoughts on each approach?

  • @humanbeingmusic
    @humanbeingmusic 23 วันที่ผ่านมา

    can you offer any advise about importing the ggufs into ollama, mine just spit out gibberish, I presume it has something to do with the modelcard but no idea

  • @Will_669
    @Will_669 8 วันที่ผ่านมา

    what's the dataset like if train for conversations? for example: in a conversation, we have one instruction, multi inputs, and multi outputs

  • @kamruzzamanuzzal3764
    @kamruzzamanuzzal3764 25 วันที่ผ่านมา

    any way to input images as well to fine tune on image and text at the same time?

  • @zippytechnologies
    @zippytechnologies 23 วันที่ผ่านมา

    If there a way to generate the dataset input output data from contextual data like emails and q&a from website forums?

  • @Balajik7-qh1pq
    @Balajik7-qh1pq 7 วันที่ผ่านมา

    awesome David

  • @gnoppixlinux
    @gnoppixlinux 25 วันที่ผ่านมา

    love the 3 primary colors at 10:12 :)

  • @nasiksami2351
    @nasiksami2351 18 วันที่ผ่านมา

    Hey David, great video and great explanation. Please make a tutorial on how to generate dataset using LLM. For my use case, I have a classification problem and the class imbalance is severe. for the minority classes, I want to generate more meaningful samples using LLM and then build an LLM model to do text classification on the dataset. Any suggestion on achieving this would be great!

  • @gileneusz
    @gileneusz 25 วันที่ผ่านมา

    10:29 that would be great tbh, using agents to make dataset to finetune the model is just like inception, you can also make agents to prepare dataset for other agents to create dataset to finetune the model (inception level 2) or make agents to prepare dataset for agents to prepare dataset for finetuning the model which will be used to prepare dataset for agents to prepare dataset............

  • @yongxing1848
    @yongxing1848 21 วันที่ผ่านมา

    when are you going to make datasets for fine-tuning, I have currently data in mysql that I need to extract and create the datasets for fine-tuning llama.

  • @user-xk6rg7nh8y
    @user-xk6rg7nh8y 10 วันที่ผ่านมา

    Thanks alot !!! it is really helpful :)

  • @shaigrustamov5115
    @shaigrustamov5115 11 วันที่ผ่านมา

    it's a good video, thanks. But there are a lot of videos about fine tuning. It would be perfect if you would create a video on how to create own data sets for fine tuning. 👍

  • @siema32
    @siema32 8 วันที่ผ่านมา

    Actually GPT-4 can be fine-tuned by the user, it's done within the openai API and of course used by it's API later on.
    It obviously has downsides, like the model is still invoked on the OpenAI servers and they are collecting all the data which goes through it (no privacy), but it is possible :)

  • @richierosewall3035
    @richierosewall3035 21 วันที่ผ่านมา +1

    Hey what about phi-3..?

  • @strategy419
    @strategy419 9 วันที่ผ่านมา

    did you try finetuning gpt3.5 on the playground?

  • @sourabhiitian
    @sourabhiitian 13 วันที่ผ่านมา

    hi i have a question, what if i want to use my dataset json file into the cell instead of huggingface alpaca json. Can you give the part of the input code

  • @jackderrida
    @jackderrida 25 วันที่ผ่านมา +1

    3:37 He is 100% correct that already fine-tuned LLMs like GPT, Claude, and even Gemini 1.5 Pro with 1m+ context, are freaking awful at trying to emulate writing styles.
    Worst part about ChatGPT for this purpose is that no matter how much you tell it not to, it's filled with clauses like "On the other hand,", "Finally, ", or "As a consequence" and I'll explain to it again all the reasons those phrases don't belong in a rap song.

  • @RemekKinas
    @RemekKinas 25 วันที่ผ่านมา

    I am looking for tutorial how to generate dataset using Agents. There is no such tutorial (or I am not able to find it). It would be great to generate chat format (conversation) dataset as a response of task. So as an input you have list of task, question and then agents generate conversation to this topic.

  • @trueindian03
    @trueindian03 21 วันที่ผ่านมา

    How to train a data set which is not in the form of instructions, input, output format, lets say I want to train the model using the data from a pdf, or any other means, how can we do that, please suggest some ideas. Thanks in advance.

  • @belu6914
    @belu6914 10 วันที่ผ่านมา

    Did anyone get the example running? The copied notbook results in an error when starting the training. I already fixed the missing comma and set the max_steps to 60.

  • @josephtilly258
    @josephtilly258 25 วันที่ผ่านมา

    Are local LLM really that local or just free ? Because I'm not really running it on my computer, more of a cloud base free and flexible llm ?

    • @theobgshow
      @theobgshow 21 วันที่ผ่านมา

      Yea they are. You can run ollama on your computer then pull down a model, such as llama3, Mistral or Dolphin and run everything, completely locally

  • @wetcel1236
    @wetcel1236 25 วันที่ผ่านมา

    Hey David, thanx for this awesome served topic! Exactly what I need to get through this week 😅

  • @christiansroy
    @christiansroy 14 วันที่ผ่านมา

    You can definitely fine-tune ChatGPT 3.5 and you can also ask open AI to invite you to their private waitlist to be able to fine tune GPT 4. So it is definitely possible.

  • @adilzahir9921
    @adilzahir9921 25 วันที่ผ่านมา

    I want to use that for my work , i want to use it to find the best strategy for debts recoverts and to choose the debtors who will pay mostly and who don't ,how i should proceed ? Thanks

    • @ASchnacky
      @ASchnacky 25 วันที่ผ่านมา

      I had same idea

    • @adilzahir9921
      @adilzahir9921 25 วันที่ผ่านมา

      @@ASchnacky y'a that would be great if we can do that without coding ,good luck

  • @tekipeps
    @tekipeps 21 วันที่ผ่านมา

    Nice, how to deploy the saved model?

  • @AlejandroCastillo9
    @AlejandroCastillo9 25 วันที่ผ่านมา

    I want to Create a Lama 3 legal Assistent. I would be happy in you can Show a data prep example

  • @GreenStorm01
    @GreenStorm01 22 วันที่ผ่านมา

    How about Fine-Tuning vs. RAG in those specific things?

  • @stanisd
    @stanisd 19 วันที่ผ่านมา

    open AI has its own API for fine-tuning

  • @andrelvcoelho
    @andrelvcoelho 20 วันที่ผ่านมา

    Yeah, it would be nice if you could set up a video showing how to automatically generate datasets for fine-tuning LLMs… Tks

  • @nimesh.akalanka
    @nimesh.akalanka 22 วันที่ผ่านมา

    Is there any free method to fine tune an large language model locall. I have a small workstation with 128GB DDR4 memory, Nvidia RTX A1000 X2 SLI VGA, AMD Threadripper process. I tried AutoTune-Advanced and LLaMA-Factory. They both failed on me. Autotrain say I dont have enough VRAM. LLaMA-Factory say I dont have CUDA. Please help me.

  • @agenticmark
    @agenticmark 4 ชั่วโมงที่ผ่านมา

    your right headphone will be over your eye soon ;D Thanks for the great content David!

  • @brianmorin5547
    @brianmorin5547 11 วันที่ผ่านมา

    If pushing to hugging face, no config.json so won’t work

  • @ScROnZjara
    @ScROnZjara 25 วันที่ผ่านมา

    More content with lama!!! 🙏❤️

  • @thedatascientist-lg4ls
    @thedatascientist-lg4ls 17 วันที่ผ่านมา

    Yeah, a video on agents for finetuning datasets with a fine tuned LLMs, and used by agents for a real world application.

  • @alma4355
    @alma4355 25 วันที่ผ่านมา +1

    I'm a subscriber, please make the making dataset video

  • @harristengku7153
    @harristengku7153 25 วันที่ผ่านมา +1

    Oh wow you managed to fix the fine-tuning issue? Its been a headache for the entire open source rn, because Llama 3 trained their models differently so every fine tune would end up way worse than the original base model.

    • @DavidOndrej
      @DavidOndrej  25 วันที่ผ่านมา +4

      If you watch the video, you will see that I openly admit that I am not an expert when it comes to fine-tuning. In fact, making this video definitely was outside of my comfort zone.

    • @harristengku7153
      @harristengku7153 25 วันที่ผ่านมา

      @@DavidOndrejay respect man. In fact I think you should include more fine tuning to your videos in the future.
      You can’t run away from fine tuning if you want A.I to move to commercial use.
      Llama 3 is probably the only exception in the industry rn that has everyone stumped

  • @eldinmujovic8705
    @eldinmujovic8705 25 วันที่ผ่านมา

    Can I do this in any language?

  • @CodingScot
    @CodingScot 25 วันที่ผ่านมา

    Do you ever sleep? Wow this is amazing 🎉👏

  • @brutely9718
    @brutely9718 10 วันที่ผ่านมา

    NotImplementedError: No operator found for `memory_efficient_attention_forward` with inputs:

  • @samfisher92sc
    @samfisher92sc 25 วันที่ผ่านมา

    Please bro make a video for create datasets

  • @Corteum
    @Corteum 25 วันที่ผ่านมา

    What's an example of what you or someone else has created with this?

  • @zoranProCode
    @zoranProCode 25 วันที่ผ่านมา

    Lava FRI?

  • @kingKai2022
    @kingKai2022 7 วันที่ผ่านมา

    colab doesnt work....

  • @jackgaleras
    @jackgaleras 21 วันที่ผ่านมา

    Fine tune or RAG

  • @MrAtomUniverse
    @MrAtomUniverse 11 วันที่ผ่านมา

    its no longer april T.T

  • @pythonholic
    @pythonholic 25 วันที่ผ่านมา

    I really don't see the benefit in using an AI agent. I've tried to understand its purpose, but it seems like another way of avoiding using GPT and similar models. Can you give us a real example? Perhaps even instances of freelance use?

    • @HakaiKaien
      @HakaiKaien 25 วันที่ผ่านมา +1

      AI agents are a bit different from chat bots. With chat bots, you have a large language model responding to your prompt. With agents, you have a bunch of models talking among themselves to accomplish a task you give them. You can think about Agents as a company of employees. You give each of them roles and functions.
      You can use agents to build an application or a game for example.

  • @RedShipsofSpainAgain
    @RedShipsofSpainAgain 22 วันที่ผ่านมา +1

    This guy's community is $77/month. There's 510 members. $77/month * 12 months = $924/yr. $924/yr * 510 members = $471,240/yr. So this guy's subscription is grossing nearly half a million USD annually.

    • @DavidOndrej
      @DavidOndrej  22 วันที่ผ่านมา

      I wish… not everyone joined at 77

  • @user-jc6tj2xt1p
    @user-jc6tj2xt1p 23 วันที่ผ่านมา

    Wanna be yer cmdmp 😊

  • @matthewm8289
    @matthewm8289 25 วันที่ผ่านมา +8

    Its not Apache 2.0 licence, so you are very limited what you can do. Its not opensource!

    • @braadress
      @braadress 25 วันที่ผ่านมา +9

      Yes, it's opensource. Llama 3 is released under the CreativeML Open RAIL-M license. This license allows for broad use, including commercial use, while imposing certain restrictions aimed at ensuring responsible usage and maintaining safety.

  • @b6234
    @b6234 15 วันที่ผ่านมา

    I stopped at "10x better" I will make my life 100 time better by not watching

  • @my-financial-wealthblog4423
    @my-financial-wealthblog4423 12 วันที่ผ่านมา

    I watched your video. Understood nothing.

  • @dennisdemers9880
    @dennisdemers9880 25 วันที่ผ่านมา +1

    I joined the community but I don't know how to access it. Or when are the weekly meetings.? Trying to generate a python program as it turns out it's getting more and more sophisticated. Just llama three better at it than Chad gbt4

    • @DavidOndrej
      @DavidOndrej  25 วันที่ผ่านมา

      You can access it with the same link. The weekly meetings are on Tuesday and Saturday - more details in the "Calendar" tab at the top
      www.skool.com/new-society/calendar

  • @ramezdemitry3249
    @ramezdemitry3249 23 วันที่ผ่านมา

    NameError Traceback (most recent call last)
    in ()
    11 {}"""
    12 get_ipython().system('pip install tokenizer')
    ---> 13 EOS_TOKEN = tokenizer.eos_token # do not forget this part!
    14 def formatting_prompts_func(examples):
    15 instructions = examples["instruction"]
    NameError: name 'tokenizer' is not defined
    what shall i do here?

  • @Sunshine-vd3bf
    @Sunshine-vd3bf 21 วันที่ผ่านมา

    It would be great if you could make a video on how to create datasets for fine tuning using LLM's/Agents!