How I Fine-Tuned Llama 3 for My Newsletters: A Complete Guide

  • Published on Oct 14, 2024

Comments • 16

  • @Because_Reasons
    @Because_Reasons 4 months ago

    Hmm. What are the strange characters in your training data, line breaks etc.? What format does it take, a specific JSON format? Also, is it not possible to have Llama output a formatted newsletter? Why the extra step of formatting it with Claude?

  • @draganpetrov9594
    @draganpetrov9594 5 months ago +1

    Are you happy with the results of your fine-tuned model? How would you compare it with the article done on Claude 3 (using your ontology method)?

    • @heymichaeldaigler
      @heymichaeldaigler  5 months ago

      So far, yes. Can it get better? Yes. But it gets my drafts to the 70-80% complete mark right off the rip, which has shortened the time significantly.

    • @draganpetrov9594
      @draganpetrov9594 5 months ago

      Thanks for your reply. Have you found a way to host the fine-tuned model somewhere so that you can access it via API? And maybe use it with Make/Zapier?

  • @onlineinformation5320
    @onlineinformation5320 5 months ago +2

    OK, but my dataset is private, so how will it be accessed?

  • @dami4335
    @dami4335 5 months ago +2

    Why did you fine-tune the base model with Alpaca first? Couldn't you just fine-tune the base model on your newsletters?

    • @heymichaeldaigler
      @heymichaeldaigler  5 months ago +1

      I found that fine-tuning it on my newsletters alone gave less optimal results.

    • @dami4335
      @dami4335 5 months ago

      @heymichaeldaigler Can you spot some differences between the base model and the one fine-tuned with Alpaca?
      If so, do you recommend fine-tuning first on Alpaca (or even on a dataset in another language X, like Italian for example) and then fine-tuning on our own data (in the same language X)?

  • @onlineinformation5320
    @onlineinformation5320 5 months ago +1

    I have a CSV of question-answer pairs. How should I upload it? Please answer.

    • @heymichaeldaigler
      @heymichaeldaigler  5 months ago +1

      For uploading the dataset and formatting:
      The file should be a list of JSON objects with the following keys: "Instruction", "Input", "Output".
      "Instruction": an example of what a user instruction would look like.
      "Input": an example of accompanying input for the instruction (if needed).
      "Output": an example of the format, style, etc. of the output we expect.
      ---
      Once you have your dataset file:
      1. Click "New Dataset" on your Hugging Face account.
      2. Go to "Files and versions". Select "Upload file". Upload your dataset.
      3. Copy the name of the dataset, which will usually be "your_huggingface_username/name_of_dataset".
      ---
      lmk if that helps with your issue!
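
For a CSV of question-answer pairs like the one asked about above, the list-of-JSON-objects format described in this reply could be produced with a short script. This is only a minimal sketch under a few assumptions that are not from the thread: the CSV column names `question` and `answer`, the helper names `qa_csv_to_records` and `records_to_json`, and the choice to map each question to "Instruction", each answer to "Output", and leave "Input" empty.

```python
import csv
import io
import json


def qa_csv_to_records(csv_text, question_col="question", answer_col="answer"):
    """Convert CSV text of question/answer pairs into a list of dicts.

    The column names are assumptions; change them to match your CSV header.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    records = []
    for row in reader:
        question = (row.get(question_col) or "").strip()
        answer = (row.get(answer_col) or "").strip()
        if not question or not answer:
            continue  # skip rows that are missing a question or an answer
        records.append(
            {
                "Instruction": question,  # what the user would ask
                "Input": "",              # no accompanying input for plain Q&A
                "Output": answer,         # the response we want the model to learn
            }
        )
    return records


def records_to_json(records):
    """Serialize the records as a JSON list, keeping non-ASCII text readable."""
    return json.dumps(records, ensure_ascii=False, indent=2)


# Small inline sample so the sketch runs without any external file.
sample_csv = (
    "question,answer\n"
    "What is fine-tuning?,Further training a pretrained model on new data.\n"
)
sample_records = qa_csv_to_records(sample_csv)
print(records_to_json(sample_records))
```

To use it on a real file, read the CSV into a string (for example with `open(path, encoding="utf-8").read()`), write the result of `records_to_json` to a `.json` file, and upload that file following the steps above.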

  • @GAMINGDEADCRACKER
    @GAMINGDEADCRACKER 5 months ago +1

    Can you make it run locally, without anything online? Just leave out the scraping part.

    • @heymichaeldaigler
      @heymichaeldaigler  5 months ago

      When you say "make it local", do you mean using newsletters or writing that isn't published online? If so, then you can, as long as it's formatted the right way for the dataset.

  • @jorgelobo1381
    @jorgelobo1381 5 months ago +1

    Couldn’t you achieve similar results with an agent?

    • @heymichaeldaigler
      @heymichaeldaigler  5 months ago

      Yes, you could set up a team of agents with different roles and probably get good results. I just wanted to show and explore other ways to do it.

  • @ErikLandvall
    @ErikLandvall 5 months ago

    Using Python though... kinda counterproductive. Anyone working with this will have to sit for ages just trying to solve meta questions.