Design Your Own Ollama Model Now!

แชร์
ฝัง
  • เผยแพร่เมื่อ 5 ต.ค. 2024

ความคิดเห็น • 38

  • @wilson_joe
    @wilson_joe 4 หลายเดือนก่อน +3

    I really appreciate you videos, you have a simple, understandable , friendly approach to teaching which keeps me coming back for more.

  • @user-wr4yl7tx3w
    @user-wr4yl7tx3w 5 หลายเดือนก่อน +10

    may be a dummy guide would be also helpful. it's a bit advance, the content, though very useful.

  • @tharun2003
    @tharun2003 3 หลายเดือนก่อน

    You saved my day. Thank you Matt.

  • @userou-ig1ze
    @userou-ig1ze 5 หลายเดือนก่อน

    Please keep doing what you're doing, at this point I would guess job offers from all over the world pour in. Thanks for your continuous videos! I went through this with the meditron model, that I suspect is still not fully correct in prompt format, but couldn't fix, maybe with this video I will be more successful.
    Ps: Let us know in case you sell merch :)

    • @technovangelist
      @technovangelist  5 หลายเดือนก่อน

      No merch but the is a patreon at patreon.com/technovangelist and a newsletter at technovangelist.com/newsletter

  • @tecnopadre
    @tecnopadre 5 หลายเดือนก่อน +2

    I love your endings

  • @xspydazx
    @xspydazx 3 หลายเดือนก่อน

    very good !!! qell understood....
    (quick advice)...
    Tempreture is related to the training also (as things which were not trained deeply will need higher tempreture ... and things deeply embedded will be ok with lowest tempreture: how do people train thier odels and what are thier acceptable levels? as some are .=0.5 and under whilst other dont care and let the model complete an epoch on large dataset and assume the data took .. as long as thier final output was preferable : when in fact all the data which did not go in at the loss below .0.5 did not take and is not retrivable perhaps its there ephemeallly ... as it is like a pretraining ... its just used for next word prediction... but we are doing tasks ! which is whole sequece prediction/recall so when we train for a task we expect the whole of the data set to be fit in range .... so low temptrture 1 should be acceptable losses ...
    Some say tha this effect the soft max of possiblisty chosen byu the topk sample as well as the topP percentage of cutt of... but this is when there are many sample chosen... but this also depicts the values that were trained at thatr rate of loss .... so it will be collecting sample from the level under the temptrture rate of 1 ( a lot ) so this will need constraining with topP (selecting the highest of probablitys ... but the softmax will also spread them alowing for more random also, when the model has been over trained.) ..
    so an over trained model can be loosened by raising the temptretue and a wild model tamed !
    lol...

    • @xspydazx
      @xspydazx 3 หลายเดือนก่อน

      i would like to see a vdieo on publishing a modl really !

  • @wardehaj
    @wardehaj 5 หลายเดือนก่อน

    Great video, very usefull! I have a request for you: please make a video about making an ollama model of dbrx and/or grok 1.5 vision models

  • @eyeseethru
    @eyeseethru 5 หลายเดือนก่อน +1

    Thanks for all the helpful videos on Ollama! I've since located the answers, but these are a few questions I was always left with whenever I saw mention of making a model file. Asking so it may help other new Ollama users: What kind of file is it? What program should be used to create it? Is it saved in a specific file format or location?

    • @technovangelist
      @technovangelist  5 หลายเดือนก่อน

      I created it in vscode. It’s just a text file like everything else in a code editor. And put it anywhere you like. Once you run ollama create, blobs and manifests are generated in a specific place.

  • @francescobassignana4211
    @francescobassignana4211 4 หลายเดือนก่อน +1

    Hi! Thanks for the video. I have a question about using the Ollama model with LangChain: When I run the .invoke method with a simple prompt, does the Ollama library automatically insert the prompt into the pre-configured template in the model file, or do I need to manually include it in the LangChain prompt template?

  • @atrocitus777
    @atrocitus777 5 หลายเดือนก่อน

    i see that you can use your own docker registry with ollama as a way of hosting model files. would love to see a video on this for users running ollama on closed networks.

    • @technovangelist
      @technovangelist  5 หลายเดือนก่อน

      It’s not actually the same as the docker registry. It was written by the same person that created the docker registry though.

    • @technovangelist
      @technovangelist  5 หลายเดือนก่อน +1

      It had to be modified because layers in a docker image are tiny whereas models are huge.

  • @mpesakapoeta
    @mpesakapoeta หลายเดือนก่อน +1

    Any tutorial on model creating from custom data,like pdf s? Like for companies?

    • @AyushSharma-qd1lq
      @AyushSharma-qd1lq หลายเดือนก่อน

      yes please ive been looking for this. if you find anything please share, any help is appreciated

  • @twinnie38
    @twinnie38 4 หลายเดือนก่อน

    So helpful, so interesting, thanks 👍After generating my model, I notice that I have to specify the number of layers to use even though my GPU has enough memory in Ollama (--n-gpu-layers). If I use fewer layers, what does this mean in practice?

  • @rude_people_die_young
    @rude_people_die_young 5 หลายเดือนก่อน

    Another great one - identifying several pitfalls

  • @Hemanthkumar-zz6fb
    @Hemanthkumar-zz6fb หลายเดือนก่อน +1

    Can I train model via chatting with the model? how to do it

  • @explorer945
    @explorer945 5 หลายเดือนก่อน

    Thank you for the short and sweet video. How do you get so much good audio quality on your videos? step 0: have a great voice. What is step1 (gear, setup in OBS/plugins) :?

    • @technovangelist
      @technovangelist  5 หลายเดือนก่อน +3

      I think I need a video on it. I don’t use obs though.

    • @explorer945
      @explorer945 5 หลายเดือนก่อน

      @@technovangelist yes, video please. You could add affiliate links to the gear as well. Really loved the base in audio

    • @technovangelist
      @technovangelist  5 หลายเดือนก่อน +1

      th-cam.com/video/LQe3DFjMYrE/w-d-xo.htmlsi=R4u3h6yPtbUaHeDh

  • @KhanaKhala1
    @KhanaKhala1 5 หลายเดือนก่อน

    Extremely useful but what if there is no template in the readme?

    • @technovangelist
      @technovangelist  5 หลายเดือนก่อน

      Then look in that file I showed. And if not there then look how the model was trained or fine tuned

  • @AliAlias
    @AliAlias 5 หลายเดือนก่อน

    Thanks ❤
    Very helpful 😊

  • @UTubeGuyJK
    @UTubeGuyJK 5 หลายเดือนก่อน

    My coworker and I set up a windows machine to run ollama. It works great but occasionally seems to crash. Could it be the keep_alive setting? If I want others to be able to hit it via the api, should I set the keep_alive to “forever”? (I don’t remember the flag for that off the top of my head). Thanks for your work on Ollama!

    • @technovangelist
      @technovangelist  5 หลายเดือนก่อน

      In most cases you shouldn’t need to worry about keep alive.

  •  5 หลายเดือนก่อน

    Excellent thank you!

  • @Cloud_Dude
    @Cloud_Dude 5 หลายเดือนก่อน

    there is folder based on the date of this video . do you have a gist containing content of the template per model ?

    • @technovangelist
      @technovangelist  5 หลายเดือนก่อน

      No. It was just a few lines that you can grab from the same sources I did so didn’t bother with it

  • @sanjaybhatikar
    @sanjaybhatikar 2 หลายเดือนก่อน

    I fine-tuned a HuggingFace embedding model locally and got a set of files with safetensors on disk. Tried to convert to GGUF with llama.cpp but it fails. Ollama requires the model as GGUF. Any suggestions for how to integrate the fine-tuned model in Ollama?😢

    • @technovangelist
      @technovangelist  2 หลายเดือนก่อน

      I’m not sure. Your best bet is to ask on the ollama discord. Discord.gg/ollama

  • @SonGoku-pc7jl
    @SonGoku-pc7jl 5 หลายเดือนก่อน

    thanks!!!

  • @florentflote
    @florentflote 5 หลายเดือนก่อน