How to run Mistral LLM locally on iPhone or iPad

แชร์
ฝัง
  • เผยแพร่เมื่อ 12 ธ.ค. 2023
  • Based off the following tutorial: / using-llms-locally-ipa...
    LLM Farm: llmfarm.site/
    Mistral: huggingface.co/TheBloke/Mistr...
    Description:
    📚🔍🌐 Join us as we explore the cutting-edge world of AI with a step-by-step tutorial on installing a ChatGPT-like large language model (LLM) locally on your Apple device. Based on the insights of Maciek Jędrzejczyk, Senior Cloud Infrastructure Architect at Amazon Web Services (AWS), this guide is tailored for anyone looking to run an LLM on their iPad Pro Gen or iPhone.
    🚀 In This Detailed Walkthrough, You'll Learn:
    Preparation: The prerequisites for installing an LLM on your Apple device, including required RAM and storage space.
    Installing TestFlight and LLMFarm: A step-by-step guide to using TestFlight to install LLMFarm, an open-source client supporting Apple Silicon, on your device.
    Downloading and Setting Up the Mistral-7B Model: How to download the Mistral-7B model from huggingface.co, and set it up using LLMFarm.
    Configuring Chat Settings: Detailed instructions on setting up your chat context window, including prompt formatting and resource management settings.
    Testing and Using Your LLM: Finally, testing your setup to interact with the model and check the accuracy of results.
    ✨ Why This Tutorial Is Important: This video is perfect for privacy-conscious individuals, AI enthusiasts, and tech-savvy users who want to leverage the power of LLMs on their portable Apple devices.
    🔔 Stay Updated with AI Innovations: Subscribe and hit the bell icon to keep up with the latest tutorials and insights into AI applications and data privacy.
    💬 Your Experience and Questions: Have you tried setting up an LLM on your device? Share your experiences or any questions you have in the comments - let's discuss the exciting world of local AI models!
    📲 Connect with Us for More Tech Insights: Check out the links in the description for more information and updates. Follow us on LinkedIn and other social platforms for regular content on AI, technology, and privacy.
    #ChatGPT #LLM #Apple #iPhone #AI #mistral

ความคิดเห็น • 42

  • @junedkhatri31
    @junedkhatri31 6 หลายเดือนก่อน +18

    The fact that LLM can be run locally is already fascinating. Excited for 2024.

    • @kylebehrend
      @kylebehrend  6 หลายเดือนก่อน +1

      Yeah will be exciting to see what Gemini is able to do being designed for mobile

  • @timl2k11
    @timl2k11 วันที่ผ่านมา

    This is amazing! I got it working on an iPhone 14 with only 6GB RAM. It works great, but is very slow (3 tokens a second) and puts my phone on its knees! I wouldn’t use it day to day but it’s a wonderful profit of concept and performs amazingly well. Even Gemini makes similar mistakes. I didn’t turn on mloc as advised by someone else in another comment.

  • @paulkiragu8120
    @paulkiragu8120 6 หลายเดือนก่อน +2

    Amazing! I’m running like 5 models on my iPhone, I love it! Especially for small talk

    • @kylebehrend
      @kylebehrend  6 หลายเดือนก่อน +1

      Oh wow!! What other models have you tried?

    • @paulkiragu8120
      @paulkiragu8120 6 หลายเดือนก่อน

      @@kylebehrend I have llama chat 7b, orca 3b and dolphin mistral 7b. All q4

    • @sms7048
      @sms7048 5 หลายเดือนก่อน

      @@paulkiragu8120what’s your favorite?

  • @mickeyeng
    @mickeyeng 6 หลายเดือนก่อน +5

    Great video. Mistral just launched v2 model 🎉

    • @kylebehrend
      @kylebehrend  6 หลายเดือนก่อน +1

      Nice 👍

  • @andreahenechesierra
    @andreahenechesierra 2 หลายเดือนก่อน

    Hi Kyle! Thank you for your video, I was trying to run this on my iphone 13 but using microsoft/Phi-3-mini-4k-instruct-gguf and whenever I try to run it, my phone completely crashes. Any ideas on why this is and how can I solve it? Thanks again!

  • @alexandersoldatkin6953
    @alexandersoldatkin6953 3 หลายเดือนก่อน

    Hi, thanks for the video! Would you mind sharing what kind of screen recording/editing software you use? I’ve seen it used I a few other tutorials and am quite impressed with the style and quality.

    • @kylebehrend
      @kylebehrend  2 หลายเดือนก่อน

      Sure thing ScreenStudio :)

  • @AndreAmorim-AA
    @AndreAmorim-AA 6 หลายเดือนก่อน +3

    That's a great tip; thanks for your video. By the way, I wonder when Apple is going to release their own LLM, and since Apple controls their own silicon, I guess they will put some emphasis on LLMs that take advantage of Apple's 'Neural Engine.' I noticed that in features like iOS 17, it clones your voice. Personal Voice is trained locally on your device.

    • @kylebehrend
      @kylebehrend  6 หลายเดือนก่อน +1

      Apparently iPhone 16 will have AI capabilities and no doubt 2024 will be huge

  • @martinspedding4210
    @martinspedding4210 หลายเดือนก่อน

    Can it be run on a modern smartphone or just on ios?

    • @kylebehrend
      @kylebehrend  หลายเดือนก่อน

      I think iOS but new phones will have local LLMs too

  • @Stoniiann
    @Stoniiann 6 หลายเดือนก่อน

    Followed to the letter and I just get a bunch of gobble about derivatives and mathematical jumbo :(

    • @kylebehrend
      @kylebehrend  6 หลายเดือนก่อน

      Oh no :( You may need to force shut the app and try again. It's quite experimental at this stage :)

  • @vithaii
    @vithaii 6 หลายเดือนก่อน +1

    I’m IOS17, everytime i done this (i tried remove and do it again many times) and when i chat it’s look like my device not enough Ram, very lag and I can’t do anything even when my device turn off screen I can’t touch it, my device is hot, sometime it’s reboot,...

    • @keeab2165
      @keeab2165 6 หลายเดือนก่อน +1

      Mine does the same thing, I didn’t care to look to fix it, but I did notice when looking through downloads I had a setting that made me download the model to my iCloud and not my actual device storage this might possibly be the problem

    • @kylebehrend
      @kylebehrend  6 หลายเดือนก่อน

      What device you on? iPad is better than iPhone unless it’s high spec

    • @pjth3g0dx
      @pjth3g0dx 5 หลายเดือนก่อน

      For Minstral 7B settings template: Llama2 chat 7B new .. inference turn on Llama and turn of mlock all will work well. Also in system setting stop the app from running in the background and it will stop the lagging

  • @nathanielbrown1056
    @nathanielbrown1056 2 หลายเดือนก่อน

    for the life of me I cannot get this to work on my 9th gen ipad! I have followed the directions step by satep several times even using the original article which is a different model then the video. anyways each time i get model load error eval [error] load model, or load model error: [done] its so frustrating! please redo this with the new ipad LLM farm interface and a working model or teach us how to download a model that is available in the downloads and get it to work.

    • @kylebehrend
      @kylebehrend  2 หลายเดือนก่อน +1

      Probably better to wait a little, I'm sure there is a better way of doing this now. I'll take another look

  • @Bigjuergo
    @Bigjuergo 4 หลายเดือนก่อน

    possible to run on android?

    • @kylebehrend
      @kylebehrend  4 หลายเดือนก่อน

      I haven't seen it done yet but I am sure there is a way :)

  • @sandwich-plays
    @sandwich-plays 14 วันที่ผ่านมา

    mine crashes?

    • @kylebehrend
      @kylebehrend  12 วันที่ผ่านมา

      I probably wouldn't recommend doing this anymore. Was more an experiment :)

  • @pjth3g0dx
    @pjth3g0dx 5 หลายเดือนก่อน

    Don’t turn on MLock if your using metal this will cause your device to freeze

    • @kylebehrend
      @kylebehrend  5 หลายเดือนก่อน

      Thanks for the tip! Whats MLock?

    • @pjth3g0dx
      @pjth3g0dx 5 หลายเดือนก่อน

      @@kylebehrend metal lock

    • @pjth3g0dx
      @pjth3g0dx 5 หลายเดือนก่อน

      @@kylebehrend locks the loaded model into memory

    • @pjth3g0dx
      @pjth3g0dx 5 หลายเดือนก่อน

      @@kylebehrend settings template: Llama2 chat 7B new .. inference turn on Llama and turn of mlock all will work well. Also in system setting stop the app from running in the background and it will stop the lagging

    • @kylebehrend
      @kylebehrend  4 หลายเดือนก่อน

      @@pjth3g0dx Ooh interesting thanks will have to check that out.

  • @kloakovalimonada
    @kloakovalimonada 17 วันที่ผ่านมา

    Done that, LLM Farm spews complete nonsense. First it generated some sort of novel-type text after simple "hi!", now it creates code after the same prompt.

    • @Ajarylee-qh9ln
      @Ajarylee-qh9ln 14 วันที่ผ่านมา

      I can't speak for LLM Farm specifically, since I never used it, but this looks like an issue with the model or the settings, not with the app.
      Try decreasing the "temperature" setting; "0.75" is a good baseline.

  • @greendsnow
    @greendsnow 6 หลายเดือนก่อน +1

    I'd rather pay OpenAI cents to run the latest LLM model on the cloud,
    than buying an iPad Pro to run, whatever this is, on an 8 GB shared RAM.

    • @kylebehrend
      @kylebehrend  6 หลายเดือนก่อน +1

      Me too :) This is just a demo of whats possible and if I had a trip planned with no WIFI this would be on my download list

    • @Kipwich
      @Kipwich 6 หลายเดือนก่อน +2

      Well, of course you wouldn’t *buy* an iPad Pro to run this, this would be something for people who *already have* an iPad Pro and who want to try it out.