What Makes Large Language Models Expensive?

แชร์
ฝัง
  • เผยแพร่เมื่อ 23 พ.ย. 2024

ความคิดเห็น •

  • @KP-sg9fm
    @KP-sg9fm 11 หลายเดือนก่อน +8

    Can you make a video talking about smaller more effecient models (Orca, Phi II, Gemini Nano, etc)
    Do they have a future, and if so, what does it look like?
    Will more sota models leverage the techniques used by smaller models to become more effiecient?
    Or will they always remain separate?

    • @teleprint-me
      @teleprint-me 11 หลายเดือนก่อน +3

      There are pros and cons to each approach. Larger models are scaled in a way that makes their capabilities proportional to their parameters. So, larger models are smarter and that will always be the case.
      Both techniques feed off of one another, so improvements in one will lead to improvements in another.
      It's cheaper and easier and faster to iterate over smaller models and any gains made throughout the process are applied to larger models.
      Not sure if this helps. Anyone can feel free to correct me if I misrepresented any information.

  • @imanrezazadeh
    @imanrezazadeh 11 หลายเดือนก่อน +6

    Excellent explanation! A minor note: the analogy of curtain makes sense, but then you mentioned fine-tuning makes structural changes to the parameters, which is not accurate. It just changes the values of the parameters.

    • @aymerico11
      @aymerico11 10 หลายเดือนก่อน

      How does it change the value ? Is it token change ? Basically it means that once you've tuned your model f(x) no longer equals y but actually z right ?

  • @jediTempleGuard
    @jediTempleGuard 11 หลายเดือนก่อน +8

    I think customized language models will become more important over time. Companies will want artificial intelligence applications specific to their fields of activity, and individuals will want artificial intelligence applications specific to their special interests. Not to sound like I'm telling fortunes, but with improvements in cost, customized smaller models may become more dominant in the market.

    • @Cahangir
      @Cahangir 11 หลายเดือนก่อน +2

      what types of AI apps would individuals want apart from personal assistants that would need customizing?

    • @Anurag_Hansda
      @Anurag_Hansda 10 หลายเดือนก่อน

      I very much agree with you... Google could be much more efficient by giving specific detail.

    • @RajeshR-bz3nj
      @RajeshR-bz3nj 2 หลายเดือนก่อน

      @@CahangirIndustry specific LLMs. If I am a pancreatic cancer research company, I don’t want to know about Renaissance in Europe

  • @aqynbc
    @aqynbc 11 หลายเดือนก่อน +7

    Another excellent videos that makes you understand the fundamentals of an otherwise complicated subject.

  • @shaniquedasilva1856
    @shaniquedasilva1856 10 หลายเดือนก่อน +3

    Great video Jessica and so informative!! I’m working on a project now implementing Gen AI (gen fallback, generators). Identifying proper use cases are so important to yield the best results while thinking about the # of LLM calls.

    • @unclenine9x9
      @unclenine9x9 9 หลายเดือนก่อน

      Yes, we need to select the suitable LLMs for pickings up the request with cost effective way. Thus the cost of operation should be lowered.

  • @Murat-hh4hu
    @Murat-hh4hu 11 หลายเดือนก่อน +74

    For a moment I thought she is AI generated)

    • @CYBERPOX
      @CYBERPOX 11 หลายเดือนก่อน

      Truth

    • @Beny123
      @Beny123 11 หลายเดือนก่อน +1

      Don’t blame you . Pretty

    • @MohitSharma-dv7mg
      @MohitSharma-dv7mg 11 หลายเดือนก่อน +1

      Yeah and looked finely tuned!

    • @Alice8000
      @Alice8000 7 หลายเดือนก่อน

      nope u didn't

    • @uduakedet2861
      @uduakedet2861 5 วันที่ผ่านมา

      😂😂😂

  • @ameliarose6833
    @ameliarose6833 2 หลายเดือนก่อน

    absolutely love this video. You really answers so many questions to a person who had to know how thing work from the very beginning in order to learn a new skill. Thank you so much.

  • @renanmonteirobarbosa8129
    @renanmonteirobarbosa8129 11 หลายเดือนก่อน +1

    There are mistakes with the information provided.
    PEFT and Lora are separate things
    model size is influenced mostly by numerical choice and how you compile the GPU kernel.
    ...

  • @webgpu
    @webgpu 11 หลายเดือนก่อน +2

    anyone noticed she kept on talking * while * writing ? women are real multitaskers - i swear to God my brain is 100% monotask and i could never Ever: write AND do anything else. The apex of my manly monotaskiness is to be able to talk while i'm driving (but i can only talk about light subjects, if you talk about anything a little more involved, i will just not follow you.

  • @bastabey2652
    @bastabey2652 11 หลายเดือนก่อน

    I once attended a whole day IBM sales presentation in Delhi for telco CRM/Billing system.. it was an educational experience more than sales.. IBM sales is really good

  • @attainconsult
    @attainconsult 11 หลายเดือนก่อน

    this is a great start to costing running models, I think you need to think/explain more along the lines of business i.e. adding in all biz file/google/365 docs, biz emails, other biz data sales cash flow, stock usage, forecasting usage of consumables lettuces coffee... all the things biz work off

  • @fasteddylove-muffin6415
    @fasteddylove-muffin6415 11 หลายเดือนก่อน +2

    You walk into a dealership & ask a salesperson how much a vehicle will cost.
    Answer: This vehicle will cost you whatever you're willing to pay.

  • @carkawalakhatulistiwa
    @carkawalakhatulistiwa 11 หลายเดือนก่อน +4

    And PHI-2 with 2,7 B billion parameters. proves that we have spent a lot of time and money on computerization that is wasted because of bad data.
    with better data PHI-2 LLM can be equivalent to gpt 3 175 billion parameters . and there is still the possibility to reduce LLM to 1 billion parameters with the same capabilities

    • @akj3344
      @akj3344 11 หลายเดือนก่อน

      There are 1B models on huggingface made for RAGs.

  • @silberlinie
    @silberlinie 11 หลายเดือนก่อน +1

    They used an interesting technique to record the video.

  • @team-m2
    @team-m2 11 หลายเดือนก่อน

    Great and concise, thanks! But ... is she writing from the right to the left? 🤔

  • @luciengrondin5802
    @luciengrondin5802 11 หลายเดือนก่อน

    Stumbled upon this and feel like asking : how did IBM miss the LLM train? Watson was very impressive IMHO. Very much ahead of its time. How could IBM not capitalize on it? Why was it OpenAI that ended up with the language model breakthrough? Which innovation openAI had that IBM could not think of? Was it RLHF?

    • @VoltLover00
      @VoltLover00 11 หลายเดือนก่อน

      You can easily google the answer to your question

  • @saikatnextd
    @saikatnextd 9 หลายเดือนก่อน

    Thanks Jessica for this video, really eye opening and introspective at the same time.......

  • @gihan5812
    @gihan5812 11 หลายเดือนก่อน

    How can i speak to someone at IBM about working together.

  • @emil8367
    @emil8367 11 หลายเดือนก่อน +2

    Very interesting and useful. Thanks for explaining so many topics !

  • @oieieio741
    @oieieio741 11 หลายเดือนก่อน +6

    Excellent explanation. A solid understanding of how AI works. Thanks IBM

    • @jhaimp.sullivan5618
      @jhaimp.sullivan5618 2 หลายเดือนก่อน

      Bot? There is another comment saying the exact same thing. Interesting.. I'm noticing a pattern.. just noticed this on another video. Not knocking whoever's behind doing this. But if your going through the trouble of using different accounts why use the same exact comment? Anyways. I'm just halfway curious. Don't really care tbh. I have other reasons behind my curiosity not necessarily bad .. just couldn't resist but to address and pry to a degree not to expose but . Eh idk. Do not wish to further elaborate.

  • @wzqdhr
    @wzqdhr 11 หลายเดือนก่อน

    Does IBM have anything to do with this AI booming?

  • @johnnyalam7301
    @johnnyalam7301 11 หลายเดือนก่อน

    Very nicely and intelligently explained 3:49 pm ( Christmas Day 2023)

  • @mohsenghafari7652
    @mohsenghafari7652 9 หลายเดือนก่อน

    hi. please help me. how to create custom model from many pdfs in Persian language? tank you.

  • @ChrisJSnook
    @ChrisJSnook 10 หลายเดือนก่อน

    What software solution powers this mirrored whiteboard in front of you? It’s awesome and I want to use it?

    • @djembello
      @djembello 5 หลายเดือนก่อน

      I think it can be simple done by rotating/fliping the video itself :)

  • @teresafarrer1252
    @teresafarrer1252 10 หลายเดือนก่อน

    Great video: really clear and professional (unlike a couple of the saddos commenting). Thanks!

  • @seanlee2002
    @seanlee2002 11 หลายเดือนก่อน

    Excellent explanation. A great understanding of how AI works

  • @Alice8000
    @Alice8000 7 หลายเดือนก่อน

    Daaaaamn woman. Good explanation.

  • @mrd6869
    @mrd6869 11 หลายเดือนก่อน

    Small and powerfulmodels will win out.Phi 2 and Orca2 are some good examples.

  • @benthiele
    @benthiele 11 หลายเดือนก่อน

    Incredibly helpful video. Please make more!

  • @LeonButler-b8r
    @LeonButler-b8r 11 หลายเดือนก่อน

    Great explanation Jessica

  • @aymerico11
    @aymerico11 10 หลายเดือนก่อน

    Very good video thanks a lot !

  • @gamingbeast710
    @gamingbeast710 11 หลายเดือนก่อน +1

    awsome , 100% focued :D thx for the professionalisme :D

  • @cleansebob1
    @cleansebob1 11 หลายเดือนก่อน

    Looks like it all depends...

  • @markfitz8315
    @markfitz8315 11 หลายเดือนก่อน +1

    very good - thanks

  • @potatodog7910
    @potatodog7910 11 หลายเดือนก่อน

    How much of this can be done with GPTs?

    • @scottt9382
      @scottt9382 11 หลายเดือนก่อน +1

      A GPT is just one type of an LLM

  • @AdamSioud
    @AdamSioud 11 หลายเดือนก่อน

    Great video

  • @jameshopkins3541
    @jameshopkins3541 10 หลายเดือนก่อน

    THEN A COMMON PERSON CAN'T DO A LLM FROM SCRATCH???

  • @joung-joonlee1037
    @joung-joonlee1037 11 หลายเดือนก่อน +1

    I think, that LLM or GAI Look like Spread-Sheet if concern the facts that this type of engine inject By SELF toward tokens and Spell Out tokens..!! AND This type of tokens look like iterated by LLM or GAI, because that is also programs using Computer Iterations...! AND The LLM or GAI's using cost can be acquired using calculations over Time/Number of Tokens/Weight of Meaning.... But, I know that this calculations is just approximation by User. Thank you for NICE Video! and I'm korean.

  • @reazulislam8446
    @reazulislam8446 11 หลายเดือนก่อน

    So precise..

  • @jameshopkins3541
    @jameshopkins3541 10 หลายเดือนก่อน

    She is 36 years old Isn't it?

  • @rursus8354
    @rursus8354 11 หลายเดือนก่อน

    If you cannot find the best man, take the next best.

  • @jameshopkins3541
    @jameshopkins3541 10 หลายเดือนก่อน +1

    LLM IS BLA BLA BLAAAAA??????

  • @NisseOhlsen
    @NisseOhlsen 11 หลายเดือนก่อน +1

    What makes them so expensive? Simple. Their Architecture is not right.

  • @potatodog7910
    @potatodog7910 11 หลายเดือนก่อน

    Nice

  • @Free-pp8mr
    @Free-pp8mr 11 หลายเดือนก่อน

    It is not intelligent to pay for AI! It’s simply marketing!

  • @SiegelBantuBear
    @SiegelBantuBear 7 หลายเดือนก่อน

    🙏🏼

  • @ciphore
    @ciphore 11 หลายเดือนก่อน

    Nancy Pi did it first 😤

  • @markmaurer6370
    @markmaurer6370 11 หลายเดือนก่อน

    1:19 So IBM does not believe consumers need to have their data protected.

  • @thierry-le-frippon
    @thierry-le-frippon 10 หลายเดือนก่อน

    People will pay for that 😅😅😅 ???

  • @reninj
    @reninj 2 หลายเดือนก่อน

    How is it that, these videos still give such basic generic examples? Use cases for example. She couldn't find different use cases that an enterprise might have? She had to give the example of a car dealership???

  • @Canadainfo
    @Canadainfo 11 หลายเดือนก่อน

    amazon bedrock!!

  • @aprilmeowmeow
    @aprilmeowmeow 8 หลายเดือนก่อน

    so sad that people cant even write a speech anymore.

  • @Drunrealer
    @Drunrealer 8 หลายเดือนก่อน

    Drink from de bottle

  • @michaelm8460
    @michaelm8460 2 หลายเดือนก่อน

    anthropomorphism makes you forget you have another (abiet sophisticated ) search engine. Worse is the Model can enforce that idea by using personal pronouns

  • @bobanmilisavljevic7857
    @bobanmilisavljevic7857 11 หลายเดือนก่อน +1

    🦾🥳

  • @ashishsehrawat_007
    @ashishsehrawat_007 11 หลายเดือนก่อน

    Kinda boring explanation.