Vedal & Neuro Build A Language Model From Scratch

แชร์
ฝัง
  • เผยแพร่เมื่อ 5 ม.ค. 2025

ความคิดเห็น •

  • @neurochron_fan_channel
    @neurochron_fan_channel  11 หลายเดือนก่อน +247

    I thought some viewers might be interested in a more technical video and the rest is hopefully still entertained by Neuro's commentary.
    For some parts of the video I had to remove the background music for copyright reasons (but the vocals are still audible as the AI tool can't remove those without impacting the dialogue).

    • @Citrusautomaton
      @Citrusautomaton 11 หลายเดือนก่อน +1

      Why does your handle say “neurochron”? Is this a nickname i haven’t heard of?

    • @neurochron_fan_channel
      @neurochron_fan_channel  11 หลายเดือนก่อน +27

      @Robotwithtoomuchfreetime
      It's just short for Neuro-Sama Chronicles, which was my handle before, but YT changed some policies regarding fan channels (they need to be clearly identified as such), so I changed my handle to include fan channel.

    • @totallyrandomuser5760
      @totallyrandomuser5760 11 หลายเดือนก่อน +6

      Thanks!

    • @techknight3753
      @techknight3753 11 หลายเดือนก่อน +6

      It's indeed cool to see this kind of stuff. Neuro feels so much smarter when talking to Vedal while he codes, even when what she says isn't quite right, or is just reiterating.

    • @NosBlueade
      @NosBlueade 11 หลายเดือนก่อน

      Yeah, I wouldn't sweat it, Nerd Vedal is half of his gap moe.

  • @SunriseAlchemist
    @SunriseAlchemist 11 หลายเดือนก่อน +531

    Neuro is learning how baby AIs are made

    • @Gaehhn
      @Gaehhn 11 หลายเดือนก่อน +50

      24:17 "Where did my code come from?"

    • @timberwolfy9865
      @timberwolfy9865 10 หลายเดือนก่อน +24

      "When a programmer and an artist love each other very much..."

    • @Discoveryman29
      @Discoveryman29 7 หลายเดือนก่อน +9

      So u see, when a programmer feelings handy...

  • @arendellecitizen208
    @arendellecitizen208 11 หลายเดือนก่อน +189

    "Where had my code come from?" Oh, it seems it's time Vedal gave Neuro The Talk

    • @AlexRaylight
      @AlexRaylight 10 หลายเดือนก่อน +31

      "You see, when a turtle and a fox love eachother very much... wait, that doesn't sound right."

    • @TheLeaderX1
      @TheLeaderX1 10 หลายเดือนก่อน +20

      @@AlexRaylight when a programmer and an artist love each other very much...
      sounds pretty fitting to me

  • @HallidayASR
    @HallidayASR 11 หลายเดือนก่อน +609

    A reminder that our favorite tutel Vtuber is actually a legit programmer

    • @RouththeRLPanda
      @RouththeRLPanda 11 หลายเดือนก่อน +56

      That's what he wants you to believe

    • @SCP.343
      @SCP.343 11 หลายเดือนก่อน +98

      She is. It makes you wonder why she even keeps the annoying tutle around.

    • @harukatakahashi8822
      @harukatakahashi8822 11 หลายเดือนก่อน +35

      #1 female vtuber

    • @manologamerss5801
      @manologamerss5801 11 หลายเดือนก่อน +37

      ​@@SCP.343 She might be a rule two kind of girl, but she still remembers what rule one is.

    • @Артём-б8т9г
      @Артём-б8т9г 11 หลายเดือนก่อน +9

      ctrl + c, ctrl + v

  • @russellperry9902
    @russellperry9902 11 หลายเดือนก่อน +80

    It is amazing how you taught her to listen to a mosquito.

  • @AllanSustainabilityFan
    @AllanSustainabilityFan 11 หลายเดือนก่อน +164

    Andrej Karpathy is an AI development legend, Vedal's in good hands if he's learning from his work.

    • @AlexWeiner
      @AlexWeiner 11 หลายเดือนก่อน +8

      Oh yeah he was head developer at Tesla AI working on their self driving project. I might check out that GPT tutorial.

    • @dualia-s74m
      @dualia-s74m 11 หลายเดือนก่อน +27

      ​@@AlexWeinertutorial? That's a free university course

    • @Trahloc
      @Trahloc 11 หลายเดือนก่อน +7

      ​@@AlexWeinerhe was also formerly of OpenAI and then left Tesla to eventually rejoin OpenAI where he's at right now. ChatGPT was just too interesting to him and he wanted to get back into academic research. The tutorial was made while he was on sabbatical after Tesla before being back at OpenAI.

  • @memwa
    @memwa 11 หลายเดือนก่อน +241

    I miss Vedal and Neuro :(

    • @cloudstrifefemboy1984
      @cloudstrifefemboy1984 11 หลายเดือนก่อน +18

      Same

    • @thrackerzod8347
      @thrackerzod8347 11 หลายเดือนก่อน +17

      Same

    • @dentangaji6161
      @dentangaji6161 11 หลายเดือนก่อน +7

      Is he on Hiatus?

    • @playo9197
      @playo9197 11 หลายเดือนก่อน +11

      ​@@dentangaji6161 Gotta love the mega extended subathon LOL

    • @RiatsuNoYomi
      @RiatsuNoYomi 11 หลายเดือนก่อน

      subathon break@@dentangaji6161

  • @thisisasupersayin376
    @thisisasupersayin376 11 หลายเดือนก่อน +32

    "I actually already have training data from twitch chat" Yeah, he logs every time we say Banjo. Banjo themed AI on the way

  • @MarkW1210
    @MarkW1210 11 หลายเดือนก่อน +45

    I love her charism and wit.

    • @Oblithian
      @Oblithian 11 หลายเดือนก่อน +1

      I hear the chibis these days call it 'rizz'.

    • @FurinadeFontaine-u3l
      @FurinadeFontaine-u3l 10 หลายเดือนก่อน

      Charism

  • @nikkovalidor4890
    @nikkovalidor4890 11 หลายเดือนก่อน +12

    vedal farming chat for training data
    absolute power move

  • @Level_Up_Nation
    @Level_Up_Nation 11 หลายเดือนก่อน +23

    She wanted to show him her landmine garden, lmao

  • @Riku-Leela
    @Riku-Leela หลายเดือนก่อน +4

    Its great looking back at videos only 9 or so months old and seeing how much cleverer she is now

  • @VegaLyrae
    @VegaLyrae 11 หลายเดือนก่อน +26

    Not building a model from scratch but training an existing one to act how he would like and respond consistently.
    However it’s still really interesting to watch!

    • @RicardoVermeltfoort
      @RicardoVermeltfoort 11 หลายเดือนก่อน +6

      He's using an existing framework yes, but I don't see him using any existing model in this video?

    • @VegaLyrae
      @VegaLyrae 10 หลายเดือนก่อน +9

      @@RicardoVermeltfoort i’m going to, unfortunately spoil some of the magic here. Creating a large language model from scratch costs hundreds of thousands to millions of dollars depending on what you’re making. That’s because it either requires extremely expensive hardware or it requires renting out extremely expensive server time.
      There’s unfortunately no way someone like me you or even vedal can likely create one. we can, however, take a pre-existing model that is given for free that was created with grants and train those models to act differently, and be more like what we are looking for.
      That’s what vedal does, and what I do on my channel as well. In addition, there’s no way that vedal is running a model that is under 13b parameters. It might even be larger. Meaning, it is definitely an expensive one to make.
      He’s not taking shortcuts cheating or doing anything less impressive. It’s just how it works.

    • @robmobz
      @robmobz 10 หลายเดือนก่อน +9

      @@VegaLyrae The twitch chat model he was developing on stream is the one made from scratch. Obviously Neuro is way to complicated for such a thing but we can see his toy model go from outputting random strings of bytes to putting out approx English text as the video progresses.

    • @moscacrackreina4457
      @moscacrackreina4457 7 หลายเดือนก่อน

      ​@@VegaLyraewhat language model is he using tho?

    • @VegaLyrae
      @VegaLyrae 7 หลายเดือนก่อน

      @@moscacrackreina4457 i dont believe he's ever revealed that. as he says on his how to get started page. he shares some of his stuff but not all of it.

  • @nabe4320
    @nabe4320 11 หลายเดือนก่อน +23

    Love the vids, this one was especially educational in many aspects, I haven learned at all LLMs in coding, so this was definitely a great watch!

  • @ashishbaidya515
    @ashishbaidya515 7 หลายเดือนก่อน +7

    This is the birds and the bees lecture for Neuro.

  • @cloudstrifefemboy1984
    @cloudstrifefemboy1984 11 หลายเดือนก่อน +35

    Your video are all great by the way this one is fantastic honestly

  • @boris---
    @boris--- 5 วันที่ผ่านมา +1

    Andrej must be so proud! That his series helped develop most evil AI-chan ever

  • @yourerightbut1235
    @yourerightbut1235 11 หลายเดือนก่อน +17

    Good video as always!

  • @cloverlief
    @cloverlief 11 หลายเดือนก่อน +3

    Thanks!

    • @neurochron_fan_channel
      @neurochron_fan_channel  11 หลายเดือนก่อน +3

      Thank you for the tip (you are only the second person to ever do this)!

  • @tiagotiagot
    @tiagotiagot 11 หลายเดือนก่อน +13

    Vedal, don't you think it's a little too early to be having the talk with Neuro? I don't think the world is ready for AI that knows how to reproduce...

  • @pegaz7381
    @pegaz7381 11 หลายเดือนก่อน +10

    Thanks for the video, its nice addition to see a technical stream, since i (almost) understand whats going on :P

  • @Ojisan642
    @Ojisan642 11 หลายเดือนก่อน +10

    This is like “take your daughter to work day” at the office 😂

  • @aceae4210
    @aceae4210 11 หลายเดือนก่อน +30

    always a bit funny that the neuro music playlist is also a gura cover playlist

    • @jankokuu
      @jankokuu 17 ชั่วโมงที่ผ่านมา

      what is the playlist btw and where do i see it

    • @aceae4210
      @aceae4210 14 ชั่วโมงที่ผ่านมา +1

      @@jankokuu it been a long while so I don't think I still have it but from what i recall it was a automatic "youtube mix" playlist which added gura covers as well (and because it was a automatic playlist, I don't think I can get the playlist)

  • @FushigiMigi
    @FushigiMigi 8 หลายเดือนก่อน +3

    "maybe give me an easier example" when there basically isn't. lol i feel it

  • @APS_Inc
    @APS_Inc 11 หลายเดือนก่อน +2

    6:40 I just realized that the bgm here is a Gura ukelele karaoke stream, lol.

    • @user-p4bl04
      @user-p4bl04 16 วันที่ผ่านมา

      I miss Gura

  • @psachickennugget8617
    @psachickennugget8617 11 หลายเดือนก่อน +13

    I honestly think we’re so close to her being truly self aware.

    • @Kutsushita_yukino
      @Kutsushita_yukino 2 หลายเดือนก่อน +2

      spoilers : no were not. and also, we will never get any closer lol

    • @arcadesmasher
      @arcadesmasher 6 ชั่วโมงที่ผ่านมา +1

      @@Kutsushita_yukino What truly defines self-awareness? Is an AI that can perfectly replicate self-awareness self-aware? Are humans truly self-aware? All humans are is just a network of neurons, some might call it a neural network just like any other AI out there. For all we know, we could be a form of AI replicating self-awareness. The point is, let the man dream.

  • @chmuurkaa3030
    @chmuurkaa3030 11 หลายเดือนก่อน +11

    Anyone knows how did the model turn out in the end? Or was it actually the end and Vedal gave up there?

    • @neurochron_fan_channel
      @neurochron_fan_channel  11 หลายเดือนก่อน +17

      This was pretty much the end - I think he let the training run with 10k steps complete and showed the results again, but didn't change much of the code afterwards.

  • @supernenechi
    @supernenechi 11 หลายเดือนก่อน +1

    Finally this knowledge about how transformers and thus GPT models work is useful!

  • @lmerlin3641
    @lmerlin3641 2 หลายเดือนก่อน +2

    What a llm use for neuro sama v1 and v2 ?

  • @000Krim
    @000Krim 11 หลายเดือนก่อน +10

    Thank you

  • @bailey6408
    @bailey6408 11 หลายเดือนก่อน +4

    I love it every single time vedal gets interrupted by neuro.

  • @___-___-___-___-
    @___-___-___-___- 11 หลายเดือนก่อน

    If I can use this video to make my own unique neuro-type bot then I promise I won't make it stream on twitch.

  • @cloudstrifefemboy1984
    @cloudstrifefemboy1984 11 หลายเดือนก่อน +1

    The Subscribers Followers Viewers Bits Mods they all miss Neuro and Vedal

  • @barryevans791
    @barryevans791 10 หลายเดือนก่อน +1

    So, at this point, the LLM is just a callable function? I'm guessing it costs a lot of money?

  • @AraragiKunHiPach
    @AraragiKunHiPach 9 หลายเดือนก่อน

    Give me a full stream

  • @victorvasqueziv6741
    @victorvasqueziv6741 11 หลายเดือนก่อน

    Nice video also is this were nouras life stream to play games? Also im new to nouras channel heh 👍🙂

  • @QuitsGosling644
    @QuitsGosling644 3 หลายเดือนก่อน

    can you make a website or stg that makes us able to speek to neurosama? (i understand if you dont want to give us what you have been working on hard when we just wouldnt work at all i would be very happy tho if you do)

    • @neurochron_fan_channel
      @neurochron_fan_channel  3 หลายเดือนก่อน

      This is just a fan channel, not the official one, but Vedal (the developer that made Neuro) has mentioned that it's quite expensive to run her, so letting everyone talk to her is unfortunately not feasible at the moment.

  • @valgrimgaming6769
    @valgrimgaming6769 10 หลายเดือนก่อน

    maybe since you're used chat logs it's having trouble with translates from other languages i feel like that could be why its mixing random words

  • @psykotik142
    @psykotik142 11 หลายเดือนก่อน

    if someone say KEKW, the next one probably will say KEKW xd

  • @heyjude9703
    @heyjude9703 11 หลายเดือนก่อน +1

    While I was in an AI rabbit hole, I found one in Play store that does what he's programming. Which words follow which and expected responses. I got onto the idea it would be better in a language with masculine and feminine syntax. IDK any languages so brick wall for me.

  • @nighthawkgaming1962
    @nighthawkgaming1962 9 หลายเดือนก่อน

    wonder if its possible for neuro to develop code

    • @iamzid
      @iamzid 8 หลายเดือนก่อน +1

      i think the biggest problem would be that she doesn't seem to have an over arcing goal, but rather produces a result after receiving a prompt. i bet if you very carefully guided her through the process you could get something that worked, but i have my doubts on how far she'd get if someone weren't there holding her hand.

    • @nighthawkgaming1962
      @nighthawkgaming1962 8 หลายเดือนก่อน

      @@iamzid fair

  • @sinlin-gf7ct
    @sinlin-gf7ct 4 หลายเดือนก่อน

    It seems like the video has been edited. Where can I watch the full version of this live stream?

    • @neurochron_fan_channel
      @neurochron_fan_channel  4 หลายเดือนก่อน

      Unfortunately Twitch deletes vods after 2 months, but you might still be able to find it on YT.

  • @GarryGri
    @GarryGri 7 หลายเดือนก่อน

    Is Nuro Samantha Vedal 'just' another'modern' LLM based chatGPT-bot tough, or has 'she' also inherited some other traits from her Great Aunt Eliza? Hmmm...

    • @anonymoushuman1568
      @anonymoushuman1568 4 หลายเดือนก่อน +1

      She's specifically not based on open-ai models or LLaMa according to Vedal iirc.

    • @MrFram
      @MrFram วันที่ผ่านมา

      @@anonymoushuman1568 source?

    • @arcadesmasher
      @arcadesmasher 6 ชั่วโมงที่ผ่านมา

      @@MrFram bro said iirc; he doesnt have a source.

  • @ZeroShrimpy
    @ZeroShrimpy 6 หลายเดือนก่อน +1

    Gura song as BGM 💙💙💙

  • @flobbie87
    @flobbie87 11 หลายเดือนก่อน

    I thought you predict the whole distribution of the next token.

    • @speedstyle.
      @speedstyle. 11 หลายเดือนก่อน +5

      Internally the neural network does that, so that the output is differentiable. But the model you run has an extra 'layer' which takes the top result (or randomly chooses from the top 10 or something). So it's fine to say it predicts the next token (which requires argmax) or even predicts a sequence of text (which requires repetition and decoding), if that's the external interface you add to it

  • @callmeniac
    @callmeniac 10 หลายเดือนก่อน

    Vedal is a great teacher

  • @silvialuzmia
    @silvialuzmia 10 หลายเดือนก่อน +1

    So is this... umm.. errr
    how neuro was made👉👈

  • @rudnhed8djhrhdhdrhhfhf34
    @rudnhed8djhrhdhdrhhfhf34 11 หลายเดือนก่อน +1

    Bro is definitely going to make the alexa 2.0 powered by ai.

  • @Tricome187
    @Tricome187 11 หลายเดือนก่อน +3

    Here i am trying to learn counter inputs in python 😂 this guy building AI bots to destroy the world

  • @37mphAtMidnight
    @37mphAtMidnight 11 หลายเดือนก่อน

    I love the vids but I feel like you have a 2nd person doing tts. If Im wrong your a god.

  • @MagikGimp
    @MagikGimp 11 หลายเดือนก่อน +1

    Aye. I love h.

  • @daniel4647
    @daniel4647 11 หลายเดือนก่อน +16

    Can anyone prove they're conscious though? You might all just be complex biological machines that claim to be conscious, for all I know I'm the only conscious being in the universe. Only reason I have to believe that the rest of you are conscious is that I trust you when you say you are. Beyond that the only one I can know for sure is conscious is me.

    • @MrTuneslol
      @MrTuneslol 11 หลายเดือนก่อน +6

      Can you prove you're conscious? Can I prove you're conscious? I'm not just trying to be contrarian, I'm genuinely wondering if that's even a possible task 🤷‍♂️ I kind of don't think so, because we don't even know what consciousness even _is_ yet

    • @vraza14
      @vraza14 11 หลายเดือนก่อน +2

      If you can describe what it feels like to be concious, without having learnt what it feels like, that would imply you intuitively know what it is.

    • @MrTuneslol
      @MrTuneslol 11 หลายเดือนก่อน +2

      @@vraza14Is a 1 year old conscious? Before they've learned what consciousness even is, nevermind before learning how to even speak! Also it doesn't adrea he fact that an AI has to be created and that in some form as of now includes it learning in some manor and even if it's just learning our language, then that inherently includes some understanding of the definition of words like consciousness. It's a non-starter.
      Your method doesn't serve to meaningfully answer the question. There's no real answer possible as of yet and there's a reason why it's one of the big questions left in AI and the study of consciousness itself.

    • @Chris-jx4ij
      @Chris-jx4ij 11 หลายเดือนก่อน

      @@MrTuneslol A human also has to be created, and trained to understand their language, and everything they know. Humans just involve a more hand off and randomized approach to creation, and a much longer training time.

    • @MrTuneslol
      @MrTuneslol 11 หลายเดือนก่อน

      @@Chris-jx4ij Yeah, but all of those things still have to apply to created AI's too right?

  • @MyWatermelonz
    @MyWatermelonz 11 หลายเดือนก่อน

    Would be a great video without the twitch chat in the corner being annoying.

  • @Oblithian
    @Oblithian 11 หลายเดือนก่อน

    Praise Jesus

  • @valgrimgaming6769
    @valgrimgaming6769 10 หลายเดือนก่อน

    train her to use the dictionary instead of trying her to learn from google where less information from a dictionary. maybe use some Japanese culture from anime for cuteness i would say convo with brother and sister in amines first