AI Language Models & Transformers - Computerphile

Computerphile

มุมมอง 332 121

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 13 ม.ค. 2025

ความคิดเห็น • 301

@ykn18 5 ปีที่แล้ว ⁺⁴⁴⁶
Sometimes I'll start a sentence, and I don't even know where it's going. I just hope I find it along the way. Like an improv conversation. An improversation.
-Michael Scott
@Confucius_76 5 ปีที่แล้ว ⁺²⁰
Michael Scott is actually an advanced AI
@bytefoundry8702 5 ปีที่แล้ว ⁺¹¹
“You miss 100 percent of the shots you don't take'- Wayne Gretzky"-Michael Scott
@leonhardeuler9839 5 ปีที่แล้ว ⁺⁸
I prove mathematical theorem using the same tactic.
@mj6463 4 ปีที่แล้ว ⁺¹
I think it’s just the right thing I don’t know what to say they have a ton to say but I don’t know what they do and I think it’s a great idea but they are just trying out the front door.
-apple auto fill
@ZeroRelevance 4 ปีที่แล้ว ⁺³
This is how I usually explain stuff (Spoiler: It’s not very effective)
@disarmyouwitha ปีที่แล้ว ⁺¹³⁵
Crazy to look back just 3 years to GPT2 =] Thank you for explaining Attention.
I have been trying very hard to comprehend how LLMs are able to “understand” and “reason”, or at least look like they are..
@fable4315 ปีที่แล้ว ⁺⁵
Imo they understand already language, what is the human brain except from a predictive model, which tryies to predict what comes next
@x--. ปีที่แล้ว ⁺⁹
@@fable4315 tbh, some humans I know seem to be a less functional version of this. Not just in their own words but in failing to really listen to what I'm saying and failing to predict accurately - thus they think they know what I'm saying but they don't. It creates incredibly difficult situations because overcoming arrogance is much harder than plain ol' confusion.
@frozenwindow407 ปีที่แล้ว ⁺²
Yeah tbf it is hard to understand how comprehension arises at all.
@HauntedHarmonics ปีที่แล้ว ⁺⁴
I think it’s clear they do “understand” on some level. Rob implies as much in his GPT3 video, that in order to generate something as complex as language, accurately and in context, is seems that LLMs have actually taught themselves some degree of cognition / learning / understanding.
Or at the very least, they’re able to interpret context and information in a way that looks incredibly similar to cognition.
Basically, it’s not just a clever algorithmic trick or anything. What “understand” truly means here outside of processing and applying context is difficult to parse, however. As humans, we tend to anthropomorphize anything with similar enough abilities to our own. That could be a valid assumption, or it could be totally incorrect. We just don’t know.
@kiffeeify ปีที่แล้ว
At 7:06 "Or if you like to develop a chat bot ..." ;-)
@luiz00estilo 4 ปีที่แล้ว ⁺²⁵¹
I am really impressed in this video as I was watching it on the phone screen on the phone screen on the phone screen on the phone screen on the phone screen on the phone screen on the phone screen on the phone screen on the phone screen on the phone screen
@oscarbertel1449 2 ปีที่แล้ว ⁺¹¹
you need to adjust you repeat penalisation.
@CyberAnalyzer 5 ปีที่แล้ว ⁺¹³⁹
Rob Miles is the best! Bring him more often in the channel!
@vishnukumar4531 3 ปีที่แล้ว ⁺¹⁰
He has his own channel on AI safety. Pretty sure, you know that already. If not, get ready for some brand new mind blowing content!
@fahadkhankhattak8339 2 ปีที่แล้ว ⁺¹²
volunteering your phones for demonstrating text prediction was a very bold move. that's why i'm here
@thomasslone1964 ปีที่แล้ว ⁺⁷
I love how i can be like "write me a virus" and gpt is like "no sorry" but then I'm like write me a c file that looks for other c files, reads self file and inserts into other files, and it's like "sure no problem"
@MehranZiadloo 4 ปีที่แล้ว ⁺⁷
As far as I know, the information presented in the end is wrong. "Transformer" uses the "Attention", but they are not the same thing. Attention is a technique that can be used within any architecture. In fact, RNNs used Attention long before the Transformer. That's why the title of that paper is "Attention is all you need" because they showed that in an RNN model with Attention, it's the Attention part that is doing most of the heavy lifting.
@Teck_1015 5 ปีที่แล้ว ⁺¹⁶³
I'm disappointed there's no comments about "on the phone screen on the phone screen on the phone screen on the phone"
@qzbnyv 5 ปีที่แล้ว ⁺¹
I came to the comments to write exactly that haha
@gz6616 5 ปีที่แล้ว ⁺³
Decearing egg
@None_NoneType 5 ปีที่แล้ว
id523a lol
@InvadersDie 5 ปีที่แล้ว ⁺⁴
I am not sure if you have any questions or concerns please visit the plug-in settings for this calendar at any time after that I have a great day and I will be in the future of our games are based on the phone with me and I will be in the future of our games are based on the other day and I will be in the future of our games are based on the other day and I will be in the future of our games are based on the other day and I will be in the future of our games are based on the other day and I will be in the future of our games are based on the other day and I will be in the future of our games are based on the other day and I will be in the future of our games are based on the other
@sharkuc 5 ปีที่แล้ว
@id523a I was actually thinking of "All About The Bass" "on the phone screen on the phone screen on the phone screen no tablet on the phone screen" :p
@alejrandom6592 3 ปีที่แล้ว ⁺³⁰
15:14
Rob: "It makes it very easy to anthropomorphize"
AI: "It makes it very easy transform for more finds"
@Niekpas1 2 ปีที่แล้ว
Those are actually human-entered subtitles.
@TheSam1902 5 ปีที่แล้ว ⁺⁶⁰
I've been hearing about this attention thingy for many months but never quite looked into it. I appreciate you made a video about it however I got to admit I'm a bit disappointed that you didn't take us through a step by step working example like you did for LSTM and so many other things on this channel..
Maybe in a follow-up video ?
@christopherg2347 5 ปีที่แล้ว ⁺²⁵
14:00 "LST is state of the art!"
Scientist: "Hold my paper. Now read it."
@tlniec 3 ปีที่แล้ว ⁺²
Great explanation, and the examples of how things can break down (e.g. Arnold's biceps) were very illustrative.
@ganymede242 5 ปีที่แล้ว ⁺¹⁹²
"Transformers are a step up" - I wonder if that was intentional?
@ahmadayazamin3313 4 ปีที่แล้ว ⁺²
lol
@randfur 4 ปีที่แล้ว ⁺⁹
It was just the Markov chain probabilities.
@JY-ke3en 3 ปีที่แล้ว
@@ahmadayazamin3313 pllllllllllllllllllllllllllllllplllllllllllllpllllllplllllllpplllllllllllllllllllllllllllpllllllplllllllplllllllllllllllllllllllllllllplllllllllllllpllllllpllplllllpplplllllllllllllllpllplllllllllllllllllllpllllllpllll
@sp10sn 3 ปีที่แล้ว ⁺¹
🤣
@tristanmisja 2 ปีที่แล้ว ⁺²
Nice one
@haardshah1715 ปีที่แล้ว ⁺¹
Very well explained, I love how simply you explained all the concepts and tried it together with history and examples which really help drive the point home. Great work!! I definitely agree with other in the comments we need more of Rob Miles.
@DamianReloaded 5 ปีที่แล้ว ⁺⁷²
Hi Miles! Just reminding you that we would like to know all the details about this architecture! ;)
@philrod1 5 ปีที่แล้ว ⁺⁵
A link to the paper would have been nice, too. But there's always Google, I suppose.
@matthewburson2908 5 ปีที่แล้ว ⁺³
@@philrod1 I'm sure that gpt-2 can rewrite the paper, but in a different way. 😂
@TheSam1902 5 ปีที่แล้ว
@@philrod1 And scihub
@Skynet_the_AI 2 ปีที่แล้ว
Ha ha ha, very funny f u n n n n y
@angelinagokhale9309 2 ปีที่แล้ว ⁺⁷
Thank you Robert for this wonderful video. This will be a beneficial repository for my students to refer to when studying the transformer architecture.
@hedonisticzen 5 ปีที่แล้ว ⁺⁴
I'm a simple program I see Rob on computerfile and I click like.
@CutcliffePaul ปีที่แล้ว ⁺²
7:41: "And human beings do this all the time", that is, they try different words or phrases to find one that makes sense. That's why you might sometimes ask someone to repeat something, only to immediately tell them not to worry as you've just worked out what they said.
@ajourneytogrowth 10 หลายเดือนก่อน ⁺¹
just 4 years ago and we have seen this crazy progress... now with even more funding and interest pouring in, I wonder where we will be in another 4 years time
@muche6321 5 ปีที่แล้ว ⁺³²
Attention seems to be the way the human brain evolved to solve the problem of too much information to be processed. Later we invented computers to help with that.
@thecactus7950 5 ปีที่แล้ว ⁺¹⁷⁶
Put the papers hes talking about in the description please! I am sure a lot of people would want to read them.
@NolanAtlas 5 ปีที่แล้ว ⁺²⁷
the cactus the main paper specified is “Attention Is All You Need.” Most of these papers can easily be found on Google Scholar for research papers.
@tsunghan_yu 5 ปีที่แล้ว ⁺¹²
It's on arXiv
@divyeshgaur ปีที่แล้ว ⁺¹
Thank you for sharing the information and explaining "it" with ease. A lot of context is hidden behind the word "it" :) Great video!
@rpcruz 5 ปีที่แล้ว ⁺¹⁵
Amazing information. And Rob Miles is really good! Thank you for producing such high quality content!
@e11e7en ปีที่แล้ว ⁺⁶
Would love a deeper dive into how “attention” is achieved in LLMs! Thanks for the videos!
@equious8413 ปีที่แล้ว ⁺¹
I tried really hard to find a video I saw recently that did a decent job explaining things. Basically the neural network creates vector matrices representative of a data point, imagine [(12,15),(10,22),(30,11)] being a data point (actual matrices having many more dimensions), you can plot this in 3D space. Attention in LLMs as I understand stand it, is achieved through the relational positions of data points within this "latent space". If you have a data point physically closer to another in latent space, the network recognizes this as a strong relationship to consider.
@PiPiSquared ปีที่แล้ว ⁺¹
@@equious8413 Not 3D space, multidimensional space ;) ChatGPT uses 12.000 dimensions in its vector space!
@dhdh9933 ปีที่แล้ว ⁺²
this video is now more relevant than ever
@Confucius_76 5 ปีที่แล้ว ⁺¹
This channel is such a treasure
@wakingdisaster 5 ปีที่แล้ว ⁺⁸⁶
I have attached my resume for your reference and hope to hear from you soon as I am currently working on the same and I am currently working on the project management project and I am currently working on the project management project and I am currently working on the project management project
@mj6463 4 ปีที่แล้ว ⁺²
The problem was they had to do something about that it would not have to let it happen they just want it and then it didn’t get any worse or so it would just get it back and they just want it and then it will get a job done
@Robotomy101 4 ปีที่แล้ว ⁺¹
I will be back in the office on Monday.
@ludiProjects ปีที่แล้ว
je vous remercie pour votre réponse et votre aide pour la vente de mon appartement et de me faire parvenir le devis pour la fourniture de la maison de la maison de la maison de la maison de la maison...
@AZTECMAN 4 ปีที่แล้ว ⁺²
...Wanted a bit more info on the Transformer. You guys still get a thumbs up. Miles is awesome at explaining.
@Lysergesaure1 5 ปีที่แล้ว ⁺⁶
One of my favorite Computerphile episodes. Thank you Rob :-)
@zilog1 ปีที่แล้ว
your pfp is the cutest sht. reminder, protogens go beep boop ^~^
@morkovija 5 ปีที่แล้ว ⁺⁴
Our boi Rob back at it again!!
@cavalrycome 5 ปีที่แล้ว ⁺⁶
The "Attention is all you need" paper is available for download here: arxiv.org/pdf/1706.03762.pdf
@sharkinahat 5 ปีที่แล้ว ⁺⁴⁰
I think someone has been feeding the text predict AI on the second phone the famously correct English sentence 'Buffalo buffalo Buffalo buffalo buffalo buffalo Buffalo buffalo.'
@trucid2 5 ปีที่แล้ว ⁺³
Check out the Chinese version...
@ganondorf5573 5 ปีที่แล้ว ⁺¹
@@trucid2 Ma ma ma ma ma ma? (all with different pronunciations that I'm too lazy to properly denote)
@trucid2 5 ปีที่แล้ว ⁺⁶
@@ganondorf5573 It's a whole poem:
Shíshì shīshì Shī Shì, shì shī, shì shí shí shī.
Shì shíshí shì shì shì shī.
Shí shí, shì shí shī shì shì.
Shì shí, shì Shī Shì shì shì.
Shì shì shì shí shī, shì shǐ shì, shǐ shì shí shī shìshì.
Shì shí shì shí shī shī, shì shíshì.
Shíshì shī, Shì shǐ shì shì shíshì.
Shíshì shì, Shì shǐ shì shí shì shí shī.
Shí shí, shǐ shí shì shí shī shī, shí shí shí shī shī.
Shì shì shì shì.
@ganondorf5573 5 ปีที่แล้ว ⁺¹
@@trucid2 Oh.. I've never heard of that one.. I don't know what it means lol... that's awesome
@MrCmon113 5 ปีที่แล้ว ⁺¹
This works for arbitrary iterations of the word buffalo.
Buffalo.
Buffalo buffalo.
Buffalo buffalo buffalo.
...
@rishabhshirke1175 5 ปีที่แล้ว ⁺⁴
3:11 this happened with me when i was working on simple lstm, It happend because I was choosing the next word having max probability all the time instead of selecting randomly from the entire distribution where probability of selecting the particular word is equal to its probability of being the next word.
@sehbanomer8151 4 ปีที่แล้ว ⁺²
beam search is even better
@Skynet_the_AI 2 ปีที่แล้ว
I know what you mean to say when i read all the letters and spaces you have combined which makes me feel some kind of way but not the same awe i feel when the correct combinations are combined together like as in poetry or some spoken word format that produces a deep sense of awe for lack of a better word but i apologise for my lack of wait give me a second as i think of the word i want to use but i cannot think as i am writing this very message at the moment i am trying to think of that word so i forgot what the point i was initially trying to make oh yeah it was lack of a more extensive vocabulary ok thanks peace yeah yeah yeah right
@amir-ali8850 2 ปีที่แล้ว ⁺¹
I wish someone would love me as much I love compuerphile videos!
@shewittau 2 ปีที่แล้ว ⁺¹
Best explaination I've head on this stuff.
@tendamolesta 3 ปีที่แล้ว ⁺³
Awwww, I miss snorting the quality mould that grows only in the walls of the university, like the one in that room...
@juuse94 5 ปีที่แล้ว ⁺⁴
The phone text prediction is based on the user's own inputs. Probably not a neural net still, but they learn to form sentences like you write them.
@kiffeeify ปีที่แล้ว ⁺²
13:00 The Irony: Now we have transformers that are stupidly good, but they are not recurrent, which is why they are still limited to a certain max token length. I can't wait to see the combination of a GPT like model with a LSTM like mechanism, where the Transformer keeps a compressed version the history of its contexts, and decides what is relevant for the future, and what he can forget.
Maybe Attention isn't all you need after all ;D
@thatguywhowouldnotsharehis2062 5 ปีที่แล้ว ⁺⁴⁸
This is how my brain works:
If presenter == "rob miles":
Views=views+1
@oluchukwuokafor7729 5 ปีที่แล้ว ⁺¹⁸
error: Views not defined
@jbko6161 5 ปีที่แล้ว ⁺³
or Views++
depends on language
@threeMetreJim 5 ปีที่แล้ว
Very simplistic, hard coded and no way to learn...
@ZenoDovahkiin 5 ปีที่แล้ว ⁺¹
>Views=views+1
You seem to be referencing Pythin. Python is case sensitive.
@HypnosisLessons 5 ปีที่แล้ว
@@jbko6161 we dont do that here.
@timothyclemson ปีที่แล้ว
This video is not a meme that is the best way to get a job description for the first time I have a great day and I have a great day ❤️
@Roxor128 5 ปีที่แล้ว ⁺⁵
The "on the phone screen" sentence reminded me of an anecdote I heard about where a program was fed some text containing the word "banana" and got stuck producing "banananananananananananananananana". I think it may have been a variant on Dissociated Press.
@IMortage ปีที่แล้ว ⁺¹
The forgetting part reminded me of some research done on the neurological function of dreams.
@henkjekel4081 ปีที่แล้ว
Transformers are also good at Vision tasks, it just wasnt discovered yet how when this video was made. It envolves cutting up images and inputting the patches as tokens.
@cartossin ปีที่แล้ว ⁺³
It's shocking to see the tone and context of language model conversation change so much in 3 years. Back then it was "well we try to figure out the next word by..." and now it's "the model is simulating a dumb person so it outputs dumb things, but sometimes..."
@santiagocalvo 7 หลายเดือนก่อน
It's hard to believe this was just 4 years ago and now we have GTP4o, I wonder were we'll be in another 4 years
@caleblarson6925 5 ปีที่แล้ว ⁺²
Yay more Rob Miles!
@TheAIEpiphany 4 ปีที่แล้ว ⁺²
A nice high-level explanation of the topic! Maybe you should have mentioned the original attention paper "Machine Learning Translation By Jointly Learning to Align and Translate" that came out a couple of years earlier than the Transformer paper "Attention is all you need". But then again that would make this less of a high-level talk haha.
@BlakeAlexanderGames ปีที่แล้ว
Funny seing you here, I just started your MLops series :)
@abhinav9561 3 ปีที่แล้ว
Excellent video Rob. Thanks
@saurav1096 ปีที่แล้ว
I like to think the AI was drooling over Arnolds arms and makes silly mistakes like , thinking a mugs a dumbbell 😂 17:00
@6teeth318-w5k 3 ปีที่แล้ว ⁺¹
I understand a lot you say, but a lot, not. :) Even though that, your enthuisasm is infective. I love people with such commitment, and your humor is great.
@World_Theory 5 ปีที่แล้ว ⁺²
I suspect that image recognition and language related tasks are more similar problems than people realize, due to the context problem. Context is necessary for both. But what I wonder is: Is context a memory problem for image tasks, like it is for language tasks? Is it hidden because the data described by images isn't in sequence like a sentence?
I'm thinking of context like the stuff outside the view of a window, which isn't being looked at by a neural network. But now I wonder if context and learned human assumptions are being mixed up with each other some.
@TheSam1902 5 ปีที่แล้ว ⁺¹
Image and text processing are alike in the sense that you ought to deduct context BUT a text is already a very densely packed semantic information medium, whilst an image isn't. Just compare the entire work of Shakespeare in text format vs TIFF format. The former can be stored on less than a megabyte whilst the latter takes up more that a hundred gigabytes.
Therefore the first step is to extract meaningful "features", that's why CNN are so powerful. Though they're not designed to do the job of transformers and attention, they should be traded for transformers in the deepest (topmost) layers IMO.
@World_Theory 5 ปีที่แล้ว
Samuel Prevost,
I now wonder how someone or some thing, decides what is meaningful. Philosophy…
@frankie59er 3 ปีที่แล้ว
These videos are great, please keep making them
@euclideanplane 5 ปีที่แล้ว ⁺¹
2:52 ok but, on top of this, we have our own concepts of what people might say next, in our own minds, based on what the conversation is about, based on what the person typically likes to talk about, based on where you are or what you've been doing that day, or even things you've done with that person in the past like some epic fortnite game you played a week ago he still likes to bring up when you talk about getting a new computer (maybe he thinks he will be even better than he was during that epic game you two played but once he has 244hz monitor and new pc things, haha)
@euclideanplane 5 ปีที่แล้ว ⁺¹
3:44 breh, what's the most common word to follow after a combination of TWO specific words? Or Three of Four of Five or Six.
Does the complexity increase a requirement for more computing power for this to even be useful? I imagine it does, but i'm not sure how many common word stitching we could find, maybe we run it once and we just jot down all the really common ones. Or run it constantly... like our brains do... memory... and constantly remember or generate your own, unique way of stitching words together, one that seems smarter or more detail dense or ... k thinking mode off, you ever just sit and wonder how our mind programs work...
@euclideanplane 5 ปีที่แล้ว
"Doesn't look back"
Easy way to put it.
@euclideanplane 5 ปีที่แล้ว
9:30 well, seems I understood / understand these things, I'm happy about that
@CSniper 11 หลายเดือนก่อน
Would be nice with a deepdive on transformers with Miles!
@letMeSayThatInIrish 5 ปีที่แล้ว ⁺¹
Comment from my phone: "The first thing you can say about the app store in a new car in a new location in a car that has a lot to be is the car and a new vehicle that you have a lot to get in a new vehicle to be sure that you can use it to get a better car than a new vehicle and if it doesn't have the right amount you will have a good chance to be a new vehicle".
@Desmaad 5 ปีที่แล้ว ⁺¹
3:00 I think those text prediction systems are based on probability-based n-ary trees.
@suleimanmustafa1473 5 ปีที่แล้ว ⁺⁶
I usually pronounce WALK as WORK and THREE as TREE
@EdeYOlorDSZs 2 ปีที่แล้ว
Great explination, I'm defo gonna read that paper
@4.0.4 5 ปีที่แล้ว ⁺³
I was curious about how you train these. Could be fun doing it ourselves even if the results are crappy/undertrained.
@pgoeds7420 5 ปีที่แล้ว ⁺¹
Today I am going to tell you to respect the Tech Lead. I am the Tech Lead and I am drinking coffee. This is quite nice coffee from the place in the comments but they are not going to give you the discount I get because I am the Tech Lead.
@RichoRosai 2 ปีที่แล้ว ⁺⁴
As a translator recently fallen on hard times, I suddenly feel the need to send a robot back in the past to disrupt progress in this field.
@antimatter2417 2 ปีที่แล้ว ⁺²
:(
@Arnaz87 5 ปีที่แล้ว ⁺¹
Are you teasing us with the data processing video Sean? This is the second video in which it appears at the end but it doesn't exist.
@_HarshVerma 3 ปีที่แล้ว ⁺¹
i think it ' s interesting . i ' m not sure i ' ve ever heard of an ai language model - GPT J 6B which i am using currently using
@mikeking7957 ปีที่แล้ว
thanks for the video Rob and Computerphile! What is the vector (around 9 minutes) ? The bunch of number represent the memory but what is it?
@MrMelonMonkey ปีที่แล้ว
it is very interesting how the automatic subtitles which i assume are using some kind of language model never gets it when youre saying shawn. it's always "short" or "sure" or something
@MaxDiscere 5 ปีที่แล้ว ⁺⁴
So and how do I implement it? Some examples would've been great on how to use these models
@Guztav1337 5 ปีที่แล้ว
I would look up the paper. This video has about the same information as the companies would give to the journalists, ie. it is not technical just overview.
So either search up the model or read the paper.
@abcdxx1059 5 ปีที่แล้ว
lol they are really hard
@abcdxx1059 5 ปีที่แล้ว
@@Guztav1337 there are a lot of technical videos on it
@Guztav1337 5 ปีที่แล้ว
Yes, use youtube or other media to find what you are looking for.
@lakloplak 5 ปีที่แล้ว ⁺²
Im watching now at the 9 min mark and just noticing why I forget stuff and only remember the important bits. If the AI knows what words are important it "knows" what words don;t matter. In a sense: Making AI forget makes it smarter. Hence, I forget a lot so I am smart.
@anandmoon5701 5 ปีที่แล้ว ⁺⁴
What if we start writing a Novel or Poem or Lyrics dose this transformers influence the way we think.
@STEMqueer 5 ปีที่แล้ว ⁺²
You where right on the initialization with zeros (checked my code). Please read some of the text examples on gpt-2 paper, the unicorns ones it's so credible.
Good video, congrats!
@sehbanomer8151 4 ปีที่แล้ว
or a trainable vector
@nickbeaumont2601 2 ปีที่แล้ว
lol at all the festival wristbands he's keeping
@oliverdowning1543 4 ปีที่แล้ว
So does attention remember where to look at rather than what you’re looking at so you just have to remember the words’ “addresses” in the text? Is that right?
@sevdev9844 4 ปีที่แล้ว
If I want to have a system on my own computer, not a lot of instances on a server, would the attention based model still be better than a transformer? Difficulty/Ressources/other
@taylorchu 5 ปีที่แล้ว ⁺⁷
I watched this video on my phone on my phone on my phone on my phone on my phone
@nestycb6702 4 ปีที่แล้ว
I think a great problem about these models is that they rely on morphology. After "say", the two main possible words that may appear are only words with gramatical meaning like "to, with..." or a personal name. This predicts how small groups of words may work together, but is clueless about the overall structure of a sentence. Generative syntax is making a lot of progress in this issue, thus making it possible to program how the structure of language itself works in each language. Although, let's not forget, even past said problem, that would be not enough as it would bring out the importance of function in language.
@starlight_garden ปีที่แล้ว
0:50 error in subtitle.
@thelostlinuxlawngnome5437 ปีที่แล้ว
Great video
@ultraaquamarine ปีที่แล้ว ⁺³
👀 2:45 👀
@rikuown 5 ปีที่แล้ว ⁺²⁸
I know transformers, I've seen all the movies.
@mj6463 4 ปีที่แล้ว
Imagine if all the transformers’ speech was written by the unicorn ai, I’d watch that in a heart beat
@yassinebarhoumi6973 4 ปีที่แล้ว ⁺¹
19:56 now you know with GPT-3!
@zekeluther 4 ปีที่แล้ว
Would like to hear about Performer..
@ulissemini5492 4 ปีที่แล้ว ⁺⁷
whos here after GPT-3 becomes self aware?
@adrianalexandrov7730 2 ปีที่แล้ว ⁺¹
No one, as it hasn't. Still
@tristanmisja 2 ปีที่แล้ว ⁺¹
It won't.
@lukeusername 5 ปีที่แล้ว ⁺¹³
I am in the morning but I can't find it on the phone screen on the phone screen on the phone screen on the phone screen on the phone screen on the phone screen on the phone screen on the phone screen on the phone screen...
@tamasdemjen4242 5 ปีที่แล้ว ⁺³
I think it might be an advantage to think in terms of phrases, instead of just putting words together. If you've learned a foreign language before, you know that the key to fluency is using native speakers' sentences as templates. I would first choose the subject, verb, object triplet, then try to build the rest of the sentence around those using common idioms. However, it becomes significantly more complicated to author an entire article in a coherent way. Also consider what gives speech that unmistakable human quality, such as the use of filler phrases, like "as a matter of fact", "to be honest", "to put it simply", and so on. Then instead of just asking where the train station is, we would say "I was wondering if you could tell me where the train station is." In fact, this is how I learned English, first studying the sentences of native speakers, then varying some of the words a bit, until I became confident enough to say just about anything I wanted. This may be a bit more difficult in Hungarian, Finnish or Estonian, where the word order isn't fixed, and the words are strongly agglutinative. You miss a single suffix, and you miss the whole meaning. But I'm sure they all boil down to common templates and patterns as well.
@tamasdemjen4242 5 ปีที่แล้ว
As others have pointed it out, ideally the neural network should discover these complex rules automatically, without engineers having to manually input them.
@sdmarlow3926 5 ปีที่แล้ว ⁺⁹
Transformers: AI in Disguise.
@acwaller1000 5 ปีที่แล้ว ⁺⁷
AI play chess
------>
Deep Blue
AI play Go
-------->
Alpha Go
AI write song lyrics
--------->
Darude Sandstorm
@videooblivion 9 หลายเดือนก่อน
13:05 relevance realization, cc: John Vervaeke
@RaphaelAnthony ปีที่แล้ว
Hi, 2023 here, it’s working. Taking over all industries. Now it’s learning how to give the program specific and detailed instructions for best results.
@rogeriogouvea7278 ปีที่แล้ว ⁺⁴
They were praising GPT-2 that looks like garbage now with GPT-3, things are really moving fast with AI.
@Mageblood ปีที่แล้ว ⁺³
GPT-4 today
@mosef312 4 ปีที่แล้ว
I am a little thing called love to see if you have any questions or concerns please visit the plug-in settings to determine how attachments are handled the situation I'm not sure if you have any questions or concerns please visit the plug-in settings to determine how attachments are handled the situation I'm not sure if you have any questions or concerns please visit the plug-in settings to determine how attachments are handled the...
@autohmae 2 ปีที่แล้ว
0:57 that aged like fine whine. DeepMind Gato uses transformers.
@Micetticat 5 ปีที่แล้ว ⁺³
Are the English subtitles of this video autogenerated? In that case we have an example of a very good language model in action!
@adiero 2 ปีที่แล้ว
as clear as mud is clear as mud is clear as mud is clear as mud.
@testeeduandsx5957 5 ปีที่แล้ว ⁺¹
chomskys tree would tell which parts to skip and make the beguining of the phrase affect the end.
@SuperOnlyP 4 ปีที่แล้ว
Attention is all you need 14:28
@jonsnow9246 3 ปีที่แล้ว ⁺¹
He looks like Dwight's brother from "The Office"
@Great.Milenko ปีที่แล้ว
I've been using novelai over the last few days, it's come a long way since this video, it can write incredibly natural text, great for DND worldbuilding.
@dustinking2965 5 ปีที่แล้ว ⁺¹
I'm curious how attention works.
@MegaBlizzardman 5 ปีที่แล้ว
GPT-2 repeats it self all the time. I don't think it's necessarily a markov chain here.
@HypnosisLessons 5 ปีที่แล้ว
it doesn't if you decrease the temperature and increase the k_words, also you can remove the duplication and then re-prompt.
@vertxxyz 5 ปีที่แล้ว ⁺³
The editing around the introduction of the attention could have done with some work. It got quite repetitive, and it was worsened for me because I heard it as "a tension" so it was babble
@WhatABanana ปีที่แล้ว ⁺¹
On the phone screen on the phone screen on the phone screen on the phone screen

ต่อไป

เล่นอัตโนมัติ

GPT3: An Even Bigger Language Model - Computerphile