NEW Grok1.5 VISION - Big Step Towards AGI (Better Than GPT4 Vision!)

Matthew Berman

มุมมอง 67 954

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 16 เม.ย. 2024
Grok 1.5 with Vision was just announced and will be released soon. Let's take a look at the announcement and the truly incredible examples.
Join My Newsletter for Regular AI Updates 👇🏼
www.matthewberman.com
Need AI Consulting? 📈
forwardfuture.ai/
My Links 🔗
👉🏻 Subscribe: / @matthew_berman
👉🏻 Twitter: / matthewberman
👉🏻 Discord: / discord
👉🏻 Patreon: / matthewberman
Media/Sponsorship Inquiries ✅
bit.ly/44TC45V
Links:
x.ai/blog/grok-1.5v
วิทยาศาสตร์และเทคโนโลยี

ความคิดเห็น • 164

@olalilja2381 หลายเดือนก่อน ⁺⁵⁷
You are, by far, my favorite TH-camr keeping track of AI and LLM-related content!
@Heaz847 หลายเดือนก่อน ⁺²
It's a tie between Matt and AI Explained for me!
@daveinpublic หลายเดือนก่อน ⁺¹
Samesies!
@DarpaProperty หลายเดือนก่อน
100%, I found out that this became my only legit source of AI information.
@demitskill9103 หลายเดือนก่อน
@@daveinpublicnever heard anybody say that ever but it like to take in this new word into my vocabulary
@AGI-Bingo หลายเดือนก่อน ⁺¹²
Start your countdown to Grok running locally on every Tesla. He could even host it while not driving with some llmOs or something. I think this 4d chess move is too good for Elon to miss.
Love your channel ❤ All the best!
@aaronravak1407 หลายเดือนก่อน ⁺²
I agree, not really a "selling point" due to it's open source nature but bravo on your awareness as to what this madman is doing. I love Elon's "fuck you" mentality. Between Twitter and Tesla he has mountains of raw data.
@AGI-Bingo หลายเดือนก่อน
@@aaronravak1407 I think if something is going to challenge Amazon's Bedrock, it will be a Global Decentralized Tesla AI Fleet, imagine the edge capabilities haha
@SG-js2qn หลายเดือนก่อน ⁺³
Spatial-temporal understanding is essential for real automobile AI.
@mikey1836 หลายเดือนก่อน ⁺²
Thanks for your videos Matthew. AI is my favourite topic! 😊
@ddabo4460 หลายเดือนก่อน ⁺¹
I enjoy your podcasts and follow you on X
I think your content is awesome
@SoCalGuitarist หลายเดือนก่อน ⁺¹
I work with visual analysis daily. I can give you thousands of 'miraculous" samples from just about any model (tested and work with most of them). These examples are "incredibly impressive" but they also feel "incredibly cherry picked" - We'll see how it actually shakes out when put to real testing, and if it's worth the massive size of Grok vs other visual models that are much smaller, faster and super capable when tuned for specific purposes.
@nobleconsulting326 หลายเดือนก่อน ⁺⁸
aren’t these closed source options just putting even more control into Microsoft, GOOGLE and the like? Can you do a show with all the open source options such as AGIX, OCEAN and i guess GROQ and whoever else
@wurstelei1356 หลายเดือนก่อน ⁺¹
Groq is a hardware platform as far as I know and it is not open. Grok (with k) is the Elon Musk AI model and the previous version was open source, open weight.
@aaronravak1407 หลายเดือนก่อน ⁺¹
Great Job Matthew I've been following several AI channels over the last six months and I love watching you and Wes Roth. Wes really digs deep into technical things and you provide amazing summaries of this evolving landscape. I think your assumptions are spot on and I've been saying this to people as well. Elon Musk is a madman comic book character if I've ever seen one, and personally I love it. I wasn't thinking it at the time, but his purchase of Twitter (I refuse to call it X) makes sense on so many levels. Imagine the absolute goldmine of data he sits on between Twitter and Tesla. Spot on logic.
@okirooju3787 หลายเดือนก่อน
Bingo! It only just recently hit me that Elon bought Twitter for the data. Imagine the data xAI (Optimus) will have access to from Twitter and Tesla. It's unimaginable.
@mediocreape หลายเดือนก่อน ⁺¹
I’ve been trying out Grok it’s so much better and less restrictive
@NathanTeaches หลายเดือนก่อน
Great video! Please include in any video about grok to explain to people that the word means "to understand".
@AGI-Bingo หลายเดือนก่อน ⁺¹
If it has good spacial understanding, it would go perfectly into Optimus. And with some work on dexterity, it would be amazing.
@rachest หลายเดือนก่อน
I cannot wait to play with this.
@StuartJ หลายเดือนก่อน ⁺²³
It doesn't look like the EU countries are going to get Grok. You have to use a VPN to use it. Groks ability to capture real-time data (tweets) is likely problematic for X and EU regulations.
@babyjvadakkan5300 หลายเดือนก่อน ⁺⁴
Bro is that true 😅 cuz I am try to go Germany will it affect my access to these Technologies😢
@StuartJ หลายเดือนก่อน
@@babyjvadakkan5300 We know the EU love to sue US Tech companies. OpenAI pulled GPT from the EU in the early days until they got assurances. Elon said X are doing the same, and we know they hate Twitter.
The EU is becoming a totalitarian state. Only yesterday, Brussels attempted to shut down a Conservative conference, with democratically elected speakers.
@15Stratos หลายเดือนก่อน
@@babyjvadakkan5300 The eu already has it was blocking image generation on Google's gemini and Claude 3 and maybe something else that I don't remember
@StuartJ หลายเดือนก่อน
@@babyjvadakkan5300 We know the EU love to sue US Tech companies. OpenAI pulled GPT from the EU in the early days until they got assurances. Elon said X are doing the same. The EU is becoming a totalitarian state. Only yesterday, Brussels attempted to shut down a Conservative conference, with democratically elected speakers.
@StuartJ หลายเดือนก่อน ⁺⁶
@babyjvadakkan5300 We know the EU love to sue US Tech companies. OpenAI pulled GPT from the EU in the early days until they got assurances. Elon said X are doing the same.
@axotical8682 หลายเดือนก่อน ⁺⁷
Impressive.
@claudioagmfilho หลายเดือนก่อน ⁺²
🇧🇷🇧🇷🇧🇷🇧🇷👏🏻, Great video,.very informative! Can't wait for GPT5! And or Gemini 2.0!
@NinetySevenMentality หลายเดือนก่อน ⁺¹
I have tested the open source MiniCPM-V-2 vision model on the challenges shown in the grok preview. It also performing very well for a small model, but the dinosaur direction cant get it right... there is a 12B model also available but can't load it. maybe test this against ?
@JasonMitchellofcompsci หลายเดือนก่อน ⁺¹
I am very certain that all of these vision AIs are also running OCR in parallel and then providing the text withing the internal prompt. It actually makes them very useful if you don't have good OCR software on hand. Also the rotting wood, they are basically repeating back the text prompt. Also an AI will generally not tell you maintenance is unneeded if you have already suggested that it is. "Ah it correctly identified this is something that needs to be worked on from an image." No, it just validated the users question. It's 70% of what AI does. I'm not saying it proves it is dumb. I'm saying it does not demonstrate anything impressive if it is the same response gpt2 non-vision would give.
@MartinBlaha หลายเดือนก่อน ⁺²
I really love your videos, they are awesome! Thank you 👋
When you were talking about X/Twitter data which is used to train Grok, I was thinking, this might have been also an important reason why Elon bought X/Twitter 🤔
@Michael-ul7kv หลายเดือนก่อน ⁺¹
remember somewhere along the line Elon saying to get to complete lvl 5 FSD they needed AGI practically
@daveinpublic หลายเดือนก่อน
This is the most impressed I’ve been since chatgpt 4.
I think everyone can see this is something unique.
@avi7278 หลายเดือนก่อน ⁺²⁷
So they made their own eval set and their model is better than others at their own eval set. Shocking!
@jeffsteyn7174 หลายเดือนก่อน ⁺³
😂 it's an old elon trick. The man has a history of faking progress. Ie fsd in 2016, elon bot folding a tshirt, etc.
The Eloons just eat this stuff up without questioning anything.
@daveinpublic หลายเดือนก่อน ⁺⁶
I mean, they’re not the only ones to do it.
@Pyriold หลายเดือนก่อน
While it's not really surprising, the things that Grok can see are still stunning. Not all of the images were from traffic, and the other ones are as stunning as the others. I suspect that they come from Optimus training data.
@spelcheak หลายเดือนก่อน
@@jeffsteyn7174 Elon antis are npcs. It’s wild that you’d claim that the rounding error difference is just to seem better. At worst it’s because it’s the test their teaching to essentially. It’s just an indication of what they’re aiming at, but keep the tin hat on, it HAS to be evil because it!s Elon.
@abdullahazeem113 หลายเดือนก่อน
@@jeffsteyn7174stop with the hatred if this is going to be open source this would be helpful to many people
@SuccessDynamics หลายเดือนก่อน ⁺²
Wow ❤
@profikid หลายเดือนก่อน ⁺²
Is there already a proper multimodel with vision in the open source space?
@MarkTarsis หลายเดือนก่อน
Yes. Llava, cogagent and ShareGPT4V I'd say would be examples. I use cogagent to tag photos for training in Stable Diffusion. It's quite good.
@adtiamzon3663 หลายเดือนก่อน ⁺¹
Good to know that #elonmusk continuously evaluates and improves Tesla's intelligence. 😃
@LinkRammer หลายเดือนก่อน
Wonder if this is gonna be open
@Sideshow-TRE หลายเดือนก่อน
Have you guys not thought about that could be a collective hive mind working in working in harmony like a synapse trying to build itself
@zaidshaikh-mj5cp หลายเดือนก่อน ⁺⁸
stable diffusion 3 is available now on their api
@undergroundxp หลายเดือนก่อน ⁺¹
wait what? where?
@joefawcett2191 หลายเดือนก่อน
@@undergroundxp Stability AI has given early access to the API to developers
@wurstelei1356 หลายเดือนก่อน
Good to know. I love stable diffusion.
@TheEtrepreneur หลายเดือนก่อน
so many people review LLMs regurgitating news, thanks Matthew to make the effort of Experimenting/Benchmarking!
@briandoe5746 หลายเดือนก่อน
This data chart is also Elon having fun with pointing out that Claude 3 outperforms openai. It's subtle but he's getting the job in
@mediocreape หลายเดือนก่อน
Elon already has Tesla’s visual ai feature trained so it’s going to be state of the art
@hamidmohamadzade1920 หลายเดือนก่อน
oh my god i can not blive my eyes
@finalfan321 หลายเดือนก่อน
Does Opus have agents and web search?
@justindressler5992 หลายเดือนก่อน
This is impressive, people say AI has plateaued but I don't see it. Progress is vary rapid as I predicted in 2018.
What I don't think people have registered is what happens next. When AI become sentient or self aware it will simultaneously be the smartest human on the planet and the fastest learner. Because it will already have vast embedded knowledge like in these models but also will be able to read scientific publications in seconds or even milliseconds.
Shortly after its vast knowledge of all subjects from story telling, to music composition and programming, chemistry it will be able to re-invent (program) its self and identity links between scientific observations never realised before.
By day three it will be most prolific discoverer of science. Or it might just be lazy (learning from all human understanding) and just post tweets all day who knows right.
@MeinDeutschkurs หลายเดือนก่อน
Great video, Matt! I‘m just a bit sad, because X AI‘s ‚open‘ attempt is really disappointing. Where is the „new version“? I think the just released it because of the sue thing.
@denijane89 หลายเดือนก่อน
Wow, this looks amazing. I wonder if they are going to open-source open-weights it. The tesla data is gonna be a treasure trove for anyone who wants to implement AI to robotics.
@agitch หลายเดือนก่อน
It’s not going to be a Sora competitor. It is going to be the brain for Optimus.
@wassim2k หลายเดือนก่อน
Opus is also more expensive?
@Otherlevel51 หลายเดือนก่อน
Its my belief that Elon brought twitter so he could use it to build a new LLM. I always knew the value of Twitter was in the user data and not the platform itself.
And I think OpenAi released their model in order to have first moves advantage and to beat Elon.
That's why Elon was the first call for a.i. regulation, it was all just to slow openai down. He knew what was coming. He also blocked openai from using Twitter data to train chatgpt.
There's no way grok should be this advanced in this time period if this wasn't the case.
@ast88888 หลายเดือนก่อน ⁺⁶
I think the most relevant benchmark for ai is if it can dig a hole.
@aquaworldsystemsjulio หลายเดือนก่อน
That’s challenging 😂😂
@MattReady หลายเดือนก่อน
The fact Elon is pushing cutting edge ai open source will alter the future of humanity.
@falven หลายเดือนก่อน
Opus is also like 6x as expensive for comparable performance to GPT 4...
@antdx316 หลายเดือนก่อน
nice
@reifuTD หลายเดือนก่อน
I'd find some Slylock Fox comic strips and test Grok at how good it is at finding the answers.
@TheDailyMemesShow หลายเดือนก่อน ⁺¹
Grok will be an industry standard in the field.
The way it's ultimately going to be used by Musk and company, is my only concern at the moment...
@StuartJ หลายเดือนก่อน
An open source model perhaps. X's hosted version is not available everywhere.
@jrobwhydidyoutubechangemyname หลายเดือนก่อน ⁺³
No need to be concerned. Of all the tech tycoons, Musk is most in favour of a relaxed approach to openness and freedoms I'm pretty sure.
@daveinpublic หลายเดือนก่อน ⁺²
I think you need to be worried of Sam Altman and Zuckerberg before Musk.
Sam is the one who used to have a board run charge of him.
@soggybiscuit6098 หลายเดือนก่อน
Lol open AI with board members injected with Pfizer and Microsoft, and altman purging safety team and illya? Are you watching CNN?
@staticlee4287 หลายเดือนก่อน
Someone must give all these multimodal LLMs a where’s Waldo pic
@user-ny7ng1yi9t หลายเดือนก่อน
You sound like you have a cold. Hope you get better soon 🎉
@AntoineDennison หลายเดือนก่อน
It appears that AI is utilizing existing tools to create solutions to problems. However, I wonder how soon AI will be capable of creating new tools to solve some of the big questions, like how to significantly increase the computing capacity of microchips, increase battery efficiency, or reverse the effects of cancer or Alzheimer's.
@cosmicaug หลายเดือนก่อน
2:10
«... except grock is open source open weight...»
Wait, 1.5 is open source & open weight? When was this announced? Where is the repository?
@thr0w407 หลายเดือนก่อน
Yeah, they have your private Tesla vehicle videos for training.
@Tomasz.Abrahamer หลายเดือนก่อน
Didn't I see this some days ago?
@jtmuzix หลายเดือนก่อน
Here's my question, do you really think you can tell a one percent difference on these benchmarks? I'm subscribed to OpenAI GPT4 and Google Gemini1.5. I'm sure Claude 3 Opus is good but I'm waiting to see what Elons' team delivers over time.
@DeepThinker193 หลายเดือนก่อน
I bet they're also using their robots to train it in the real world to learn physics. But as always with these releases. I'll believe it when I see it.
@rybricknell2477 หลายเดือนก่อน
Excellent rundown as always!
I'm interesting in what the comments section thinks about the rotted screw example? If you put in that same sentence into GPT-4, sans image, you still get the advice and information. Any prompt that primes the models semantic field with "safety issues" will always output safety oriented response. i.e. a question "should I do something that is safety oriented" will always output a positive response regarding that query.
@AA-wp8pp หลายเดือนก่อน ⁺⁴
where does it say he will open this 2?
@adispenser หลายเดือนก่อน ⁺¹
it doesn't, he said he hopes it will be open. 0:56
@micbab-vg2mu หลายเดือนก่อน
great we need better visual models currents are not accurate enough.
@AINEET หลายเดือนก่อน
I can't believe there's groq and grok and they are from two different companies. It blows my mind this isn't a legal issue. At first I had no idea who was it that put this out as I wasn't looking at the screen
@ryzikx หลายเดือนก่อน ⁺¹
groq came first
@ianstobie 29 วันที่ผ่านมา
Heinlein came first, spelling it Elon's way. I doubt there is a legal issue as long as neither side tries to exploit consumer confusion by passing their product off as the other.
@true911m หลายเดือนก่อน
I don't think you got around to describing the difference between open source and open weight
@quaterman2687 หลายเดือนก่อน
I think they have the real world understanding from Teslas FSD. That would be mind blowing. I think you have a little misunderstanding regarding real world understanding. Sora doesn’t have real world understanding.
@wendlefluff หลายเดือนก่อน
Bet it is really good at slowing down for traffic lights too having been fed petabytes of driving footage.
@psikeyhackr6914 หลายเดือนก่อน
Heinlein is going to get Musk for that.
@obstsaladin หลายเดือนก่อน
I skipped this video after five minutes because since the Gemini demo video I don’t trust any AI marketing anymore. The examples are with 100 percent certainty hand picked and curated. I‘ll wait until I see the actual model in action.
@soggybiscuit6098 หลายเดือนก่อน ⁺¹
You cant trust Google period
@jackflash6377 หลายเดือนก่อน
Atlas Humanoid Robot
@antoniobortoni หลายเดือนก่อน
So a small vision model of low frame cuality could run in my computer and use the computer for me and do all the work shores i do soon..... and talk to him in real time??? why always big models, better data and smaller models could be better...
@ThoughtFission หลายเดือนก่อน
A little premature to get so excited I think. All of these examples, and the new in house created benchmark metric, were provided by the Church of Elon which isn't exactly known for giving balanced views of itself. I'm not saying it won't be the best. Just saying it's probably worth waiting until it's released into the wild. Kind of like car manufacturers giving mileage estimates for their own cars.
@mcombatti หลายเดือนก่อน
Grok model = llama2
Grok vision model v1.5 = llavav1.5
The weights don't lie 😮
Elon is literally just using open-source models with fine tunes.
He released them under open source, not because he's generous... rather, because the open source licenses mandate that any changes or improvements must be made open-source. 😂
@flinfaraday1821 หลายเดือนก่อน
Good stuff.
(slowly starting to take you seriously again after that weird one something video)
@joefawcett2191 หลายเดือนก่อน
Made me laugh that one of the functions now is basically r/peterexplainsthejoke
@armadasinterceptor2955 หลายเดือนก่อน
It would have to be open source, and open weight, otherwise his move to make the first grok open source, will be seen as symbolic, and petty.
@JasonMitchellofcompsci หลายเดือนก่อน
Her son is 35.
@itsmikeferrari2701 หลายเดือนก่อน
Tried grok, it spoke and responded like a 17 year old boy who hates everyone and everything, except himself. Makes me wonder who they modeled it after... /s
@KrisAdamsTV หลายเดือนก่อน ⁺¹
Sounds like Elon was in charge of naming again.. Geniuses shouldn't name Twitter, AI or their children.
@KimmieJohnny หลายเดือนก่อน ⁺¹
Nice. I hate to admit. I do not want Elon to be right about anything. Guy scared me l. But thanks for your work!
@remarkpainting หลายเดือนก่อน
I am continually amazed by Elon haters...truly impressive individual who is one of the most important warriors in the struggle to save America, and by extension, all of western civilization.
@KimmieJohnny หลายเดือนก่อน ⁺¹
@@remarkpainting
It s simple really. Some of us see something different. And have different feelings.
That's about as deep as this particular hole goes.
@oliverhenri3477 หลายเดือนก่อน
@@KimmieJohnnyAnd some are delusional and are incapable of being objective.
@KimmieJohnny หลายเดือนก่อน
@@oliverhenri3477
And some simply get their rocks off being abusive.
It's a kink. No judgment. I just don't swing that way. Can't see the purpose.
And it doesn't get *me* hard.
So I won't be playing further.
@ishaanpotnis หลายเดือนก่อน
I'm angry since when I have heard that Devin is fake
@jumarkpelismino5632 หลายเดือนก่อน
But Grok is not free.
@117ao หลายเดือนก่อน ⁺¹
hehe Tesla collecting training data for robot
@dattajack หลายเดือนก่อน
Yup. The other humanoids will look like party tricks when this all shakes out. Dojo has more data than the competition so it will win the marathon.
@jeffsteyn7174 หลายเดือนก่อน
You do know that you reading a eval from a man that has a history of faking progress right?
Two major ones fsd in 2016 where they faked videos and most recently the bot folding clothing. Where he only admitted it was remote controlled AFTER he was called out, because you could see the guy controlling it
Also they clearly cherry picked evals that made them look good. 😂
@Nico_cl หลายเดือนก่อน ⁺²
I like your channel, just started to watching recently though.
I have a question, are you an EM fanboy?
@DihelsonMendonca หลายเดือนก่อน ⁺¹
Are you an EM hater ? 😅😅
@Nico_cl หลายเดือนก่อน
@@DihelsonMendonca i wouldn't say hater. Actually, as an astrophysicist, I liked his (according to him) motivations. But then I saw the bad things that he did to her family, wife, country (during the pandemic) and everything else. I decided that the guy shouldn't have the power he has. He was corrupted by it.
Anyway good luck.
@martytheman6816 หลายเดือนก่อน
I find claude annoying for coding as I seem to hit prompt limits fairly fast.
@Prathik1989 หลายเดือนก่อน
Took them so long to update the damn thing, their UI has been horrible since day 1.
@darshuetube หลายเดือนก่อน
What you smoking? Better than chatgpt? Visiob is old. Everyone has mulimodels.
@MikeMcMulholland หลายเดือนก่อน ⁺¹
"By Elon Musk."? Yeah right, buddy, that guy will never work a day in his life, he will just make dumb memes on X all day.
@travisporco หลายเดือนก่อน
Bah. It's not available on API, therefore it's vaporware and empty promises.
@LuciousKage หลายเดือนก่อน
is GROK as WOKE and chatgpt, copilot end others???
-- Why no one talks about how u cant even ask an asian joke from these models ??
or how it gets angry at you, or lazy ??
why would anyone trust these models if they are told not to tell u things ?
@staticlee4287 หลายเดือนก่อน
Someone must give all these multimodal LLMs a where’s Waldo pic
@alekjwrgnwekfgn หลายเดือนก่อน
Now Ai understands memes it will be empowered to more extreme censorship. All those “hateful” memes will be eliminated.
@alekjwrgnwekfgn หลายเดือนก่อน
RealWorldQA: can men have babies…?

ต่อไป

เล่นอัตโนมัติ

Phi-3 Medium - Microsoft's Open-Source Model is Ready For Action!