Project Astra: Our vision for the future of AI assistants
ฝัง
- เผยแพร่เมื่อ 13 พ.ค. 2024
- Introducing Project Astra. We created a demo in which a tester interacts with a prototype of AI agents supported by our multimodal foundation model, Gemini.
There are two continuous takes: one with the prototype running on a Google Pixel phone and another on a prototype glasses device.
The agent takes in a constant stream of audio and video input. It can reason about its environment in real time and interact with the tester in a conversation about what it is seeing.
Learn more about Project Astra: goo.gle/3wAUwFh
#GoogleIO2024
Watch the full Google I/O 2024 keynote: th-cam.com/users/liveXEzRZ35u...
To watch this keynote with American Sign Language (ASL) interpretation, please click here: th-cam.com/users/live6rP2rEWs...
#GoogleIO
Subscribe to our Channel: / google
Find us on X: / google
Watch us on TikTok: / google
Follow us on Instagram: / google
Join us on Facebook: / google - วิทยาศาสตร์และเทคโนโลยี
Thrilled to be able to share with everyone how exciting Project Astra really is. Drop a ✋if you're ready to give it a go, and learn more about it here: goo.gle/3wAUwFh
me too
is this an actual demonstration, or is this to show what ur working towards in the future?
What will happen if we give this to very young children? What's your position on young children being educated by Google AI instead of their parents?
✋
This would be a Godsend for Blind and Low Vision users!
Yea
You're right.
Yea, but it takes Advanced subscription.
Yeah, at the end of the day it seems to me that the most sensible application for all this AI stuff is 1. portable secretary, and 2. medical device for the perceptively impaired. Although for 2. it would be pretty cool to eventually just develop bionic eyes and finish the job on hearing aids.
Not that you are necessarily wrong, but at this point that sentence has become an I meme lol.
google glasses are back babyyy 😎
I also thought about that lol.
Google glass you mean
@@syrus3k it's the same thing lol :)
Don't read my name!
It should have never left
Google glasses : as you can see, I’m not dead
Yibambe!
@@BarelyAverageDude beha
They resurrected these in 30 seconds as far as I'm concerned 😂 take my money? (maybe)
My death was... greatly exaggerated.
Google glasses were born 10 years before the Machine Learning and speech synthesis developments catch the today's capacity.
The good part of it is that they have the MKT hardware ready to refrit and sell it. They just need to give them a little twist to nuts and bolts!
Open AI's GPT 4o and now this. I'm loving this AI competition
Are you?
@@Nicdehouwer I'm loving it too! AI is awesome!
This isnt finished this is a "vision for the future" aka what they want to be able to do not what they are able to do
@@iCarIy i mean they did say they did it in real time so i think it is a real product. they def picked and chose what to show for the demo but still.
@@ZaynomR where did they say that?
What a time to be alive
We don't know the future of AI.
Skynet is already here.
two minute papers reference?
The reference is crazy lol
I don't know. There are better times. MUCH better
The ability to read and understand schematics is a gamechanger
Also being able to understand abstract toys' breed and drawings sounds so cool
I’m skeptical it did more than read the words Server and DB and ignored the rest, as if those were the only two words on the board it would surely have the same answer, so this particular video isn’t really testing its ability to understand schematics.
The ability for a dog and a soft toy to form a band is a game changer. AI can do anything these days.
So glad you think so!
@@corylong5808 I mean adding a cache is the most easy answer ever
@@Google creep
This is a big leap forward. Congratulations!
You sure about that? Google are notorious for faking videos. Look at last time they showed us a video of 'real-time' things. It was all fake. For a company that have no 'motive' they sure do seem to be trying their hardest to try to impress people and sway over OpenAI's fans.
We're always learning and improving - thanks for being on the journey.
@@Google :D
@@Google Even Google is going for parasocial relationships!? /j
@@Google What a way to tell everyone I'm right and you faked the whole thing.
Google Glass has RE-entered the chat😎
Using those glasses on while I'm reading a technical math or programming book or online and asking questions about what we are seeing would be a game changer. Imagine trying to learn a new math equation or algorithm and asking the glasses to explain it while looking at it. Learning would be dramatically improved.
This would really be amazing to have!
being able to unpack parts of a formula/symbol in real time would be something
Learning would be dramatically impaired. A person learns when it hurts, when it's hard to get it.
@@dwat3r a person learns what benefit him/her most.
@@dwat3ridiot
hope that atleast this thing will not fool people as your previous AI documentation regarding video input
yes google has ruined their reputation on ai presentation
hi @google can you remove these comments please they are ruining my chat experience
@@fergalhennessy775 snitch.
@@fergalhennessy775 You mean they contain facts
@@fergalhennessy775 "Hey google, I don't like to be reminded that you lied multiple times from GNoME to Gemini, can you ban those comments? I don't have a reason but why not though? After all, I prefer being a sheep instead of thinking about the ads I watch. It's much simpler to take everything for granted than question them."
Google Glasses making a strong comeback indeed
Still waiting for the google glasses, thanks.
from 2012? lol
They are the rayban meta glasses
That was an incredible demo! The glasses especially blew me away.
Wow, it's amazing how quickly the AI has progressed! I don't think anyone anticipated it becoming this good so soon
Your enthusiasm means a lot - thanks for being on the journey with us.
Why do I get the feeling that it's Gemini responding to comments? 😂
I do think the google glass was ahead of its time and now is the ideal time to rerelease it especially after Apple and Facebook sudden interest in this market. Greetings from 🇭🇹
We hope that these features will be available in Egypt. We will always be the last people to receive these amazing features. I am a big fan of the basic Gemini system. Thank you. ❤
Google assistant + Google lens + Gemini + Glasses = Perfect combination
What excited you the most?
Gotta go with glasses
@@Google being able to automate tasks on my phone with my voice.
Remember, when something is free, you're the product. You will hand over your data (video recording, like the glasses in this clip) willingly.
I rather wait for open source local models to become this advanced.
The problem is you need chips the size of a small pizza that dump 25KW of heat to train these things. Do you have one of those?
@@LarsLarsen77 No, you don't need one of those in the future. Harvard Mark II mass was 23000kg and that thing did barely any calculations by today's standard.
AI models will become much tinier when we learn to innovate (removing deadweight parameters as example), tech always aims for efficiency. I can already today run a small llama3 model locally which is not that far off from GPT4.
I think it's ok if we are the product. Atleast we have freedom to use it or not. Like most social media in internet. We control the tech or the tech control us, it's our choice.
youtube is free dude, so is google search, so is Gmail, so is the internet.
4o seems to be faster with responses and the voice seems to be more human like
but they don't have glasses YET
yea 4o seems to have a much better understanding of tone and inflection which makes it really powerful when it comes to speech. This just feels like a simple TTS of a chatbot
The fact that it remembered the glasses was pretty impressive though. Not sure if 4o could do that
@@madhavraghuI just think they turned up the dail on informal word choice and happy intention. It's almost to optimistic... This just sounds practical, to the point. Just another choice of how to dail in your AI personality. I bet Google could also make it react as your best buddy.
@@Zaphod_well 4o remembered that someone walked behind the Host. Made funny gestures and left. I think it can remember the glasses
Damn this would be so useful for people that keep losing their stuff
Good catch, we're super excited about that!
This has the potential to help so many people and could save lives. So many possibilities, can't wait to see what the world will look like in even a few years!
We're always learning and improving - thanks for being on the journey!
Wow, that truly is amazing! Well done Google. The way it remembered where your glasses were while you were just scanning over the room is insanely impressive!!
We're excited for you to try it!
@@Google Can't wait!!
The real world application of the Needle-in-a-Haystack memory test
Okay, you got me Google. I'm genuinely impressed. This would be revolutionary for people with visual impairments.
Mind blowing. 🤯 I want these glasses. 😊
Me too
To do what exactly?
@@bombombalu Can experience iron man's live.
@@bombombalu Anything you want.
@@bombombalu Also, you can use this in exams as regular glasses. 😆
THIS IS EXACTLY HOW I IMAGINED A PERFECT AI ASSISTANT!!! This does precisely what I would want it to do.
Glad to hear you're excited!
@@GoogleI’m glad you’re answering comments here, huge respect to the community managers and the one who made this decision!
@@mark_shagal you know its a AI, right?
@@DreamingConcepts you're sure it's not a community manager?
@@mark_shagal yes, unless there's hundreds of community managers to answer all comments on all videos 24/7
This is amazing! love to see how far things have come from glass
Guys!! Please make Gemini better at handling mundane tasks like setting reminders and alarms etc. That's the only reason I'm waiting on adopting Gemini on my pixel
The link on the description doesn't work.
Type the URL into google or google chrome
This explains why OpenIA made their realease
Oh, finally a multimodal on Gemini for real I have ever seen on this video! 😢
Hats off to the teams! Exciting times, indeed.
Thanks for being a part of this with us!
“our vision for the future” means they don’t have anything that does that. We had that vision in sci-fi decades ago. Put it in our hands and I’ll get excited about it.
Well played here! Excited to see more of this.
Ready to make magic happen together! ✨
Very cool and very unnerving.
I can't wait for the ability to give an AI a stranger's photos and ask the AI to pinpoint where the photo was taken.
Or to casually walk passed someone at their computer and ask the AI to memorize all the keystrokes they press when they're logging into their bank account.
But the limerick was cool :)
Very cool technology
Google is on fire!!! Enjoying my Pixel Phone and Waiting for those glasses now!!!!
Available to you this decade !
Gemini Vs Gpt 4.o.... You both are fabulous ❤❤❤
Love and great respect to all Great minds working behind this magic ✨✨
I really feel happy to see that everyday when you wake up you have something to look up to...and it is so fascinating that it makes me feel worth living for.
Your vision is already realized by yesterday's OAI announcement
Gpt4o is so much better but the presentation from Google was next lever boring!
Basically both were working on same things.
this looks a bit better than GPT4o but it is probably better for some things and worse for others.
It will become better at a faster rate though with the huge android user base
It's good that they're competing. Big tech is already monopolistic enough as it is.
So much potential! 👏
We can't wait for you to try it for yourself!
Woah, that last bit with Schrodinger's cat and Golden Stripes was actually quite impressive!
This is gold!! We want a golden stripes version of Assistant!
This looks like ancient technology bcuz of open ai's gpt 4o
noob
Looks dope!
Glad to hear you're excited about that one!
This is awesome if it works as presented, I'd love to try it myself and find out if it can actually piece together information from contextual clues and infer information from evidence that isn't conclusive. This is TRULY the future, holy schnitzels. Please be real!! And thank you if it is :) I'm ready to move on!!!
Just yes…. Hope and excitement is back in tech! I’m rooting for all the companies, present and future. Amazing work all!11
Couldn't agree more 🙌
This is so futuristic
Thrilled you're as excited as we are!
You have to release this Text To Speech voice now! We are tired of only one option since it was launched as bard.
It took the AI Revolution to revive the Google Glasses! I swear, as impressive as this demo was, it was overshadowed by the sheer joy of seeing the Glasses pull an undertaker!
this will demolish every smartphone on the market, if they include this on the next Pixel phone, hands down.
They will still need the phone to operate.
I never use assistant
Amazing 😍
We're just as excited as you are ✨
The race is on!!!! Gpt 4 o or Astra?
GPT-4o it sounded more natural
gpt4o clear winner. Just see the 'be my eyes' video from OpenAI, and what user experience they offered.
Stop thinking like that. Water or air? Do you listen to one band, like one TV show, use one app?
@@dougchampion8084 agreed
@@dougchampion8084just think of it as a capitalistic rivalry.
This is beyond imagination. Goodluck On your journey Google!
Innovation is a team effort! Thank you for being part of it.
What glasses are she wearing? I want a pair!
This will now finally complete all those DIY Iron Man helmets!
This is MINDBOGGLING! What a time to be alive! Quick question: I noticed that the mic was on mute when the presenter asked "what is that part of the speaker called" at 0:22. How did the AI pick up that command? Also, more demos of system design use cases please!!
When will we get access to this?!?! I wpuld love to try it!
Without a doubt one of the best commercials of the year.
We're glad you're loving it!
Google trying to catch up to the GPT-4o hype train
Waiting for Apple response 👀
@@djbombba Meet Apple Car 🤡
lol Astra product and its announcement was not made in a single day
what a noob comment
@@ondrazposukie but a quick response from google because of the Livestream yesterday is possible
So you guys have been putting that Focals by North acquisition to good use😍
Hey Google, I have always been a great admirer of your technology since my childhood. Now I am a graduate student in data science. Please could you provide me with free access to your Gemini advanced products ?
@CJScopeBetter
Having a slight Google Glass "I'm Alive" moment here, this is exactly how I imagined they could be when I tried them out 10 years back, congratulations Google team - cannot even imagine the work involved to get here.
Are are those Google glasses?
Really excited about this! The Great Sage in Tensura could come to reality for sure!
Yeah. Google glasses. I hope, it will has not any big camera again. But it's very very super with Gemini.
Google saw OpenAI drop GPT4o, and were like "..Oh. Well we better get moving" lmao. You love to see competition fostering innovation!
they had the keynote already booked and open ai dropped the day before, they were nervous tho😂
This is it. The stupidest comment on this video.
oh look, the rabbit r1 killer's already here
The r1 is pointless, they spend their money on the marketing
Scary but exciting at the same time.
The world will be completely different in 10 years.
12 years have passed since the first demonstration of Google Glass at Google I/O 2012. Augmented reality glasses are fast approaching. It's time to revolutionize the market and make smartphones a thing of the past like CDs or MP4 players.
Personalized matters🤔🤫😘
OMG HI GOOGLE❤
Impressive. I hope the longer you wear it, it can learn about your life and help you make decisions to optimise your life. Or it can motivate you and guide you through your day. You can talk to it how you’re feeling, it can analyse your sleep and workout schedule and tell you how much cardio/weights you should do to improve and will guide you through your workouts. Like a productivity lifecoach that is more proactive than a silly question-and answer AI. Want to save for a house in a certain area? Would be good if this thing guides you through it, live and has access to your bank account. Lol.
This will be just incredible for a visually impaired person i can't imagine the good this will bring and the glasses are back!!
HUGE! if true
False
If it's not fake and staged then a great product.
Wow... This is so cool. Im hoping AI will lead us to find more internet security and solutions against cyber threats
Very impressive. This is a *much* more meaningful demo (vs the ChatGPT-4o fluff roadshow)
Did Google just pull a Kendrick Lamar on OpenAI? Dope!
openai probably had insiders and decided to announce gpt4o before google io
its not even out and already behind the curve
really bad considering the resources google has
not even a live demo... tells you all you need to know
search also still sucks
google is a failing business dependent on advertising revenue
Google offered me a job once, turning them down was the best decision I've ever made. I'd rather chew glass.
You can obviously tell this is not a real demo, but it's an amazing concept. Can't wait!
It still a concept for Google ?and there openai has already created the Jarvis
It’s like Google forgot why Google Glass failed dismally
ai and better internet is a big change
These aren’t even the same product lol, the glasses just have a camera.
@@yzz9833 lmao, they failed because they had a camera and icked everyone out. Adding AI doesn’t change that
Amazing like only Google could
openai did this yesterday what are you talking about
Des lunettes qui mixte l'IA et la AR 🤩
Je sens que les 10 prochaines années vont être folle ^^
Goldn'Stripes was pretty clever. The person who thought to ask it about finding one's glasses needs to be memorialized.
this is pre generated btw, admission by google 'some content has been pre generated'
I saw a white man coding and even a white dog! Is this allowed in Google?
Don't worry, Google is an equal opportunity employer. They'll accept any dog, regardless of its fur color
@@DeepThinkingGPU Except white, that's racist.
figuring out your location just by looking out the window is insane
I always liked the idea of those google glasses and now they are back!
chatgpt 4o is better
GPT-4o 🤨
Google Glasses is the star of the show! They are getting rebuilt and they are coming back!
With this new technology, life will become a walk simulator. I would like to see some Standley Parable scenario when Gemini complains when it recommends you something and you do the opposite xD.
The last Gemini demo was faked so there's no reason to believe this wasn't also faked lol
My thoughts exactly 😂
Chat gpt 4.o is far far better
Thank you Mr troll 😄
🗿
Why, don't people see what game openai is playing they just lunch oneday before it and as it is first you are amazed and now Google one seemed just fast and open ai one game changing they know this strategy from start
regarding this feature; in what sense?
@@sandenium 😅
Gpt-4o vs Astra. Can't wait to try them! 😃
I am looking forward to using these glasses!
You can tell that they cooked up some random BS in just a day to show the investors that they got it too. OpenAI is dominating Google.
I really hope you make the voice customizable. The quality of the actual voice synthesis here is pretty lacking for someone that has used paid tts services. Hopefully that can be improved in the future but the functionality looks great. Looking forward to this when it becomes available.
Wow. This is such a scary and surreal time to be alive. The object identification feature will be super useful for me!