This is the calmest advertisement for something so game changing I've ever seen
So far I've given it 4 requests and it has come back with "Sorry, I wasn't able to generate the images you requested". Nothing difficult - for example, put a face on this banana.
I'm so surprised this hasn't gotten a lot of attention yet... but I mean, it has only been 20 minutes
I'm already going insane
Did it work for you?
@@austriasdaughterssons3617 it's not editing images for me, it fails and hallucinates
20 minutes these days is closer to 20 hours pre-AI time. perfectly understandable
just tried, didn't work
The most expensive part of training CNN models is labeling. This is a game changer for generating ground truth data for robotics and self-driving.
How does this change labelling issues? Am I missing something?
@dhruvbnaik Normally you need to hire many workers to create a labeled dataset, which is kinda expensive. With Gemini you can just create a generated dataset and have humans verify it afterwards.
Keep in mind you can prompt with more than just text. You can prompt with a bounding box and ask it to generate an image with a cat inside the bounding box. Repeat a million times and now you have a million photos of cats with known bounding boxes.
@@zyang056 isn’t that just based off previous labelling?
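The pipeline this thread describes can be sketched in a few lines. This is only an illustration - `generate_image` is a hypothetical placeholder for whatever image-generation API you would actually call - but it shows why the labels come for free: the class and bounding box are chosen *before* the image exists, so no human annotation is needed.

```python
import random

def make_sample(width=640, height=480, label="cat", rng=random):
    # Pick the bounding box first, so the ground-truth label
    # is known by construction rather than annotated after the fact.
    w = rng.randint(80, width // 2)
    h = rng.randint(80, height // 2)
    x = rng.randint(0, width - w)
    y = rng.randint(0, height - h)
    box = (x, y, x + w, y + h)
    prompt = (
        f"Generate a {width}x{height} photo with a {label} "
        f"inside the bounding box {box} (pixels, x1, y1, x2, y2)."
    )
    # image = generate_image(prompt)  # hypothetical image-generation call
    return prompt, box

prompt, box = make_sample()
```

Whether the generator actually respects the requested box (and how often) is the part you would still have to verify with humans, as the parent comment notes.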
So, I'm guessing this image generation feature isn't available to regular users at the moment? I tried using Gemini 2.0 Flash Experimental in AI Studio to generate some images, but it kept just describing the image instead of creating it.
Same here !!
Same, I haven't been able to find it in AI Studio. I can't wait!
0:35 It seems that it is not open to the general public yet.
A few hours after the video came out, it claimed to be creating images, but the imgur links it gave me were blank. I figured I'd give it a couple of days to iron out whatever issue that was, but now it says it doesn't do images at all. I'm disappointed. I was excited. It now says that it may have hallucinated an ability to generate images. We're living in interesting times for computers.
@@VtuberSpace early testers within AI studio is what we thought it meant... I guess not.
The combination/blending feature is incredible. I can imagine many use cases for creatives.
Wow, omnigen finally has some competition !!!! So excited
omnigen is bad quality ime
This is very impressive, especially considering the simplicity of the input prompts
If it worked it would be.
Okay that’s completely next level
I see the incredible capability of the model! The model follows the instructions exactly, with very good accuracy, latency, and image quality. Hats off to the research team. 👍👍
I tried to do pretty much the same thing with a picture from my laptop, and it doesn't generate anything, it gives me text, especially on Google AI Studio and the Gemini web app.
Same for me (Germany). Only generates text
@@dominik4496 Yup, same - it gives me code, not an image output
@@simtangaranvijay273 it sometimes gives you the image, but the image is blocked by Google content moderation : (
Because this feature is not available yet, and will be released next year. Currently it's only available to early testers.
US and UK only for now, I believe.
It is not currently working with the Gemini 2.0 Flash Exp model on AI Studio.
ur right - I tried it as well, it just returns a bunch of metadata instead of an image
same
Read video description:
"These new output modalities are available to early testers, with wider rollout expected next year."
@@somthingz3928 We got access to Gemini 2.0 as part of the plan, but not these features
How do we become early testers?
I found no way to do this in AI studio. Gemini says it cannot generate images. In the Gemini app it uses Imagen.
WOW! 4o was announced months ago with still no release, and Gemini just flashed through! 🎉
I’m blown away
what? 4o? maybe o1? and it is available now (not preview)
@@darcos-i6s They introduced multimodal features (like single-model image generation) back in May. But they still haven't released it.
Search for "hello gpt-4o"
@@darcos-i6s 4o was announced months ago and we still dont have image or video capabilities
Not really, this isn’t available till next year
@@maddog2622 it is available on AI Studio
We can't do it yet in AI Studio??
Apparently not until next year
@@dominik4496 yeah.. not available
@@hetthummar9582 I have the model in my studio, but it never succeeds in generating images for me - it's either confused, thinking it doesn't have the capability, or it tries and I get content warnings on the most innocent of requests (sunset over ocean, futuristic font, whatever).
When I ask it to do this, it says "sure!" and then hallucinates an imgur URL. The model I'm using is "Gemini 2.0 Flash Experimental". I guess I am not one of the early testers who gets the new output modalities. Is there some way to see this from AI Studio, or do we just have to try and see if it works?
Same here. Most likely we're not the early testers they speak of.
That’s the real gpt4o right there
Thanks for sharing
So the general public can't use these features yet? The image editing looks absolutely amazing!
And with this, image generation has been perfected after 3.5 years. Now it is possible to create any image with character consistency. I wonder how long it will take to perfect video with sound, to get to movie creation
Amazing and frightening speed - we're already at the "are you tired of complex prompts?" phase...
Will Pixel 9 users get the build early?
Do we have APIs for it?
From Google's blog: "These new output modalities are available to early testers, with wider rollout expected next year."
So my question to Google is, why do they use phrases like "It can NOW natively generate images...blah blah."
Yay, I now wanna try prompting it to draw the chain of thought instead of writing it.
I concur.
i want try it draw the screenshot with the mouse click instead of say the positions.
Is the image editing live yet? I can't find this feature.
I don't think so, I was also searching for the same.
This is literally blowing my mind.
Hey guys, one of the early testers here! The model is fantastic to play with, but when I generate an image it doesn't show it visually, it just gives me some bunch of code. What should I do about that?
then you're not an early tester lol
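One guess about the "bunch of code" several people here mention: it may be the raw image bytes (base64-encoded inline data) that the chat UI fails to render. Purely as an illustration - the payload and function here are made up, not the real SDK's types - decoding such a blob back into an image file would look like this:

```python
import base64

def save_inline_image(b64_payload: str, path: str) -> bytes:
    """Decode a base64 inline-data payload and write it to disk as an image file."""
    raw = base64.b64decode(b64_payload)
    with open(path, "wb") as f:
        f.write(raw)
    return raw

# Tiny fake payload standing in for real image data (PNG magic bytes only).
payload = base64.b64encode(b"\x89PNG\r\n\x1a\n").decode()
raw = save_inline_image(payload, "out.png")
```

If what you're seeing is HTML or JSON metadata rather than base64 data, this won't help - that would mean no image was generated at all.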
How... how can this only have 30k views after a day?
That's some impressive stuff.
I’d like to see it do that, with the car at 45 degrees.
i'd love to see it switch the POV of the car to inside the driver's seat whilst flying through the clouds. The creativity you can have if this is possible is immeasurable.
That... is actually really impressive. Can you do that through the API?
Not working as of now, it's stuck on infinite buffering
I am trying and I don't get what's wrong
This is definitely rf inversion in practice
Finally, this might be the one I've wanted for a long time.
Google IS coming back! 👍
I tried it and it doesn't work. The answer is HTML code... Did anyone else try it?
Same here.
This is really amazing!
this is gonna hurt photoshop a bit
This still cannot be done in Google AI Studio; they should show things that can be done now.
THEY'RE NOT RELEASING THIS UNTIL NEXT YEAR WTFF??
Calm down! It's just two weeks until the new year!
I'd like you to share more information about using the Gemini 2.0 API
I'm starstruck!
No, this does not work. I wanted this feature so badly; I keep trying and it's not working
I'm scared for the future of our national security. This is the voice of our nation's men now. They are not strong enough to defend our country. What would you need to do to an adult man to make him sound like this? Not good.
Fake, the image editor doesn't work. It hallucinates uploading the resulting image to imgur and shows code, not the edited image
Thank you.
More good demonstrations Google will never ship.
I would normally agree, but they just shipped their realtime voice API, and it's pretty impressive. I have a little bit more confidence now that what they are saying they can do here is actually possible and will be made available, but let's see
I'm in Google AI Studio and using Gemini 2.0 - how can I access the voice API? Thanks in advance
That wasn't what I was expecting when you said cat on the skate board
I tried this with literally the same exact car and prompt and it couldn't do it.
The multimodal live AI thing is working crazy good though. Google is cooking
I don't think it will work on humans in the image :/ That's kinda not great, since a lot of images have humans in them, even if not intentionally
whoa ... is this sorcery?
This is next level!
Very exciting
This is almost scary. How will humanity handle this when the lines between reality and fiction are invisible?
meet people in real life and get off the internet
It doesn’t do anything with images and says it can’t manipulate images. It’s amazing to me how Gemini literally never works
better than Gemini 1.5
this is amazing
Give me agents and AIOS.
actually cool
exciting times
Goodbye Photoshop🖐
Google seems pretty back.
For this to happen, it would have to be published promptly - I suspect OpenAI is not sleeping. We need Gemini for Android Auto and Pixel Watch 👍🏼
🍀🍀🍀🍀🍀🍀🍀🍀🍀🍀🍀🍀
and people said Google is lagging behind
Just postpone your subscription to any AI tools. Just wait for this Gemini
Holy moly
Awesome
good job, now make it free
daim open ai be gettin smoked rn
Oh woooow, this is realllllllly willing to hallucinate answers. Just gave it a quick try and asked it about a bunch of stuff that either doesn't exist or it doesn't have knowledge of, and it will just go oooon and oooon inventing stuff. Would not use for most things.
Vaporware