Moondream: how does a tiny vision model slap so hard? - Vikhyat Korrapati
- Published on 9 Feb 2025
- Psst! Wanna learn how to build an AI model that punches way above its weight? One that beats models 4 times its size and competes with the models from Meta, Google and OpenAI? In this talk I’ll spill the beans on how I pulled it off, and how you can too. I’ll share the unexpected journey that led to the creation of Moondream, a tiny open-source vision language model that kicks ass, and the technical hurdles that I faced along the way. I’ll also explain why small models are the future of AI. Join me for a story of accidental innovation, the democratization of AI, and how sometimes, thinking small can lead to big results.
Recorded live in San Francisco at the AI Engineer World's Fair. See the full schedule of talks at www.ai.enginee... & join us at the AI Engineer World's Fair in 2025! Get your tickets today at ai.engineer/2025
About Vikhyat
Vik's work focuses on developing efficient AI models that can run on resource-constrained devices without sacrificing performance. His mission is to democratize AI technology, making advanced computer vision accessible to developers and businesses of all sizes. Prior to his current endeavors, Vik spent 9 years at AWS, gaining valuable experience in large-scale computing systems.
Been watching Moondream grow for half a year now - great progress!
One of the most enjoyable and down-to-earth videos in this series. Great stuff! Thanks!
Very impressive demo and talk. I have not had any ideas for a multimodal or vision-based AI application, but if I do, I will be sure Moondream is at the top of the list of models to try out.
Demo was the coolest part! Well done!
Moondream is incredible. Thank you, Vikhyat
We love Moondream!
The demo was impressive. Had no idea what the presentation was till the demo
Great work Vikhyat! Rooting for moondream :)
great talk, thanks for sharing
Moondream 🙏
amazing video ai engineer thanks for sharing, please keep up with finding and sharing great content
Super interesting
Good job 👍
this is awesome
Awesome
Go vik..to the moon 🌝
He's David vs. Goliath. ❤
Does the model also work for extracting data from documents? Thanks!
Lots of good golden nuggets in there
What are some good use cases of this tech?
Moondream is great
🎉
Fucking AAAAAAA AMAZING!!!!!