Can you do a video on how to build a RAG app in Google Cloud? I've seen a few of these videos and they are good, but they don't explain the process of building a RAG app on Google Cloud. For example, you have the embeddings API, which can generate text embeddings of your query, and then you have vector search, which can find the closest semantically related answer. But what's next? How do you integrate these directly with LLMs to complete the experience?
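A minimal sketch of the "what's next" step, offered as one possible shape rather than a Google Cloud implementation: embed the query, retrieve the closest passages, then paste those passages into the prompt sent to the LLM. Everything here is an assumption for illustration: `embed` is a toy bag-of-words stand-in for a real embeddings API, `retrieve` is a brute-force stand-in for a vector search service, and `llm` is a hypothetical callable you would replace with a real model client.

```python
import math
from collections import Counter

def embed(text):
    """Toy stand-in for an embeddings API: a bag-of-words count vector."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def retrieve(query, docs, k=2):
    """Brute-force stand-in for vector search: top-k docs by similarity."""
    q = embed(query)
    ranked = sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]

def build_prompt(query, passages):
    """Ground the LLM by placing the retrieved passages in the prompt."""
    context = "\n".join(f"- {p}" for p in passages)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {query}"
    )

# Hypothetical corpus, used only for this illustration.
DOCS = [
    "Vector search finds the nearest embeddings to a query embedding.",
    "Embeddings map text to vectors that capture semantic meaning.",
    "Bananas are rich in potassium.",
]

def answer(query, llm, docs=DOCS, k=2):
    """Retrieve, build the prompt, then hand it to any LLM callable."""
    return llm(build_prompt(query, retrieve(query, docs, k)))
```

Calling `answer("what are bananas rich in", llm=my_model)` would pass `my_model` a prompt that already contains the retrieved passages, so the integration point with the LLM is simply the prompt text.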
Want to learn more about AI/ML concepts?
Check out more episodes of Real Terms for AI. → goo.gle/AIwordsExplained
Beautifully explained, thank you so much!
Really good explanation for beginners!
Very timely, good reference!
I've been trying to reduce the size of the embeddings, or to use 16-bit integer math.
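For anyone curious what 16-bit integer embeddings might look like, here is a rough sketch. It assumes symmetric linear quantization with a single scale per vector, which is only one of several possible schemes and not necessarily what the commenter is using. The function names are hypothetical.

```python
from array import array

INT16_MAX = 32767

def quantize_int16(vec):
    """Map floats to int16 using one scale per vector (symmetric scheme)."""
    scale = max(abs(x) for x in vec) / INT16_MAX or 1.0  # avoid a zero scale
    # Clamp to guard against floating-point rounding just past the range.
    ints = [max(-INT16_MAX, min(INT16_MAX, round(x / scale))) for x in vec]
    return array("h", ints), scale  # "h" is the signed short typecode

def dequantize(qvec, scale):
    """Recover approximate floats from the quantized vector."""
    return [q * scale for q in qvec]

def int_dot(qa, scale_a, qb, scale_b):
    """Dot product in integer math, rescaled once at the end."""
    return sum(a * b for a, b in zip(qa, qb)) * scale_a * scale_b
```

Each component then takes 2 bytes instead of the 4 used by float32, and the per-component rounding error is at most half the scale.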
Thanks for sharing 👍