Great video, Dan - thanks! It would be very interesting to dive into vector database mechanics as well 👍
I'm so excited about this! This is the coolest video about Java!
I've built accounting software and I was planning to add a help section where users could ask questions and get relevant answers. I have around 30 PDF documents covering different laws and regulations.
This is the perfect thing at a perfect time ❤
The best video about RAG I have ever seen
That's a great video. Going from the general picture to code examples is a great idea!
Amazing video, Dan
Thank you.
These videos are a motivating reminder that Java can be used for AI and for building enterprise solutions.
We don't want to expose the organization's data to the public; we need to keep it private. So using Ollama is a great choice. Please provide more resources on this approach.
Excellent as usual
Thank you, Dan, for touching on RAG. If prompting is required, could you point out some resources you recommend on the topic? Thank you.
This was really great, I appreciate it!
Thank you Cote!
Great intro Dan, I always enjoy watching your videos. ❤ As for the model and the retrieval themselves, well, 250 bps priced in? Seriously? Leaving aside the average quality of Tika and the warts of PdfBox, and the fact that it is nowhere near enough to just leverage the existing open-source libraries to come close to anything production-grade, that toy sample is fun to showcase that the shape looks right. But I’m still struggling to come up with a justification for pulling the confines and cruft of Spring into the realm of Python, where you get everything from the papers immediately. I believe Spring AI has managed to retain the relevance of Spring for the coming couple of years. But Spring is unsuitable for the agile and often experimental workflows in data science. An average Spring dev can’t do proper data science, and an average data scientist can’t do proper Java, much less proper Spring. So I don’t know any other reason why VMware decided and Broadcom agreed to keep it up. I wish Spring had focused on improving what it’s good at and reining in some consistency at last. I know, different people, different projects, but today it’s often enough to add a Flask or a FastAPI wrapper and be done with it.
Nice video, can you tell me what font you are using in your IDE?
Thank you Dan for this one!
I have a question regarding how the ChatClient is instantiated using the context. How will it deal with a large dataset from the vectorStore? Are there any performance issues that could be foreseen?
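One way to think about the large-dataset question: in a typical RAG setup only the top K most similar chunks are placed in the prompt, so the prompt size stays bounded however large the store grows, and the remaining cost is the similarity search itself. Here is a minimal sketch of that idea in plain Java. All names (Chunk, TopKRetriever) are hypothetical and this is not Spring AI's VectorStore API; a real store such as PGVector would typically use an index instead of the linear scan shown here.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical types for illustration only; not part of Spring AI.
record Chunk(String id, String text, double[] embedding) {}

final class TopKRetriever {
    private final List<Chunk> chunks = new ArrayList<>();

    void add(Chunk chunk) {
        chunks.add(chunk);
    }

    // Cosine similarity between two vectors of equal length.
    static double cosine(double[] a, double[] b) {
        double dot = 0, normA = 0, normB = 0;
        for (int i = 0; i < a.length; i++) {
            dot += a[i] * b[i];
            normA += a[i] * a[i];
            normB += b[i] * b[i];
        }
        if (normA == 0 || normB == 0) {
            return 0;
        }
        return dot / (Math.sqrt(normA) * Math.sqrt(normB));
    }

    // Linear scan over all chunks; returns at most topK chunks, most similar first.
    List<Chunk> search(double[] query, int topK) {
        return chunks.stream()
                .sorted(Comparator.comparingDouble(
                        (Chunk c) -> cosine(query, c.embedding())).reversed())
                .limit(topK)
                .toList();
    }
}
```

Whatever the store holds, only the chunks returned by search would be added to the prompt, so topK is the knob that controls context size and token cost.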
Can you provide guidance on how RAG can work in real-time to read dynamic data?
I have a project for an e-commerce website integrated with a chatbot. I want the chatbot to be able to read updated data (such as price changes) in real-time, instead of having to restart everything from scratch.
Please, help me
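For changing data such as prices, one common approach is to give every document a stable id and replace only the entry that changed, rather than rebuilding the whole store. The sketch below shows that upsert pattern with a hypothetical in-memory ProductIndex; it is not a Spring AI class, and a real vector store would also re-embed the new text at this point (for example by deleting the old document by id and adding the updated one).

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical in-memory stand-in for a vector store, keyed by a stable document id.
final class ProductIndex {
    private final Map<String, String> docsById = new LinkedHashMap<>();

    // Inserts the text, or replaces the existing entry with the same id,
    // so the stale version can no longer be retrieved.
    void upsert(String productId, String text) {
        docsById.put(productId, text);
    }

    // Removes a product that is no longer sold.
    void remove(String productId) {
        docsById.remove(productId);
    }

    String get(String productId) {
        return docsById.get(productId);
    }

    int size() {
        return docsById.size();
    }
}
```

A price-change event from the shop backend would call upsert for that one product, and the next chatbot query would see the new text without any restart.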
Just came from X, thanks.
So the images are not processed, right? They also hold some data. How can we include images too?
How do we know how many input and output tokens were used in this request, so that we can keep track of the financials?
It's returned in metadata of each call to ChatClient.
Also, the spring-ai team added this data as metrics in newer versions (at least in 1.0.0-M3), so all you need to do is add the actuator to the classpath.
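To turn those per-call token counts into a running cost figure, something like the following could work. This is only a sketch: TokenCostTracker is a hypothetical helper, the prices are made-up parameters, and the counts passed to record are assumed to be read from the usage metadata of each response (in Spring AI I believe that is reachable via the ChatResponse metadata, but check the version you are on).

```java
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical helper for illustration; not part of Spring AI.
final class TokenCostTracker {
    private final AtomicLong promptTokens = new AtomicLong();
    private final AtomicLong completionTokens = new AtomicLong();
    private final double promptPricePerMillion;      // assumed price, set by the caller
    private final double completionPricePerMillion;  // assumed price, set by the caller

    TokenCostTracker(double promptPricePerMillion, double completionPricePerMillion) {
        this.promptPricePerMillion = promptPricePerMillion;
        this.completionPricePerMillion = completionPricePerMillion;
    }

    // Call once per model response with the counts taken from its usage metadata.
    void record(long prompt, long completion) {
        promptTokens.addAndGet(prompt);
        completionTokens.addAndGet(completion);
    }

    long totalTokens() {
        return promptTokens.get() + completionTokens.get();
    }

    // Estimated spend so far, in the same currency as the configured prices.
    double estimatedCost() {
        return promptTokens.get() / 1_000_000.0 * promptPricePerMillion
                + completionTokens.get() / 1_000_000.0 * completionPricePerMillion;
    }
}
```

Input and output tokens are kept separate because providers usually price them differently.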
Nice, but I couldn't post it to Ollama ...
What problems did you have with Ollama? This should work the same just swapping out OpenAI for Ollama and an open source model.
@DanVega
Hello, I can't post to Ollama with llama3.1 either.
I/O error on POST request for "localhost:11434/api/embed": timeout
Hello Dan Vega,
I'm currently working with Spring AI to retrieve structured data from AI calls. At present, I’m using a method structured as follows:
private <T> T promptChatClient(Class<T> responseType, Resource loanData, String textData) {
    return chatClient.prompt()
            .user(u -> {
                u.text(loanData);
                u.param("data", textData);
            })
            .call()
            .entity(responseType);
}
This method utilizes chatClient.prompt() to send structured data (loanData and textData) and expects a response in the specified responseType.
Would you have any suggestions on how to get the number of tokens used, along with the other information that I get when I call chatResponse()?
Thank you for your time and expertise!
The PG Vector blew my mind.
What part blew your mind?