bro thanks first for ur effort, I have a problem when I am trying to create a new deployment in "model deployments'" it shows me everytime that 'No quota available Select a different version or deployment type or free up the resources ... none of the options of Chat-Gpt are free..idk either how to become this Quota
I don't think azure openai has a free tier currently. If you don't have quota you might want to try recreating your openai service in a different region. Sometimes subscriptions have limited quotas on certain regions but your have to check the documentation for details. If you're looking for a free easy option for a similar ai model I have a video on my channel for how to set up a local ai model with ollama that would also work for this demo
Thanks for this video. It is really helpful; I am wondering if the tokens get consumed when we supply the chat history. What happens if the tokens run out (the limit set by each model)? How can we clear the history when we know that the tokens have ran out?
Thanks! Chat history does count towards token history - anything you pass to the AI generally counts as tokens. The larger your token limit for a given model, the more history you could send, for example GPT-4 32k runs out faster than GPT-4 128k. Luckily the chat history is just a standard list, so whenever you send a prompt you can update your history list to only include the X number of most recent messages, whatever you think will fit in the token limit for your model. I believe the AI model auto-truncates the tokens provided and just ignores the excess but I'd have to double check on that. You could also store all the messages in one list for the user and for display purposes, but then just take the X most recent number of messages to actually send to the AI.
Thank you for another great content :)
bro thanks first for ur effort, I have a problem when I am trying to create a new deployment in "model deployments'" it shows me everytime that 'No quota available
Select a different version or deployment type or free up the resources ... none of the options of Chat-Gpt are free..idk either how to become this Quota
I don't think azure openai has a free tier currently. If you don't have quota you might want to try recreating your openai service in a different region. Sometimes subscriptions have limited quotas on certain regions but your have to check the documentation for details. If you're looking for a free easy option for a similar ai model I have a video on my channel for how to set up a local ai model with ollama that would also work for this demo
Thanks for this video. It is really helpful; I am wondering if the tokens get consumed when we supply the chat history. What happens if the tokens run out (the limit set by each model)? How can we clear the history when we know that the tokens have ran out?
Thanks! Chat history does count towards token history - anything you pass to the AI generally counts as tokens. The larger your token limit for a given model, the more history you could send, for example GPT-4 32k runs out faster than GPT-4 128k. Luckily the chat history is just a standard list, so whenever you send a prompt you can update your history list to only include the X number of most recent messages, whatever you think will fit in the token limit for your model. I believe the AI model auto-truncates the tokens provided and just ignores the excess but I'd have to double check on that. You could also store all the messages in one list for the user and for display purposes, but then just take the X most recent number of messages to actually send to the AI.
@@alexthecodewolf Thanks for such detailed answer.
Actually, I had the same question. Thanks for asking and thanks for answering :)