![Elvis Saravia](/img/default-banner.jpg)
Elvis Saravia
United Kingdom
Joined Sep 5, 2013
Weekly tutorials, paper summaries, and technical walkthroughs on the most important AI and LLM developments.
Business inquiries: elvissaravia@dair.ai
Function calling with GPT-4o | OpenAI Playground
A demo on how to use function calling with the OpenAI Playground.
More on function calling in our guide: www.promptingguide.ai/applications/function_calling
Check out my upcoming live training to learn more about building with LLMs:
maven.com/dair-ai/prompt-engineering-llms
#ai #chatgpt #artificialintelligence
Views: 505
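The flow shown in the Playground demo can be sketched in code. Below is a minimal sketch of the function-calling loop, assuming the OpenAI chat-completions `tools` schema; `get_weather`, its stubbed return value, and the hard-coded `tool_call` are hypothetical stand-ins for a live model response (no API call is made here):

```python
import json

# Tool schema in the OpenAI chat-completions "tools" format.
# get_weather is a hypothetical example function, not from the video.
tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Get the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

def get_weather(city: str) -> dict:
    # Stub implementation; a real app would call a weather API here.
    return {"city": city, "temp_c": 21}

# In a real run, the model decides to call the function and returns
# a tool call like this; we hard-code one to show the dispatch step.
tool_call = {"name": "get_weather", "arguments": json.dumps({"city": "London"})}

# Dispatch: look up the named function and call it with parsed arguments.
registry = {"get_weather": get_weather}
args = json.loads(tool_call["arguments"])
result = registry[tool_call["name"]](**args)
print(result)  # → {'city': 'London', 'temp_c': 21}
```

In a real run you would pass `tools` to the chat-completions call, read the model's `tool_calls` from the response, and send each function's result back to the model as a `tool` message so it can compose the final answer.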
Videos
Scaling synthetic data, Agentless AI coder, RAG best practices | Top AI Papers of the Week
277 views
Summary of some of the most interesting AI and LLM papers of the week.
00:00 Tiny Giant - x.com/SFResearch/status/1807811770267971984
01:22 Million Tiny Experts - x.com/omarsar0/status/1810389538340290724
02:30 1B Personas - x.com/omarsar0/status/1807827401122238628
03:53 Reasoning in LLMs - x.com/omarsar0/status/1810329294884741594
05:20 Best Practices in RAG - x.com/omarsar0/status/1808177231...
Prompting and evaluating with Anthropic LLMs just got easier and faster!
852 views
Taking the new Anthropic Console prompting features for a test drive.
Check out my upcoming live training to learn more about building with LLMs: maven.com/dair-ai/prompt-engineering-llms
#ai #machinelearning #datascience #science #engineering
[LLM News] Moshi, RAG Best Practices, State of AI Report, Million Tiny Experts, GPT4All, RouteLLM
696 views
Another exciting episode of LLM News! Links mentioned in the video:
00:00 Moshi - x.com/kyutai_labs/status/1808883086173569222
01:54 Gen-3 Alpha - x.com/runwayml/status/1807822396415467686
02:19 RouteLLM - x.com/lmsysorg/status/1807812671238258931
04:10 Tiny Giant - x.com/SFResearch/status/1807811770267971984
05:23 Million Tiny Experts - x.com/omarsar0/status/1810389538340290724
06:31 1B Person...
Using LLMs to build a defense against adversarial attacks
511 views
Evaluates LLMs when used as a defense against adversarial attacks.
Paper: arxiv.org/abs/2407.03234
Check out my upcoming live training to learn more about building with LLMs: maven.com/dair-ai/prompt-engineering-llms
#ai #machinelearning #science #datascience
Jupyter Notebooks are now POWERED by AI (Codestral and GPT-4o)
970 views
Overview of Pretzel, a new AI-powered Jupyter.
Repo: github.com/pretzelai/pretzelai
The dataset used: github.com/dair-ai/ML-Papers-of-the-Week/blob/main/research/ml-potw-10232023.csv
#ai #machinelearning #scienceandtechnology #coding #datascience
[LLM News] ESM3, CriticGPT, Gemma 2, LLM Compiler, LongRAG, GraphReader
1.3K views
Another exciting episode of LLM News! Links mentioned in the video:
00:00 ESM3 - th-cam.com/video/l2EvtcLb19o/w-d-xo.html&ab_channel=ElvisSaravia
01:13 Gemini Announcements - developers.googleblog.com/en/new-features-for-the-gemini-api-and-google-ai-studio/
03:32 Gemma 2 - th-cam.com/video/vJyCr2yPTQM/w-d-xo.html&ab_channel=ElvisSaravia
05:26 ChatGPT on Desktop - openai.com/chatgpt/mac/
06:19 C...
Google releases Gemma 2 and it's IMPRESSIVE!
2.9K views · 14 days ago
Reviews the latest updates on the Gemini 1.5 models and experiments with the new Gemma 2 model.
00:00 New Gemini announcements
03:01 Long-context & context caching
04:01 Code Execution
06:01 Gemma 2 Testing
Official Gemini announcement: developers.googleblog.com/en/new-features-for-the-gemini-api-and-google-ai-studio/
Context caching tutorial: th-cam.com/video/987Pd89EDPs/w-d-xo.htmlsi=TRaG-3ToghQ1...
AI discovers NEW green fluorescent protein!
473 views · 14 days ago
A summary of the new ESM3 paper by Evolutionary Scale.
More here: www.evolutionaryscale.ai/blog/esm3-release
Paper: evolutionaryscale-public.s3.us-east-2.amazonaws.com/research/esm3.pdf
#science #ai #biology
Few-Shot Prompting Explained
2.5K views · 14 days ago
In this video, I explain the idea behind few-shot prompting, what enables it, and how it can be used with LLMs.
More in our guide: www.promptingguide.ai/techniques/fewshot
Learn more about few-shot prompting and LLMs in one of our upcoming live courses: maven.com/dair-ai/prompt-engineering-llms
#ai #llms #promptengineering #machinelearning #programming
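To make the idea concrete, here is a minimal sketch of how a few-shot prompt can be assembled: demonstrations are packed into the prompt so the model can infer the task from examples. The sentiment-classification task, the example pairs, and `build_few_shot_prompt` are illustrative choices, not taken from the video:

```python
# Hypothetical demonstration pairs for a sentiment-labeling task.
examples = [
    ("This movie was fantastic!", "positive"),
    ("Utterly boring from start to finish.", "negative"),
]

def build_few_shot_prompt(examples, query):
    """Pack labeled demonstrations plus an unlabeled query into one prompt."""
    lines = ["Classify the sentiment of each text as positive or negative.", ""]
    for text, label in examples:
        lines.append(f"Text: {text}")
        lines.append(f"Sentiment: {label}")
        lines.append("")  # blank line between demonstrations
    # The query repeats the demonstration format but leaves the label blank.
    lines.append(f"Text: {query}")
    lines.append("Sentiment:")
    return "\n".join(lines)

prompt = build_few_shot_prompt(examples, "I loved every minute of it.")
print(prompt)
```

The resulting string would be sent to an LLM as the prompt (or user message); the model completes the final `Sentiment:` line by following the pattern established by the demonstrations.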
[LLM News] Claude 3.5 Sonnet, Open-Sora, Context Caching, PlanRAG, Safe SuperIntelligence Inc
1.1K views · 21 days ago
Another exciting episode of LLM News! Links mentioned in the video:
00:00 Claude 3.5 Sonnet - www.anthropic.com/news/claude-3-5-sonnet
04:04 Gen-3 Alpha - runwayml.com/blog/introducing-gen-3-alpha/
05:05 Safe SuperIntelligence Inc - x.com/ssi/status/1803472825476587910
06:35 Meta AI Research - about. news/2024/06/releasing-new-ai-research-models-to-accelerate-innovation-at-scale/
07:49 De...
New Claude 3.5 Sonnet is here and it's POWERFUL!
4.1K views · 21 days ago
MoME Reduces LLM Hallucinations by 10X!
9K views · 28 days ago
[LLM News] Apple Intelligence, Dream Machine, Self Teaching, AI Personality, Function Calling Course
440 views · a month ago
Function Calling with OpenAI APIs | A Crash Course
3.6K views · a month ago
[LLM News] AGI Predictions, Mamba-2, NLLB, GPT-4 Features, Structured LLM Generation, KLING
921 views · a month ago
New prompting method uses thought templates | Buffer of Thoughts
1.2K views · a month ago
Extracting features from Claude 3 Sonnet
342 views · a month ago
[LLM News] xAI Series B, Codestral, LLM Guide, AutoGen Course, Symbolic Chain-of-Thought
999 views · a month ago
Exploring Capabilities of Long-Context LLMs
598 views · a month ago
[LLM News] GPT-4o, Project Astra, Veo, Copilot+ PCs, Gemini 1.5 Flash, Chameleon
795 views · a month ago
[LLM NEWS] KANs, Gemma 10M Context, OpenAI Updates?, Automatic Prompt Engineering, Tokenizer Arena
700 views · 2 months ago
[LLM NEWS] AlphaFold 3, xLSTM, OpenAI's Model Spec, DeepSeek-V2, OpenDevin CodeAct 1.0
1.5K views · 2 months ago
SWE-Agent | An LLM-based Software Engineering Agent
1.1K views · 2 months ago
Better and Faster LLMs via Multi-token Prediction
2.2K views · 2 months ago
Training an LLM to effectively use information retrieval
1K views · 2 months ago
amazing video and great resources! Thank you Elvis! <3
This is very good, thank you for your effort!
Thanks for the support
This is a great format and your input is really beneficial. The only thing: could we break it into two videos, each no more than 10 minutes, instead of one long 20-minute one? It's a preference, not vital.
Appreciate the feedback. Yes, I agree that it makes sense to potentially split it or significantly shorten it. Maybe doing the episodes twice a week might be better. Thanks.
It is a good paper, but unfortunately it uses Llama Guard, retrieval, reranking, etc. The LLMs will have higher latency with that many inference calls; those are the only disadvantages I see.
Those are good points. I think they mention a few of these things in the discussion. Smaller language models may be more ideal here.
Thanks for sharing
amazing video
how do I get more tokens
Thx for AI news update, really appreciate it
Great!
first comment 🤣🤣🤣🤣
:)
Nice
Thanks
What up my dude! Thanks
Thanks, my dude!!
As always, a good value add summary, thx
Where is the context caching?
You can find the link to a separate tutorial I did for the context caching in the description.
Great Information! Keep it up
Elvis news video series?! Let's gooo!
It is strongly censored; it won't even give advice on how to treat a headache!
It does seem that there are strong safety guardrails on this one.
Cool
i like your explainer on Twitter, but i like youtube explainer more!!
Is gemma 2 good for coding? Like is it better than DeepSeek Coder V2?
It's good for coding but I don't think it's better than a custom model like DeepSeek Coder V2. I think DSCV2 is a lot better.
I didn't understand the candle task too. Maybe, I'm also an AI after all...
It's meant to be a puzzle but logical reasoning is required to get this one right.
It's funny because it's for developers and longer windows. 2:40 Does the interpreter execute code or just count the delimiters? Like: if the code included a function requiring advanced read/write permissions, would the interpreter write to file when executing? Is that what I thought? 3:15 I think cache could be leveraged in place of compute while multiprocessing, but per thread/semaphore.
Cool
Thank you for sharing this
It's fantastic at writing fiction, but it hallucinates when you ask it detailed IT questions. I used it for a few days and noticed the difference in capability from GPT-4o. I canceled my GPT-4 subscription and subscribed to Claude, but ended up changing back.
Thanks for sharing. Yeah, I have seen the challenges with IT or technical questions. This is a test I always run before committing to any LLM, because most of my use cases involve technical content. I haven't cancelled any of my subscriptions; still experimenting a lot with the tools.
Is it already available in the Vertex AI tooling?
Not checked yet.
Thanks, awesome video. What prompt did you use for the MLP? I wasn't able to recreate it, but I desperately need it, as I want to do an introductory lecture series on deep learning.
I am not sure it’s possible to share the full prompt from Claude. Let me look into it or I might do a video explaining the process.
All just to say "Those who control history control the future" and "You will own nothing and be happy".
Ooh, is it out on Ollama?
Yes, Ollama has it. Just make sure to update to the latest version, as it has important bug fixes. 1.46 as of now.
Thanks for responding.
Can you have it write you some new music?
Doubt it has that capability but it could be interesting
This is awesome. We need more updates and explanations like this!
What application you used to read pdf paper? Is that Adobe reader?
It’s just Mac preview
God is good, these tools were all included and better in our 3.5 watt brains 🧠
Claude needs to implement a differ. Regenerating the full code of artifacts every time, even for minor changes, is so inefficient.
I don't understand: three to four months ago, when there was only Claude 3 Opus, it was 5 times more expensive and 2x slower; now Sonnet 3.5 is even more intelligent than Opus, faster, and cheaper, in just 3 to 4 months.
They figured something out. The same happened with the newer OpenAI models: they got cheaper and better.
@@elvissaravia that's fascinating!
Claude 3.5 Sonnet has less context than 3 Opus.
7:45 8-bit Luigi looks more like Walter White 😄
😂 true! Maybe giving it an image as a reference could have helped
Let me know if you want me to do a separate video with more examples. I have a lot more examples that I am sharing here: x.com/omarsar0/status/1803796159334322566
it's awesome
Looks very intelligent
A question on this: in the tutorial above, we are using that txt file to answer our queries, so it's more like grounding the results of our query in a certain datastore (the txt file in this case). So what is the difference, in terms of functionality, between using this context caching feature and grounding the response on some datastore (with the txt file stored in it)? I'd appreciate a response.
We don't know exactly how the cached context is used at inference, but I assume it's not much different from the grounding approach you mentioned.
Thank you.
Great stuff Elvis, just subscribed annually to your newsletter. :)
Thank you. I appreciate the support.
Can you please provide the code
Let me work on that. Check back by end of the day and I will share the link here.
Thank you so much for taking out time to explain this
Thanks!
Nice! That adds a lot more confidence in getting correct answers. The "mixture of agents" model architecture is coming in with some good stuff too (not as good as this though; this is big). We're not far from some really smart agents...
Hmm. Lots of PR stunts on their blog. So still... skeptical. I really don't get the main trickery, and 200 API calls per month is not enough to get a proper test-through. "Internal memorization. Tuning the weights, not RAG. You can layer them." /via X.
Loved the tone on the content man, you've got a new subscriber! Great job!
But is it right to call this innovation? Can just training millions of experts with task-specific facts really be called research?
It’s special because it swaps in those experts within a larger architecture. Related research on polysemanticity also suggests that sparsity will enhance explainability and steerability.