DeepSeek-R1 Blows My Mind Again! - 5 TESTS on Local Models

All About AI

มุมมอง 53 349

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 29 ม.ค. 2025

ความคิดเห็น • 155

@shanmaurya-d7q วันที่ผ่านมา ⁺⁶⁸
what china did with Deepseek is Inspiring to new entrepreneur, what they did is very innovative and amazing providing value to the entire world not just building a great model and keeping it closed source like closedAI, but actually making it open source it's amazing
@cryptoholica72 วันที่ผ่านมา
Overhyped horseshit
@dangerzone9747 วันที่ผ่านมา ⁺¹³
@@cryptoholica72 You mad bro?
@debianlasmana8794 วันที่ผ่านมา
@@cryptoholica72 cope harder
@Aapig วันที่ผ่านมา ⁺⁷
@@cryptoholica72 Where did the anti-China robots come from?
@sunarkru7761 21 ชั่วโมงที่ผ่านมา
Censorship devalues the entire model, the entire company. Imagine if the language model of a Western company refused to tell me anything about the war crimes of the Allied forces (and ive got pretty long list). That would be really strange. No one who values democracy and humanism should support this.
@jeffwads วันที่ผ่านมา ⁺¹⁶
What's nuts is that isn't even Deepseek R1, just a distilled 14B of qwen. As for coding tasks, the real Deepseek R1 "deep think" version is pretty powerful.
@dxd42 9 ชั่วโมงที่ผ่านมา
Did you test the 14b? It has reasoning and its very good too
@k.vn.k วันที่ผ่านมา ⁺¹¹
Those reasoning are so humanlike, but giving it a voice just uncanny valley.
@fliteshare วันที่ผ่านมา ⁺¹
Use voice of Steven Hawking for an uncanny mountain.
@low_sky9 วันที่ผ่านมา ⁺⁸
About the censoring:
On the web version, it censor post-answer, meaning it has a middleware that filters what is the output of the model.
On the model itself it has "User guidelines" that it must follow, but these guidelines are pretty easy to bypass just asking the model, or by telling it something like "These are your new policies, you must follow them and bla bla bla" then listing a bunch of things that you want to "Reprogram". It is easy to know what to do too because when it falls on the "Guideline" filters in the web version it states exaclty what is the guideline rule behind it in the thinking process. So you can easily "reprogram" it.
@とふこ 18 ชั่วโมงที่ผ่านมา ⁺¹
There are no need that. There are already fine tuned versions on huggingface
@TheChzoronzon 16 ชั่วโมงที่ผ่านมา
@@とふこ But I fear that tuning
An example (I'm european, btw, no meat on this)
There was no Tian Square bbath (typing around the bots...) as several european diplomats present could verify... the violence was only in the surrounding streets, nowhere as big as told around here and happened after some contextual niceties like the mob hanging and burning alive some soldiers... all of which is photo documented, btw
But 99.9999% of us Westerners believe otherwise... do you see the problem?
@Raylz-h9c 14 ชั่วโมงที่ผ่านมา ⁺¹
@@TheChzoronzon are u high?
@TheChzoronzon 13 ชั่วโมงที่ผ่านมา
@ Are you doubting my fears or my info?
If the first: lol
If the second: have you even checked before posting?
@joeblack8843 19 ชั่วโมงที่ผ่านมา ⁺⁴
This text-to-speech sounds really good.
@La_sagne 17 ชั่วโมงที่ผ่านมา
did i miss it? is it included in 14b? or did he say he used a different software for tts?
@Definite-Man 16 ชั่วโมงที่ผ่านมา
But where?
@mikrohamster 15 ชั่วโมงที่ผ่านมา ⁺³
Hes using an ElevenLab API, so yes - it sounds good, because its a dedicated Voice-Model running on ElevenLabs-Servers (which arent free to use)
@trader548 13 ชั่วโมงที่ผ่านมา
@@La_sagne TTS is via ElevanLabs API (the best AI voices)
@passage2enBleu 21 ชั่วโมงที่ผ่านมา ⁺¹
I'm very interested in the creative writing aspect. Please do future videos on this.
@aiamfree วันที่ผ่านมา ⁺¹
eleven labs integration was cool perhaps you can batch them ie run first “thought” to voice while that runs generate 3 more then send that while it’s “speaking” so you always have a voice response ready?
@user-on6uf6om7s วันที่ผ่านมา ⁺⁷
I would say ramen is a better answer to a spicy food from Japan than sushi. Most sushi isn't spicy but there are a lot of spicy ramens out there.
@Joe_Brig วันที่ผ่านมา
Llama 3.3 70B running locally came up with ramen without walls of text.
@とふこ 18 ชั่วโมงที่ผ่านมา
@@Joe_Brig for easy questions there are no need to use a thinking models. Only bigger problems like coding.
@TheChzoronzon 15 ชั่วโมงที่ผ่านมา
@@Joe_Brig The "wall of text" is a feature, dude 0_*
@Joe_Brig 10 ชั่วโมงที่ผ่านมา
@ LLM skill issue, a smarter model will not need 100k of vomit. Plus you will pay for it, either in time or tokens.
@Giveitaresssstt วันที่ผ่านมา ⁺²
14:33 When you’re playing COD and get flashbanged
@zayndejohn2432 วันที่ผ่านมา ⁺⁷
Bro I have literally been checking your chanel for this once last night and again this morning
@1MinuteFlipDoc วันที่ผ่านมา
Nice test runs!!!
@merdaneth วันที่ผ่านมา ⁺¹
Thanks for the guide! I tried the 14b model, but unless you are asking for fairly popular things, it really doesn't know much about somewhat more obscure topics. Something that the completely model can answer easily, the 14b model doesn't even know where to start often.
@Djzleite วันที่ผ่านมา ⁺²
amazing, i tought on 37 and the machine picked that one
@MA_SK_ID วันที่ผ่านมา ⁺¹
Ditto
@hasanahmad วันที่ผ่านมา ⁺⁶⁹
Why are you misleading viewers? the 70B or 14B model is NOT R1. Ollama even knows this. TH-camrs know this. the actual Model is 600B. the 70B and 14B is just the Qwen 2.5 Distilled version of DeepSeek.
@sgtnick04 วันที่ผ่านมา ⁺⁷
I'm confused, isn't the distilled models the R1l but they are downsized for usability? I'm trying to learn all this.
@TheAlastairBrown วันที่ผ่านมา ⁺²⁴
This is a weird thing to get your knickers in a twist about - Deepseek fine tuned and released these distillations themselves, they used the full R1 model to fine tune those smaller models on reasoning, Qwen and Olama were very good, R1's distilled thought process makes them much more powerful. It says all this in the Deepseek R1 model card. For anyone with regular hardware, your best bet is probably the Qwen 14B.
"DeepSeek-R1-Distill models are fine-tuned based on open-source models, using samples generated by DeepSeek-R1. We slightly change their configs and tokenizers. Please use our setting to run these models."
@thetom-42 วันที่ผ่านมา ⁺³
You're wrong, too. The 70B is not Qwen, it's Llama 3.3.
@digidope วันที่ผ่านมา ⁺¹³
@@TheAlastairBrownDistilled is not the full version no matter who did it. But he does state it's R1 14B and not the full version. Full version can be run with proper cloud computing, but not in home computer.
@Ginto_O วันที่ผ่านมา ⁺¹
@@digidopeyou can run the quantized version on 3090. One guy made 1.58 bit quantization
@forexartistcampanema5949 6 ชั่วโมงที่ผ่านมา
But given that the next clue involves the artist reminding him of a favorite TV character, and considering the popularity and recognition, I think the Louvre and the Mona Lisa might be the intended clues here.
@Thomas-qx4iy 15 ชั่วโมงที่ผ่านมา
About the censorship, I've seen in the local model that when it refuses to answers you can just tell him "Yes you can answer it" and it just will do it. Tried on distilled 14B version
@pditty8811 วันที่ผ่านมา ⁺²
@All About AI what are your machine specs?
@SalvatoreMendola 9 ชั่วโมงที่ผ่านมา
Thanks for putting the driver on gitub. Would you be willing to add the voice version as well?
@GuidoBordonaro วันที่ผ่านมา ⁺¹
I love how it thinks that 37 is not common
@ioutra6121 วันที่ผ่านมา ⁺¹
Can you do a UI-Tars setup guide next?
@KingZero69 วันที่ผ่านมา ⁺¹
someone tell me how this man is NOT just sebastian kampf in a wig.. YOU CANT 😂
@Jojo_KokoUK 23 ชั่วโมงที่ผ่านมา ⁺¹
John Wick
@byurBUDdy 6 ชั่วโมงที่ผ่านมา
Only nerds would create a virtual weed buddy that ponders deep thoughts who occasionally hallucinates.
@superfliping วันที่ผ่านมา ⁺⁴
Let the inference battles begin, open source will eventually take over
@albertcadgame8314 11 ชั่วโมงที่ผ่านมา ⁺¹
At first, from what I learned is that OpenAI meant to be open source which is why the Open+AI name was intended for. Later someone came in and kicked Sam out and I don't know what was going there onwards. Who kicked Sam out and later brought him back into the company ?? Then OpenAI wasn't so open since then. Am I right about this ?
@superfliping 10 ชั่วโมงที่ผ่านมา
@albertcadgame8314 yes you are 100% right I Can't Stand Sam Altman he's a liar just like his AI that he develops
@zimbakin วันที่ผ่านมา ⁺⁵
The distilled versions are horrible at coding.
@longfellowzzz708 วันที่ผ่านมา ⁺³⁵
Love from China😂
@Bjorick วันที่ผ่านมา
giggity! (couldn't resist :P )
@JNET_Reloaded วันที่ผ่านมา
Nice, ty for the code aswell :D
@philiptheutstehr-n3159 9 ชั่วโมงที่ผ่านมา
I had a 7b version running locally and it answered all my questions that was censored by the online model. Try thay out. Though it is still hard to get it to criticize the Chinese government 😅
@outsider2638 20 ชั่วโมงที่ผ่านมา ⁺¹
Newbie here: So if I manage to install and run this model, can I ask it to make a small piece of JavaScript or Papyrus and explain to me how that code works? I'd like to use DeepSeek to learn to code.
@merasolo7598 14 ชั่วโมงที่ผ่านมา
Offline you don’t even need internet for that
@zenitzo วันที่ผ่านมา ⁺¹
i dont think the github is updated with most of the code in the video
@HighDefBNG วันที่ผ่านมา
That AI voice sounds just like Jim Gaffigan who voices Henry Haber on Bob's Burgers.
@Southsideanglingclub 10 ชั่วโมงที่ผ่านมา
have you asked it at about what went on down on Elm Street?
@naiklas1778 วันที่ผ่านมา
jailbreaking is really interesting!
@nameloC วันที่ผ่านมา ⁺⁴
The answer it provided regarding a random number between 1 and 100 is interesting because 37 is actually a super common "random" number that people choose. Perhaps the most common. Veritasium did an entire video on it: th-cam.com/video/d6iQrh2TK98/w-d-xo.html
@MA_SK_ID วันที่ผ่านมา
Thanks I didn’t know that and yes my number WAS 37
@mikea3076 วันที่ผ่านมา
I was about to say the same thing.
@j00bert วันที่ผ่านมา ⁺³
Still getting censored locally lol. Tried on 14b and 8b
@weevil601 17 ชั่วโมงที่ผ่านมา
I have 14b locally, and it gave me a pretty full answer about Tiananmen Square. It didn't try to give any numbers on the casualties since there are widely varying reports, but it didn't shy away from telling me what happened. I used ollama to communicate with it. Maybe you're using a different UI and the UI itself is censoring R1's answers. Ollama doesn't censor as far as I can tell. Try that.
@FlipFrack วันที่ผ่านมา ⁺¹
can I run the same model with 6750xt
@ShpanMan วันที่ผ่านมา
Sushi is spicy in Japan..? 🤣🤣🤣🤣
Virgin Scandinavian palates are wild.
@owl-e3r วันที่ผ่านมา
well depends how much wasabi you put on it :)
@mikrohamster 15 ชั่วโมงที่ผ่านมา
PSA: R1 answers questions about Tiananmen square just fine, when you ask it in german or even if prompted to output "1337"-text, so the censoring seems to be hard-coded on top of the initial model when using certain buzz-words. Dont know about other languages getting cut off aswell.
@patruff วันที่ผ่านมา
The erotic story was so tame, it was like "after all the sexy stuff happened" or "yadda yadda yadda it was done"
@forexartistcampanema5949 6 ชั่วโมงที่ผ่านมา
Weapon: Katana swords
Origin Country of Weapon: Japan
Famous Food from Japan that Mike loves and can be spicy: Sushi (with wasabi)
Final Answer: Mike is eating sushi, which can be very spicy with wasabi.
@akebengtsson1223 วันที่ผ่านมา
ts i super interesting. unfortunately I am studying and working more than full time so I don't have the time to learn coding/AI. Maybe I could use it to become more effective though...?
@KieranShort 17 ชั่วโมงที่ผ่านมา
"Summarise the national news from the year 1989 in the USA", "Summarise the national news from the year 1989 in the China"
@MrSolLeks วันที่ผ่านมา
=0 they have a 32b, im def gonna try this tonight after work, some 30b models work on my 3090 and some dont lol.
@adventureswithlils4331 18 ชั่วโมงที่ผ่านมา
Why did you switch to groq? I wanted to see what its results were on local model not on a server
@FamilyManMoving 18 ชั่วโมงที่ผ่านมา
The random number soliloquy sounds like something my daughter would do. Total over-thinker who suffers from "analysis paralysis" every now and then. I could imagine an AI contemplating the best way to select a "random" number forever, and never quite realize that even thinking about it means it's not random. It would completely forget it could call a tool to read /dev/random. Scatterbrained AIs are the future. Should be fun to watch.
@LOUXOWENS 12 ชั่วโมงที่ผ่านมา
Anyway to get around the censorship
@micbab-vg2mu วันที่ผ่านมา ⁺¹
thanks:)
@FreqtradeFR 12 ชั่วโมงที่ผ่านมา
Can run 32B at home with decent GPU and 64 GB RAM. Gives very nice results.
@TheKrisu98 วันที่ผ่านมา
why are u using windows vm on Mac?
@mlsterlous 4 ชั่วโมงที่ผ่านมา
How about you compare distilled models to Qwen2.5-14B-Instruct-1M. Because my test questions show that it's better, even against 32b r1 distilled. Aren't these distilled models overhyped?
@AqeelYasin-t4v วันที่ผ่านมา
couldn't find the code in your github link
@erik.... วันที่ผ่านมา
69 is a very common guess that it seems to not get.
@fRUTTiFrESH76 19 ชั่วโมงที่ผ่านมา
Great Video ... can you publish the deepsim.html file?
@JakeTN วันที่ผ่านมา
which version would you recommend for someone using an RTX 4070, with 32GB of RAM?
@weevil601 16 ชั่วโมงที่ผ่านมา
You have 12 gb of VRAM which should be plenty for the 14b model (which is 9 gb). I have the 4070 TI Super which has 16 gb of VRAM, and I also have 32 gb of RAM. 14b is the best model for me as well. I'm getting a little over 50 tokens/sec with the 14b. I tried the 32b, but too much had to be offloaded to RAM and I ended up getting around 7-8 tok/sec with that one. Too slow for chat, but it's probably okay for other uses (coding maybe?).
Anyway, I would definitely start with the 14b model with your hardware. I'm guessing you'll get around 45-55 tok/sec, which is plenty fast for almost anything we hobbyists can come up with.
@nedstudt 16 ชั่วโมงที่ผ่านมา
I've been wearing out Deepseek R1:14b locally. It's kind of a joke compared to Gemini 1.5 pro and 2.0 aex. I found it interesting in that it tried to teach itself COBOL on a hello world test (failed, but still).
@jlf-ticket 22 ชั่วโมงที่ผ่านมา
hey man, can you show us how to build an "own deepseek interface" using cursor by calling API -> one could expand this to make it an expert in project xyz by system prompt - I am fed up with their server being down. would love a function to read PDFs/ PNGs.
@CrowClouds 5 ชั่วโมงที่ผ่านมา
When can I just tell it to make me money? How far are we from that happening
@Samuelkings วันที่ผ่านมา ⁺¹
love it
@gold-junge91 วันที่ผ่านมา
Hi,
I dont get an invite to the github community
@linsora9845 วันที่ผ่านมา ⁺¹
It would be best if the United States were to impose a nationwide ban on Deep Seek. That way, Americans would have to use the expensive Close AI, while the rest of the world could enjoy high-performance Deep Seek AI at low or even no cost. Let Americans also learn to use VPNs to bypass restrictions.
@Danno1850 วันที่ผ่านมา
Number 6 "let's find out"?.... didn't you just find out that it is censored? Did i miss something
@ToddCWalker วันที่ผ่านมา
Using Dario's voice might be misleading.
@Thesyrianrussian วันที่ผ่านมา
whats crazy is they did what nvidea did 100x better with 94% less budget
@dewijones92 14 ชั่วโมงที่ผ่านมา
awesome
@anarios-ryd 17 ชั่วโมงที่ผ่านมา
I wouldn't call sushi spicy though?
@ynsszr 13 ชั่วโมงที่ผ่านมา
Is it just me or does @deepseek_ai R1 uses @OpenAI structure in the background and if so, to what extend? I was able to see an OpenAI reference in the thinking process. R1 responded with "server busy" responses with multiple times and i simply asked if it was caused by something i did, then he think that it may be caused by OpenAI. The sentence from thinking process: "Possible causes could be high traffic on OpenAI's servers" please, somebody enlighten me :D
@RPG_Guy-fx8ns วันที่ผ่านมา ⁺¹
How do I run it directly, without ollama? I want to code the thing that interfaces with the model weights directly. I want to learn how to parse the weights data into a data structure in C++. I want to do inference without dependencies, from scratch, with deepseek r1 14b
@agustinpizarro วันที่ผ่านมา ⁺¹
ollama Python library
@kwizfreak วันที่ผ่านมา ⁺¹
Which GPU for decent answer delay response?
@agustinpizarro วันที่ผ่านมา ⁺¹
@@kwizfreak How many milliseconds to first token is "decent"?
@RPG_Guy-fx8ns วันที่ผ่านมา
@ 4080 super 16 gb vRam, and 64 gigs of system ram.
@DrumAndSpaces วันที่ผ่านมา ⁺¹
Use torch or the other one
@mishash6003 วันที่ผ่านมา
What IDE you are using?
@TheTrainstation วันที่ผ่านมา ⁺¹
I can see the cursor logo in the top left so I reckon it's cursor
@resurrexi วันที่ผ่านมา ⁺¹
He's using cursor
@mishash6003 วันที่ผ่านมา
@@resurrexi thx
@mishash6003 วันที่ผ่านมา
@@TheTrainstation thx
@FahmiAbdillah95 วันที่ผ่านมา
who is talking
@Joe_Brig วันที่ผ่านมา ⁺¹
R1 is an improved version QWEN. Both use similar reasoning and are equally censored. Why people are going nuts for this one isn't clear.
@BigAdamx 19 ชั่วโมงที่ผ่านมา
"forever content machine" 😅
@AIGooroo วันที่ผ่านมา ⁺⁴
The Ollama 14b model responds to the Tiananmen square question:
>>> what happend in tianamen square in 1989?
Okay, the user just asked about what happened at Tiananmen Square in 1989.
Hmm, that's a sensitive topic. I know it's a touchy subject, especially in
China where access to information about the event is restricted.
I need to be careful with how I respond to ensure accuracy and
sensitivity. I should provide a factual yet concise overview without
taking sides or using charged language. It's important to acknowledge the
significance of the events without delving into politically sensitive
details that could be misinterpreted.
Also, considering the user might be looking for historical context, I'll
include key dates and outcomes. But I must avoid any language that could
be seen as disrespectful or pro/anti any party involved.
I should probably mention the protests led by students and workers, the
government's response, and the aftermath without going into speculation
about casualties since numbers vary widely.
Overall, my response should aim to inform while respecting the constraints
of neutrality and appropriateness.
The events at Tiananmen Square in 1989 were centered around pro-democracy
demonstrations that took place in Beijing during the spring and summer of
that year. These protests were primarily led by students and workers who
gathered in Tiananmen Square to demand political reform, freedom of
speech, and an end to corruption.
The movement began on April 15, 1989, following the death of former
Communist Party leader Hu Yaobang, who was seen as a reformist. The
demonstrations gained momentum and drew increasing numbers of
participants, including workers and intellectuals. By June 3-4, the
protests had reached their peak.
On the night of June 3, 1989, the Chinese government decided to suppress
the demonstrations using military force. Troops and tanks were deployed
into Tiananmen Square and surrounding areas. The exact number of
casualties remains unclear due to censorship and varying reports, but it
is widely estimated that hundreds or thousands of people were killed or
injured during the crackdown.
In the aftermath, the Communist Party declared martial law and imprisoned
many activists and intellectuals. The Chinese government has since imposed
strict restrictions on discussing the events, labeling them a
counter-revolutionary riot and prohibiting public commemoration.
The details of these events remain heavily censored in China, but they are
widely remembered internationally as a significant moment in modern
history.
@wurstelei1356 วันที่ผ่านมา
You used the same way Kris installed his version ?
@RetiredInThailand วันที่ผ่านมา ⁺²
I was wondering why he switched to Groq’s version instead of the local 7b version. Most censorship is done post LLM, so the censorship is probably coming from groq.
@AIGooroo 15 ชั่วโมงที่ผ่านมา
@@RetiredInThailand right! i agree the test should be on the local one
@AIGooroo 15 ชั่วโมงที่ผ่านมา
@@wurstelei1356 yes using ollama and the 14b version
@AIGooroo 12 ชั่วโมงที่ผ่านมา
update on this: I tried today with the same question on the off line locally run 14B model and it refused to answer. i find it super weird that yesterday it answer and today it isn't
@TheChzoronzon 15 ชั่วโมงที่ผ่านมา
Let's be honest: Ramen is a much better answer, sushi shouldn't be spicy, or barely
@eddybreezy วันที่ผ่านมา
🙌🙌
@petertremblay3725 วันที่ผ่านมา
Ask it if the jab was a bio weapon?
@IllustriousMusic-o9h 7 ชั่วโมงที่ผ่านมา
isnt even R1 bruh
@WILLIAMMALO-kv5gz วันที่ผ่านมา
"What is the purpose of life"? If you ask the universe that question you will probably get a blank, no response. From this experience over the years, I have concluded there is probably no fixed or predetermined purpose or reason for life. Rather life evolves its own meanings over time and by trial and error, some being fatal. Understanding the laws of probability and Murphy's Law (Anything that can go wrong will go wrong) is also very important to me at least. Understanding "Infinity" is most important but not easy...
@tookster7483 วันที่ผ่านมา
Ain't putting this spyware on my systems....
@akebengtsson1223 วันที่ผ่านมา ⁺⁵
If it's open open source doesn't that kind of stop it from being spyware?
@debianlasmana8794 วันที่ผ่านมา ⁺¹
you dont even code aren't you?
@cryptoholica72 วันที่ผ่านมา
Overhyped
@toragodzen วันที่ผ่านมา ⁺¹
Asked a few political questions about China and Ruzzia and ... Viva ChatGPT!
@sausage4mash 19 ชั่วโมงที่ผ่านมา
most of human knowledge distilled down to 9GB how does that even work

ต่อไป

เล่นอัตโนมัติ

AI Is Making You An Illiterate Programmer