The Ultimate Writing Challenge: Longwriter Tackles 10,000 Words In One Sitting

Explaining OpenAI's o1 Reasoning Models

AI isn't gonna keep improving

LIFEHACK😳 Rate our backpacks 1-10 😜🔥🎒

เจ๊บีเปิดคาเฟ่แมว ขนมเพียบเลย | น้องบีม

Spider web cleaner Brush Making in Factory #shorts

Microsoft's Phi 3.5 - The latest SLMs

Sam Witteveen

มุมมอง 14 362

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 20 ก.ย. 2024

ความคิดเห็น • 31

@thenoblerot หลายเดือนก่อน ⁺¹⁰
Thanks Sam! You always have good content in a sea of clickbait nonsense :)
@samwitteveenai หลายเดือนก่อน ⁺²
Thanks this is what I am trying to go for. This who space has gotten sop hype focused over the past couple of years.
@supercurioTube หลายเดือนก่อน ⁺¹
Thanks for the coverage, I'd be interested in a tool use / RAG and other utilities comparison with Llama 3.1 8B quantized aggressively to bridge the gap in RAM and performance!
@blossom_rx 29 วันที่ผ่านมา ⁺³
Unfortunately every Phi model I tested so far had a model collapse after 3 to 5 queries. I have this only with Microsoft models OR models I truncated on my own. I do not understand the hype and do not trust the benchmarks. Just to make clear: I have about 15 different official models running locally that were not tampered with and NONE except the Microsoft models have this issue.
@thmo_ 29 วันที่ผ่านมา ⁺¹
the MoE wasn't wrong, the correct answer for that calculation was exactly 9.9996, rounding _is_ the next step. So I'd say it did better at that specific question..
@Alex29196 หลายเดือนก่อน ⁺²
Phi 3.5 is mindblowing. Works crazy fast and accurate for function calling, and json answers also.
@NoidoDev 28 วันที่ผ่านมา
Which version, what functions?
@mukilanru 24 วันที่ผ่านมา ⁺¹
Is it faster than Llama-3.1-8b-Instruct float16 for json response? Also which model, mini, right?
@jeremybristol4374 หลายเดือนก่อน
Surprisingly good. Better than v3. But still get's stuck in loops as the response context length grows. Experimenting with prompts to avoid this.
@user-th7cu9ll4j 23 วันที่ผ่านมา
What are some different use cases for Mini and MoE? For example if you want to do a RAG application, which would be more suitable?
@0cano หลายเดือนก่อน
Always top notch content Sam!
@erniea5843 หลายเดือนก่อน
Nice overview!
@NetZeroEarth หลายเดือนก่อน
🔥 🔥 🔥
@Diego_UG หลายเดือนก่อน
Is there any cheap way to finetune these small models with proprietary data?
@samwitteveenai หลายเดือนก่อน ⁺¹
yeah you can do FTs with Unsloth etc quite easily for these.
@WillJohnston-wg9ew หลายเดือนก่อน ⁺¹
Does anyone know of a source for community/conversation on LLMs and business? I'm a technologist developing an app and would really like to find a good source for discussing ideas and what's working/not working.
@xthesayuri5756 หลายเดือนก่อน ⁺⁸
It's funny. Every time a new Phi model comes out I get so insanely bearish for LLMs because they always suck. Just gaming the benchmark but are horrendous to use.
@hidroman1993 หลายเดือนก่อน ⁺²
100% agreed, just ask a slightly different question and Phil goes NUTS
@Spathever หลายเดือนก่อน
This is what I noticed too. Went crazy on the 2nd time. There was no 3rd. Maybe newer bigger ones would work. Probably will need to fine-tune.
@Alex29196 หลายเดือนก่อน ⁺¹
This kind of models are like gold for people working with NLP.
@SavinaAzzahra-i9k หลายเดือนก่อน
😂
@samwitteveenai หลายเดือนก่อน
Can I ask what you are using it for that you are finding it sux. Curious is it a chat kind of app etc?
@hidroman1993 หลายเดือนก่อน ⁺¹
Definitely first
@ArianeQube หลายเดือนก่อน ⁺¹
o fucks given.
@IdPreferNot1 หลายเดือนก่อน ⁺¹
How much longer are we going to pretend that these are in any way practical? No on prem running for anyone except large corp and many of the privacy issues open source was supposed to address arise come back once you start using someone else's hardware. Guess Its great to see smaller models improve and push foundation models, but if you want to do stuff with any off these, especially with agentic processes gobbling thousands of tokens, latency and performance demand hosted service.... might as well go free flash, mini with no setup or hosting issues.
@pwinowski หลายเดือนก่อน ⁺¹
Well, you actually can run a crew of Phi models on a MacBook Pro. The M3 Pro with 36 GB of system memory, can allocate around 27 GB of that pool solely to GPUs for inference.
@IdPreferNot1 หลายเดือนก่อน
@@pwinowski Its not about can/cant. What is the tokens/sec doing that locally? Now consider hitting the gemini-flash API with 128k tokens 15 times a minute for free.

ต่อไป

เล่นอัตโนมัติ

The Ultimate Writing Challenge: Longwriter Tackles 10,000 Words In One Sitting

The Ultimate Writing Challenge: Longwriter Tackles 10,000 Words In One Sitting

Explaining OpenAI's o1 Reasoning Models

Explaining OpenAI's o1 Reasoning Models

AI isn't gonna keep improving

AI isn't gonna keep improving

LIFEHACK😳 Rate our backpacks 1-10 😜🔥🎒

LIFEHACK😳 Rate our backpacks 1-10 😜🔥🎒

เจ๊บีเปิดคาเฟ่แมว ขนมเพียบเลย | น้องบีม

เจ๊บีเปิดคาเฟ่แมว ขนมเพียบเลย | น้องบีม

Spider web cleaner Brush Making in Factory #shorts

Spider web cleaner Brush Making in Factory #shorts

[FULL EP.16] เซียนพาลุย "เผือก-ฟรอยด์" กรี๊ดลั่น จนปูหนี | เฮ็ดอย่างเซียนหรั่ง | One Playground

[FULL EP.16] เซียนพาลุย "เผือก-ฟรอยด์" กรี๊ดลั่น จนปูหนี | เฮ็ดอย่างเซียนหรั่ง | One Playground

AWS CEO - The End Of Programmers Is Near

AWS CEO - The End Of Programmers Is Near

Moshi The Talking AI

Moshi The Talking AI

AI Realism Breakthrough & More AI Use Cases

AI Realism Breakthrough & More AI Use Cases

My Brain after 569 Leetcode Problems

My Brain after 569 Leetcode Problems

AgentWrite with LangGraph

AgentWrite with LangGraph

I Built an AI That Does My Work For Me

I Built an AI That Does My Work For Me

Building a LangGraph ReAct Mini Agent

Building a LangGraph ReAct Mini Agent

InternLM - A Strong Agentic Model?

InternLM - A Strong Agentic Model?

AI Automation: Making AI Work for You - now with GPT-4o Fine-Tuning!

AI Automation: Making AI Work for You - now with GPT-4o Fine-Tuning!

คุณเคยแกล้งเพื่อนแบบนี้มั้ย ? #roblox #shots #funny #robloxไทย #พี่แป้ง #ฟีด #มาแรง #fyp #ตลก #ฮาๆ

คุณเคยแกล้งเพื่อนแบบนี้มั้ย ? #roblox #shots #funny #robloxไทย #พี่แป้ง #ฟีด #มาแรง #fyp #ตลก #ฮาๆ

สาวทอผ้าไหม - อ๋อมแอ๋ม เพชรบ้านแพง「New Version」

สาวทอผ้าไหม - อ๋อมแอ๋ม เพชรบ้านแพง「New Version」

สพฐ. ตรวจใหม่เอง คะแนนสอบ "ครูเบญ"

สพฐ. ตรวจใหม่เอง คะแนนสอบ "ครูเบญ"

MANG bamm - เจ็บก็สิอดเอา (Stay Strong) | OFFICIAL M/V

MANG bamm - เจ็บก็สิอดเอา (Stay Strong) | OFFICIAL M/V

Trapdoors in the Desert Challenge! - funny minecraft animation #shorts #cartoon

Trapdoors in the Desert Challenge! - funny minecraft animation #shorts #cartoon

สุริยะจักรวาลมีอะไรบ้างน้าา ? 🤣 #KTTVjourney #ก้อยตูนทะเลเวลา #KTTalay #เปิ้ลเน็กxกันเอง

สุริยะจักรวาลมีอะไรบ้างน้าา ? 🤣 #KTTVjourney #ก้อยตูนทะเลเวลา #KTTalay #เปิ้ลเน็กxกันเอง

Never thought this girl can be a killer #shorts #cdrama #coupleofmirrors #movie #drama

Never thought this girl can be a killer #shorts #cdrama #coupleofmirrors #movie #drama

สพป.บุกพบ 'ครูเบญ' พาตรวจข้อสอบตัวเอง - เปิดผลสอบสาวติดที่ 1 แทน เก่งระดับหัวกะทิ

สพป.บุกพบ 'ครูเบญ' พาตรวจข้อสอบตัวเอง - เปิดผลสอบสาวติดที่ 1 แทน เก่งระดับหัวกะทิ