Do not use Llama-3 70B for these tasks ...

Discover AI

3,142 views


Published on 20 Sep 2024
A detailed analysis of the 1 million votes cast by the AI community on LLM performance opens up new insights into areas where LLMs outperform, and areas where you had better not use a particular LLM and should opt for a better-performing one instead.
All rights with the authors:
What’s up with Llama 3? Arena data analysis
lmsys.org/blog...
#airesearch #ai #newtechnology

Comments • 12

@gileneusz 4 months ago ⁺⁵
This is a great video! Really amazing explanation.
@code4AI 4 months ago
One of the best comments today! 😊
@martinsherry 4 months ago ⁺⁵
“of course, those people were wrong”... hahahaha.
@code4AI 4 months ago ⁺²
Finally, someone is laughing! Success! 😂
@IdPreferNot1 4 months ago
Love how your critiques shred the populist AI community while providing useful info.
@henkhbit5748 4 months ago ⁺¹
If an open-source LLM performs well for your particular use case, then for me it will always have my preference over a big monolithic closed-source LLM from ClosedAi!
@thedoctor5478 4 months ago ⁺²
I couldn't care less about friendliness. We can get that from low-parameter models and use them to reword texts. Larger models should just care about reasoning above all else.
@TheReferrer72 4 months ago
Now I know you are tripping. Unless I can't read that graph properly, you are trying to tell us that a 44-45% win rate is a big loss!
Especially as this is a 70B open-weights model, while the others are all closed weights.
And as another commenter noted, Llama 3 has only a 4k context window, so of course it will be poor at summarisation and other tests that rely on a long context.
We will be getting longer-context versions from Meta, multimodal and with huge parameter counts.
@code4AI 4 months ago
Llama 3 was trained on 8192 tokens 😂
@TheReferrer72 4 months ago
@code4AI OK, it has an 8k token length, GPT4 Turbo 128k, Claude 200K, Gemini 1000K+, so 16 times longer. My point still stands.
And I notice how you did not address my first point. Like I said, you are tripping.
@peterbell663 4 months ago
I found it essentially useless and a waste of my time. I gave it a dataset of 10,000 lines with 22 variables and asked for summary statistics in cumulative blocks of 1000, 10 blocks in total. I re-posed this question about 8 times over hours and each time the answer was DRIBBLE. And that was a very easy task. Imagine giving it a slightly more difficult task like time series modelling. I will check the alternatives.
@dennisestenson7820 4 months ago
Maybe you should choose an appropriate tool for the task.
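
For what it's worth, the task described in this thread (summary statistics over cumulative blocks of 1000 rows) is the kind of thing a few lines of ordinary code handle directly. Below is a minimal sketch using only the Python standard library. It assumes "cumulative" means rows 1-1000, then 1-2000, and so on up to 1-10,000; the function name `cumulative_block_stats` and the random stand-in data are hypothetical, since the commenter's actual dataset is not shown.

```python
import random
import statistics


def cumulative_block_stats(rows, block_size=1000):
    """Per-column stats for rows[:block_size], rows[:2*block_size], ...

    Assumption: 'cumulative blocks' means each block includes all earlier rows.
    """
    results = []
    for end in range(block_size, len(rows) + 1, block_size):
        columns = list(zip(*rows[:end]))  # transpose rows into columns
        results.append({
            "rows": end,
            "mean": [statistics.fmean(c) for c in columns],
            "stdev": [statistics.stdev(c) for c in columns],
            "min": [min(c) for c in columns],
            "max": [max(c) for c in columns],
        })
    return results


# Synthetic stand-in data: 10,000 rows x 22 numeric variables (not the real dataset).
random.seed(0)
data = [[random.gauss(0, 1) for _ in range(22)] for _ in range(10_000)]

summary = cumulative_block_stats(data)
print(len(summary), summary[-1]["rows"])  # prints "10 10000"
```

With a real file, the `data` list would be replaced by rows read from that file, for example via the `csv` module.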
