Testing 1 Million Context Length of Llama 3 8B Locally

  • Published Oct 26, 2024

Comments • 21

  • @antonvinny · 5 months ago · +1

    You can increase the context size in LM Studio; it should be in the Model Inspector, if I remember correctly.

    • @fahdmirza · 5 months ago

      Yes, tried that, but same issue. Thanks anyway.

    • @HassanAllaham · 5 months ago

      I am not sure, but I think there is some kind of upper limit on what the LM Studio renderer can accept: a maximum number of characters that can be parsed from the textarea input element in the front end and passed to the back end. That limit is separate from the context size set for the LLM engine. Unfortunately, LM Studio is a closed-source app, so it is very hard to verify such a maximum. That's why it is better to test increased-context models in a terminal-based app like Ollama (see the sketch after this thread).
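
A minimal sketch of such a terminal-based test in Python, assuming Ollama is running on its default port (11434) and that the 1M-context Llama 3 8B build has been pulled under the tag llama3-gradient (both assumptions; substitute your own model tag and input file):

```python
import json
import urllib.request

# Any long input works; big_input.txt is a placeholder file name.
prompt = "Summarize the following text:\n" + open("big_input.txt").read()

req = urllib.request.Request(
    "http://localhost:11434/api/generate",
    data=json.dumps({
        "model": "llama3-gradient",      # assumed tag for the 1M-context build
        "prompt": prompt,
        "stream": False,
        "options": {"num_ctx": 256000},  # context window, in tokens
    }).encode(),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(json.loads(resp.read())["response"])
```

Because the prompt goes straight to Ollama's local HTTP API, no front-end input field is involved at all.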

  • @HassanAllaham · 5 months ago

    Thanks for this video. Could you please feed it gradually increasing lengths of text using Ollama and evaluate the results? You could do it without any GUI, straight from the terminal, where there is no input limit. Before testing, I believe it is better to create a clone of the model using the right template and parameters in the Modelfile; the template and parameters are provided on the Ollama website (see the Modelfile sketch after this thread). I suspect this increase in context window size will cause real problems (gibberish, hallucination, or looping) that only show up beyond a certain context length. The needle-injection test they ran on this model does not give a real evaluation of its performance. The only real benefit of such an increase is making large inputs much easier to work with, since there should be no need for RAG.

    • @fahdmirza · 5 months ago · +1

      Would have to check.
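
For reference, a minimal sketch of the Modelfile clone described above, under the same assumptions (llama3-gradient as the base tag). The TEMPLATE below is a simplified rendering of Llama 3's chat format; copy the exact template and parameters from the model's page on the Ollama website, as the comment suggests. A wrong or missing template or stop token is also a common cause of the never-ending output mentioned further down this thread.

```
# Modelfile (sketch) -- adjust the base tag and copy the exact
# template/parameters from the model's page on the Ollama website.
FROM llama3-gradient:8b

# Extended context window, in tokens; memory use grows with this value.
PARAMETER num_ctx 256000

# Llama 3's end-of-turn token; without it, generation may never stop.
PARAMETER stop "<|eot_id|>"

TEMPLATE """<|start_header_id|>user<|end_header_id|>

{{ .Prompt }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>

"""
```

Build and run the clone with ollama create llama3-1m -f Modelfile and then ollama run llama3-1m (llama3-1m is just an illustrative name).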

  • @omarawad117 · 5 months ago

    Is there more than an 8k input context window?

    • @fahdmirza · 5 months ago

      Not for this one.

  • @lorenzo9196 · 5 months ago · +1

    I think the infinite output is due to the chat template.

    • @fahdmirza · 5 months ago

      Yes.

  • @myideaspotxyz5618 · 5 months ago

    Can you share your PC config?

    • @fahdmirza · 5 months ago

      It's already in the video.

  • @pensiveintrovert4318 · 5 months ago

    Doesn't do much of anything for agentic code generation. Looping behavior.

    • @fahdmirza · 5 months ago

      Would have to check.

  • @basilbrush7878 · 5 months ago

    I tried it out in Ollama with /set parameter num_ctx 1024000, and my example produced 70,000 words in the response 😮

    • @testales · 5 months ago

      But did it also remember things from the beginning and the middle?

    • @basilbrush7878 · 5 months ago

      @testales At some stage, it started repeating the same paragraphs.

    • @testales · 5 months ago

      @basilbrush7878 Yeah, I know this behavior. For benchmarking and testing I ask some models to recite lengthy, boring text; for some reason the Communist Manifesto came to mind. :D Anyway, sometimes a model hesitates, telling me it can't do that, but usually it actually can do it very precisely. Then it either stops reciting at a random point or gets stuck in an infinite loop, reciting only the same passages. If I give the instruction to recite only up to, say, chapter 2, that usually gets ignored, which to me is a strong indicator that the instruction is lost after just a few thousand tokens. I've yet to see a model that runs locally and has an actually working context-length "hack" (see the needle-recall sketch after this thread).

    • @fahdmirza · 5 months ago · +1

      Agreed. I think the repetition is more due to the GPU card's limitation.
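
The gradually-increased-length evaluation requested earlier is straightforward to script against the same local API. A rough sketch of a needle-recall loop, under the same assumptions as the sketches above (model tag llama3-gradient, Ollama on its default port; the passphrase and filler file are made up for illustration):

```python
import json
import urllib.request

NEEDLE = "The secret passphrase is BLUE-HERON-42."
QUESTION = "\n\nWhat is the secret passphrase? Reply with the passphrase only."
FILLER = open("long_text.txt").read()  # placeholder: any long plain-text file

def ask(prompt: str, num_ctx: int) -> str:
    """Send a single non-streaming generate request to local Ollama."""
    req = urllib.request.Request(
        "http://localhost:11434/api/generate",
        data=json.dumps({
            "model": "llama3-gradient",  # assumed tag for the 1M-context build
            "prompt": prompt,
            "stream": False,
            "options": {"num_ctx": num_ctx, "temperature": 0},
        }).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]

# Double the haystack each round. Sizes are in characters; ~4 characters
# per token is a crude rule of thumb, so num_ctx gets generous headroom.
for chars in (8_000, 16_000, 32_000, 64_000, 128_000, 256_000):
    haystack = NEEDLE + "\n" + FILLER[:chars]
    answer = ask(haystack + QUESTION, num_ctx=max(chars // 2, 8192))
    print(f"{chars:>7} chars -> needle recalled: {'BLUE-HERON-42' in answer}")
```

If the model degrades the way this thread describes (looping, lost instructions), the recall flag should start flipping to False somewhere past the model's effective context length.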