TinyLlama: The Era of Small Language Models is Here

Artificial Intelligence Video 0003: How to Get Frames Out of a Video Using Python

Create Python Based Trading Strategy

แคสซี่มาช่วยหมูเด้ง แต่... 😱 | Garena Free Fire

ถ้าครูบามีโปเกม่อน #shorts

ผมโดนหมอนี่ตุ๋ยผมเลยให้มันชดใช้ แตกแน่.. | Minecraft #minecraft #มายคราฟ #fyp #minecraftmemes #ตลก

Run Mixtral 8x7B MoE in Google Colab

Prompt Engineering

มุมมอง 9 917

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 14 พ.ย. 2024
วิทยาศาสตร์และเทคโนโลยี

ความคิดเห็น • 31

@snypzzz8702 10 หลายเดือนก่อน ⁺²
i was Dyingggg for this tutorial. thanks mannnn
@gangs0846 9 หลายเดือนก่อน
Thank you my friend. One question. You dont use the following, why? :
The template used to build a prompt for the Instruct model is defined as follows:
[INST] Instruction [/INST] Model answer [INST] Follow-up instruction [/INST]
@publicsectordirect982 10 หลายเดือนก่อน ⁺³
You are my go to guy for anything open source. Thanks for your work bhai 🙏
@engineerprompt 10 หลายเดือนก่อน ⁺¹
Glad it's helpful
@thisurawz 10 หลายเดือนก่อน ⁺⁷
Can you do a video on finetuning a multimodal LLM (Video-LlaMA, LLaVA, or CLIP) with a custom multimodal dataset containing images and texts for relation extraction or a specific task? Can you do it using open-source multimodal LLM and multimodal datasets like video-llama or else so anyone can further their experiments with the help of your tutorial. Can you also talk about how we can boost the performance of the fine-tuned modal using prompt tuning in the same video?
@WesnerSEI 9 หลายเดือนก่อน
This guy is right on! Asking the right questions! Although I don't expect him to answer all of that, you are def. in the right direction!
@adityashinde436 10 หลายเดือนก่อน ⁺²
make a video on fine tuning Mixtral 8x7b and how to use in production
@path1024 10 หลายเดือนก่อน ⁺¹
I have a 2060 Super. Only 4% slower than a 3060, but only 8GB VRAM. I have 64 GB of DDR5 RAM and a 14900K CPU (with an NPU). I bet I could run it in 2 bit, but I never thought I'd go below 4 bit. Frankly I just see 8x7B as being a less efficient version of having several models fine tuned to a specific task. A couple 4 bit 7B models can fit in 8GB VRAM.
@engineerprompt 10 หลายเดือนก่อน ⁺¹
I think for general tasks it might be a good option. If you are working on a specific application, I will also recommend to fine tune a smaller model and use that instead. Will probably be a better option
@path1024 10 หลายเดือนก่อน
@@engineerpromptYeah, the total is smaller than its implied parts, so for a general-purpose model it's probably more efficient. 8 7B models at 16 bit would usually take around 112GB instead of 90.
@jayr7741 10 หลายเดือนก่อน ⁺¹
Please bring some multilingual (Hindi) TTS voice cloning on colab.
@DavidSegura99 10 หลายเดือนก่อน ⁺¹
Thank you this is amazing i will use it for sure!, could make a video using this method with Free Kaggle, since t you can use 2 16gb T4 cards at the same time in the same instance also with 30 GB of RAM, this should run a lot faster, pretty please, also im sure that Free Kaggle tier videos will make you a tons of views for your channel, best of wishes for you and your love ones and happy 2024!
@engineerprompt 10 หลายเดือนก่อน ⁺¹
Thank you for the wishes, happy new year to you too! Kaggle is a great option. I haven't looked at it in a while but will see what I can do. Didn't know that they now offers two GPUs. Will explore that further.
@kunalsoni7681 10 หลายเดือนก่อน ⁺¹
Amazing
@bennguyen1313 8 หลายเดือนก่อน
I imagine it's costly to run LLMs.. is there a limit on how much Google Colab will do for free?
I'm interested in creating a Python application that uses AI.. from what I've read, I could use ChatGPT4 Assistant API and I as the developer would incur the cost whenever the app is used.
Alternatively, I could host a model like Ollama, on my own computer or on the cloud (beam cloud/ Replicate/Streamlit/replit)?
@8888-u6n 10 หลายเดือนก่อน
Grate video, is there a way to upload your own RAG documents to this
@curiouslycory 9 หลายเดือนก่อน
The model can be 30+GB. Not surprising that it takes a while to load.
@geniusxbyofejiroagbaduta8665 10 หลายเดือนก่อน
Thanks
@gangs0846 9 หลายเดือนก่อน
How to let it write several pages text? Eventhough I set the max tokens to 32k and tell him to write 10 pages it still Outputs only 1 page of text
@lostInSocialMedia. 10 หลายเดือนก่อน
can we run uncensored model ?
@engineerprompt 10 หลายเดือนก่อน ⁺¹
I think yes but it needs to be converted into HQQ format
@DezorianGuy 10 หลายเดือนก่อน ⁺¹
Is this better than chatgpt 3.5?
@valm7397 10 หลายเดือนก่อน ⁺¹
yes
@engineerprompt 10 หลายเดือนก่อน ⁺¹
On benchmarks, yes
@alx8439 10 หลายเดือนก่อน
You can run quantized 4bit mixtral literally on any recent computer with 32 gb of RAM without any GPU at all. I don't understand why you need Google Collab here, memory is ultracheap these days
@unkim7085 10 หลายเดือนก่อน ⁺¹
Do you have a reference for a tutorial about how to do it? Thanks
@alx8439 10 หลายเดือนก่อน
@@unkim7085 or do the same in ollama - it just works there
@mavrick23 10 หลายเดือนก่อน
can it work on 8gb ram?
@oliviertorres8001 10 หลายเดือนก่อน ⁺¹
Is there a way to make this model works in oobabooga Text generation WebUI that run in a Google Collab? Thx,

ต่อไป

เล่นอัตโนมัติ

TinyLlama: The Era of Small Language Models is Here

TinyLlama: The Era of Small Language Models is Here

Artificial Intelligence Video 0003: How to Get Frames Out of a Video Using Python

Artificial Intelligence Video 0003: How to Get Frames Out of a Video Using Python

Create Python Based Trading Strategy

Create Python Based Trading Strategy

แคสซี่มาช่วยหมูเด้ง แต่... 😱 | Garena Free Fire

แคสซี่มาช่วยหมูเด้ง แต่... 😱 | Garena Free Fire

ถ้าครูบามีโปเกม่อน #shorts

ถ้าครูบามีโปเกม่อน #shorts

ผมโดนหมอนี่ตุ๋ยผมเลยให้มันชดใช้ แตกแน่.. | Minecraft #minecraft #มายคราฟ #fyp #minecraftmemes #ตลก

ผมโดนหมอนี่ตุ๋ยผมเลยให้มันชดใช้ แตกแน่.. | Minecraft #minecraft #มายคราฟ #fyp #minecraftmemes #ตลก

拿柴房造了一个森林衣帽间~I turned the Woodshed into a Forest-Themed Closet.丨Liziqi Channel

拿柴房造了一个森林衣帽间~I turned the Woodshed into a Forest-Themed Closet.丨Liziqi Channel

SOLAR-10.7B: Merging Models is The Next Big Thing | Beats Mixtral MoE

SOLAR-10.7B: Merging Models is The Next Big Thing | Beats Mixtral MoE

The Best RAG Technique Yet? Anthropic’s Contextual Retrieval Explained!

The Best RAG Technique Yet? Anthropic’s Contextual Retrieval Explained!

O1’s Chain of Thought: I Built a System to Mimic It-Here’s How It Went!

O1’s Chain of Thought: I Built a System to Mimic It—Here’s How It Went!

Exploring the Rise of Small Language Models

Exploring the Rise of Small Language Models

Prompt Engineering: Is it a Skill Worth Learning?

Prompt Engineering: Is it a Skill Worth Learning?

Stop Losing Context! How Late Chunking Can Enhance Your Retrieval Systems

Stop Losing Context! How Late Chunking Can Enhance Your Retrieval Systems

Mistral 8x7B Part 1- So What is a Mixture of Experts Model?

Mistral 8x7B Part 1- So What is a Mixture of Experts Model?

Llama-2 with LocalGPT: Chat with YOUR Documents

Llama-2 with LocalGPT: Chat with YOUR Documents

Fine-Tune Your Own Tiny-Llama on Custom Dataset

Fine-Tune Your Own Tiny-Llama on Custom Dataset

CONFIGURATION💘PERFECTA🔔para✅ SAMSUNG😊 A3,A5,A6,A7,J2,J5,J7,S5,S6,S7,S9,A10,A20,A30,A50,A70 #shorts

CONFIGURATION💘PERFECTA🔔para✅ SAMSUNG😊 A3,A5,A6,A7,J2,J5,J7,S5,S6,S7,S9,A10,A20,A30,A50,A70 #shorts

ใครหน้าจอแตกเอาไปเลย

ใครหน้าจอแตกเอาไปเลย

Mac mini คอมที่คุ้มที่สุด ราคา 20,900 บาท มันดียังไง? #apple #macmini #m4 #ข่าวไอที

Mac mini คอมที่คุ้มที่สุด ราคา 20,900 บาท มันดียังไง? #apple #macmini #m4 #ข่าวไอที

I tested the Craziest Xiaomi Gadgets!

I tested the Craziest Xiaomi Gadgets!

พรีวิว realme GT 7 Pro - Snap 8 Elite รุ่นแรกของโลก & แบต 6,500 mAh รุ่นแรกของ realme 🤯

พรีวิว realme GT 7 Pro - Snap 8 Elite รุ่นแรกของโลก & แบต 6,500 mAh รุ่นแรกของ realme 🤯

CONFIGURATION💘PERFECTA🔔para✅ SAMSUNG😊 A3,A5,A6,A7,J2,J5,J7,S5,S6,S7,S9,A10#4rabetind #livebigagency

CONFIGURATION💘PERFECTA🔔para✅ SAMSUNG😊 A3,A5,A6,A7,J2,J5,J7,S5,S6,S7,S9,A10#4rabetind #livebigagency

World’s smallest 4K headset 😎 Visor.com #tech #vr #technology #virtualreality #insideout2

World’s smallest 4K headset 😎 Visor.com #tech #vr #technology #virtualreality #insideout2