Serving 100s of LLMs on 1 GPU with LoRAX - Travis Addair | Stanford MLSys #84

  • Published on Nov 21, 2024

Comments • 8

  • @voncolborn9437 • 11 months ago +2

    Great presentation. It is interesting to see the practical side of running a bunch of LLMs. Ops makes it happen. Coming from the old, really old, school of computing with massive multi-user, time-share systems, it is interesting to see how no matter how much computing changes, aspects of it remain the same. Throughput, latency, caching, and scheduling are still central. All that seems to have changed is the problem domain. We do, indeed, live in interesting times.

  • @conan_der_barbar • 1 year ago +1

    great talk! still waiting for the open source release 👀

  • @suleimanshehu5839 • 11 months ago

    Please create a video on fine-tuning an MoE LLM such as Mixtral 8x7B using LoRA adapters within your framework
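
    Editor's note: pending such a video, here is a minimal sketch of attaching LoRA adapters to Mixtral 8x7B with the Hugging Face PEFT library. This is not the presenter's framework (LoRAX handles serving, not training); the model ID, rank, and target modules below are illustrative assumptions.

```python
# Hedged sketch: LoRA fine-tuning setup for a Mixtral-style MoE model with PEFT.
# Hyperparameters and target modules are assumptions, not the talk's settings.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base_model = "mistralai/Mixtral-8x7B-v0.1"
tokenizer = AutoTokenizer.from_pretrained(base_model)
model = AutoModelForCausalLM.from_pretrained(
    base_model,
    torch_dtype=torch.bfloat16,
    device_map="auto",  # shard the experts across available GPUs
)

# Adapt only the attention projections; the MoE expert weights stay frozen,
# which keeps the trainable-parameter count small.
lora_config = LoraConfig(
    r=8,
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # typically well under 1% of total weights
```

    The resulting adapter can be saved with `model.save_pretrained(...)` and later loaded at serving time alongside other adapters sharing the same frozen base model, which is the multi-adapter setup the talk describes.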

  • @Gerald-iz7mv • 7 months ago

    Hi, do you have any links to benchmarks one can run to measure latency and throughput for different models and frameworks?
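
    Editor's note: no benchmark links appear in the thread, but a rough probe is easy to sketch. The snippet below assumes a LoRAX-style `/generate` HTTP route (inherited from text-generation-inference); the URL, payload shape, and adapter ID are assumptions to adapt to your deployment. For serious numbers, use a concurrent load generator rather than this sequential loop.

```python
# Hedged sketch: sequential latency/throughput probe against an assumed
# LoRAX-style HTTP endpoint. Adjust URL and payload for your server.
import time
import requests

URL = "http://localhost:8080/generate"  # assumed local deployment
PROMPT = "Explain multi-adapter serving in one sentence."
N_REQUESTS = 20
MAX_NEW_TOKENS = 64

latencies = []
for _ in range(N_REQUESTS):
    start = time.perf_counter()
    resp = requests.post(
        URL,
        json={
            "inputs": PROMPT,
            "parameters": {
                "max_new_tokens": MAX_NEW_TOKENS,
                # "adapter_id": "my-org/my-lora",  # target a specific adapter
            },
        },
        timeout=120,
    )
    resp.raise_for_status()
    latencies.append(time.perf_counter() - start)

total = sum(latencies)
print(f"mean latency: {total / N_REQUESTS:.3f}s")
# Upper bound: assumes every response generated MAX_NEW_TOKENS tokens.
print(f"throughput:   {N_REQUESTS * MAX_NEW_TOKENS / total:.1f} tokens/s")
```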

  • @fastcardlastname3353 • 1 year ago

    This could change the landscape of multi-agent systems if it delivers on its promise.

  • @mohamedfouad1309 • 11 months ago

    GitHub link 😅

  • @nithinrao7191 • 1 year ago

    Second

  • @absbi0000 • 1 year ago

    First